JDK-8150313

aarch64: optimise array copy using SIMD instructions

    Details

    • Type: Enhancement
    • Status: Resolved
    • Priority: P4
    • Resolution: Fixed
    • Affects Version/s: 9
    • Fix Version/s: 9
    • Component/s: hotspot
    • Labels: None
    • Subcomponent:
    • Resolved In Build: b112
    • CPU: aarch64
    • OS: linux

      Description

      This uses SIMD ldp/stp Qx, Qy instructions instead of scalar ldp/stp instructions, thereby loading/storing 32 bytes at a time instead of 16.

      It also extends the small-copy code to handle 0-96 bytes instead of 0-80 bytes (because 80 is not divisible by 32).
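The arithmetic behind the 80-to-96 change can be illustrated with a small Java sketch. It only counts how many 16-byte or 32-byte load-pair steps cover a given size; it is not the actual stub code, and the class and method names (CopyChunkMath, ldpCount, isWholeMultiple) are hypothetical names chosen for this illustration.

```java
public class CopyChunkMath {
    // Bytes moved by one scalar ldp/stp (two 64-bit X registers).
    static final int SCALAR_PAIR_BYTES = 16;
    // Bytes moved by one SIMD ldp/stp (two 128-bit Q registers).
    static final int SIMD_PAIR_BYTES = 32;

    // Number of load-pair steps needed to cover `bytes`, rounding up.
    static int ldpCount(int bytes, int bytesPerPair) {
        return (bytes + bytesPerPair - 1) / bytesPerPair;
    }

    // True when `limit` is an exact multiple of the pair width.
    static boolean isWholeMultiple(int limit, int bytesPerPair) {
        return limit % bytesPerPair == 0;
    }

    public static void main(String[] args) {
        // Compare the old (80) and new (96) small-copy limits.
        for (int limit : new int[] {80, 96}) {
            System.out.printf(
                "limit=%d scalar=%d simd=%d wholeSimdMultiple=%b%n",
                limit,
                ldpCount(limit, SCALAR_PAIR_BYTES),
                ldpCount(limit, SIMD_PAIR_BYTES),
                isWholeMultiple(limit, SIMD_PAIR_BYTES));
        }
    }
}
```

With 16-byte pairs, 80 bytes is exactly 5 steps. With 32-byte pairs, 80 bytes would leave a half-used third step, whereas 96 bytes is exactly 3 steps.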

      This improves performance on some micro-arches and not on others, so I have provided a -XX:+UseSIMDForMemoryOps switch, which defaults to false. We could look at enabling it by default for micro-arches where we know SIMD is better.
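One rough way to see whether the switch helps on a given machine is to time System.arraycopy with the flag on and off. The following is a minimal sketch, not a rigorous benchmark (a harness such as JMH would give more trustworthy numbers). The class name ArrayCopyTiming, the sizes and the iteration counts are arbitrary choices made for this illustration.

```java
import java.util.Arrays;

public class ArrayCopyTiming {
    // Copy `src` into a fresh array via System.arraycopy (used as a sanity check).
    static byte[] copyOf(byte[] src) {
        byte[] dst = new byte[src.length];
        System.arraycopy(src, 0, dst, 0, src.length);
        return dst;
    }

    // Time `iterations` copies of a `length`-byte array; returns elapsed nanoseconds.
    static long timeCopies(int length, int iterations) {
        byte[] src = new byte[length];
        Arrays.fill(src, (byte) 1);
        byte[] dst = new byte[length];
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            System.arraycopy(src, 0, dst, 0, length);
        }
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        int[] sizes = {64, 96, 1024, 65536}; // arbitrary small and large sizes
        // Warm-up pass so the JIT has compiled the loop before measuring.
        for (int size : sizes) {
            timeCopies(size, 200_000);
        }
        for (int size : sizes) {
            long ns = timeCopies(size, 1_000_000);
            System.out.printf("size=%d bytes: %d ns total%n", size, ns);
        }
    }
}
```

On an aarch64 build containing this change, the two configurations would be compared by running `java -XX:+UseSIMDForMemoryOps ArrayCopyTiming` and `java -XX:-UseSIMDForMemoryOps ArrayCopyTiming`. JVMs for other architectures will most likely reject the option as unrecognized.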


            People

            • Assignee: Ed Nevill (enevill)
            • Reporter: Ed Nevill (enevill)
