Uploaded image for project: 'JDK'
  1. JDK
  2. JDK-8277453

SIGSEGV at CopySwap::do_conjoint_swap | OpenJDK 8u302

    XMLWordPrintable

    Details

    • Subcomponent:
    • CPU:
      x86_64
    • OS:
      linux

      Description

      ADDITIONAL SYSTEM INFORMATION :
      Amazon Linux 2

      # JRE version: OpenJDK Runtime Environment (8.0_302-b08) (build 1.8.0_302-debug-b08)
      # Java VM: OpenJDK 64-Bit Server VM (25.302-b08-debug mixed mode linux-amd64 compressed oops)

      A DESCRIPTION OF THE PROBLEM :
      During the execution of a Apache Spark job, running over an Hadoop Yarn Cluster in Amazon's AWS EMR Cluster System, the jobs are aborted because of java processes aborting with a SIGSEGV.

      A fatal error message is written when crash happens.
      ------------------------------------------------------------------------------------------------------------
      #
      # A fatal error has been detected by the Java Runtime Environment:
      #
      # SIGSEGV (0xb) at pc=0x00007f001809021a, pid=2087, tid=0x00007effe181c700
      #
      # JRE version: OpenJDK Runtime Environment (8.0_302-b08) (build 1.8.0_302-debug-b08)
      # Java VM: OpenJDK 64-Bit Server VM (25.302-b08-debug mixed mode linux-amd64 compressed oops)
      # Problematic frame:
      # V [libjvm.so+0x6a021a] void CopySwap::do_conjoint_swap<unsigned int, (CopySwap::CopyDirection)0, true, true>(unsigned char*, unsigned char*, unsigned long)+0x4e
      #
      # Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
      #
      # An error report file with more information is saved as:
      # /mnt2/yarn/usercache/hadoop/appcache/application_1636976870674_0010/container_1636976870674_0010_01_000002/hs_err_pid2087.log
      #
      # If you would like to submit a bug report, please visit:
      # https://forums.aws.amazon.com/
      #
      Current thread is 139637465138944
      Dumping core ...
      ```
      ---------------------------------------------------------------------------------------------------------------------
      Running the java process in a debugger the stacktrace when the abort happens is as follow:

      ----------------------------------------------------------------------------------------------------
      (gdb) bt
      #0 0x00007f001809021a in CopySwap::do_conjoint_swap<unsigned int, (CopySwap::CopyDirection)0, true, true> (src=0x7f1af93f9e3c <error: Cannot access memory at address 0x7f1af93f9e3c>,
          dst=0x783f9b418 "", byte_count=12) at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/utilities/copy.cpp:147
      #1 0x00007f001808fc21 in CopySwap::do_conjoint_swap<unsigned int, (CopySwap::CopyDirection)0> (src=0x7f1af93f9e3c <error: Cannot access memory at address 0x7f1af93f9e3c>, dst=0x783f9b418 "",
          byte_count=12) at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/utilities/copy.cpp:187
      #2 0x00007f001808f968 in CopySwap::do_conjoint_swap<(CopySwap::CopyDirection)0> (src=0x7f1af93f9e3c <error: Cannot access memory at address 0x7f1af93f9e3c>, dst=0x783f9b418 "", byte_count=12,
          elem_size=4) at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/utilities/copy.cpp:215
      #3 0x00007f001808f7ea in CopySwap::conjoint_swap (src=0x7f1af93f9e3c <error: Cannot access memory at address 0x7f1af93f9e3c>, dst=0x783f9b418 "", byte_count=12, elem_size=4)
          at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/utilities/copy.cpp:77
      #4 0x00007f001808f3fc in Copy::conjoint_swap (src=0x7f1af93f9e3c <error: Cannot access memory at address 0x7f1af93f9e3c>, dst=0x783f9b418 "", byte_count=12, elem_size=4)
          at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/utilities/copy.cpp:223
      #5 0x00007f00183bd7cb in JVM_CopySwapMemory (env=0x56439bbaab30, srcObj=0x0, srcOffset=139753827573308, dstObj=0x7effe181a090, dstOffset=16, size=12, elemSize=4)
          at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/hotspot/src/share/vm/prims/jvm.cpp:858
      #6 0x00007f0016ad5f49 in Java_java_nio_Bits_copySwapMemory0 (env=0x56439bbaab30, cls=0x7effe181a050, srcObj=0x0, srcOffset=139753827573308, dstObj=0x7effe181a090, dstOffset=16, size=12, elemSize=4)
          at /usr/src/debug/java-1.8.0-openjdk-1.8.0.302.b08-0.amzn2.0.1.x86_64/openjdk/jdk/src/share/native/java/nio/Bits.c:33


      ----------------------------------------------------------------------------------------------

      Unfortunately, it seems that the hs_err_pid2087.log file is not generated in this system, but I've saved the core file for further analysis.



      FREQUENCY : always


        Attachments

          Activity

            People

            Assignee:
            pnarayanaswa Praveen Narayanaswamy
            Reporter:
            webbuggrp Webbug Group
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved: