Skip to content

Slate+rocm test fails on crusher #59

@wspear

Description

@wspear

@G-Ragghianti @mgates3

The slate standalone test defined here: https://github.com/E4S-Project/testsuite/tree/master/validation_tests/slate-rocm fails when run on the slate build installed as part of the e4s 22.11 deployment on crusher using these variants with the console output below:

-- linux-sles15-zen3 / gcc@11.2.0 -------------------------------
edojdwe slate@2022.07.00~cuda~ipo+mpi+openmp+rocm+shared amdgpu_target=gfx90a build_system=cmake build_type=RelWithDebInfo
7ej4aoh     blaspp@2022.07.00~cuda~ipo+openmp+rocm+shared amdgpu_target=gfx90a build_system=cmake build_type=RelWithDebInfo
c6gpjyk         cmake@3.24.2~doc+ncurses+ownlibs~qt build_system=generic build_type=Release
igbrz2c             ncurses@6.3~symlinks+termlib abi=none build_system=autotools
savxweu                 pkgconf@1.8.0 build_system=autotools
kq7i44v             openssl@1.1.1s~docs~shared build_system=generic certs=mozilla
6ki4n47                 ca-certificates-mozilla@2022-10-11 build_system=generic
ucjrwtm                 perl@5.36.0+cpanm+shared+threads build_system=generic
gqdvawb                     berkeley-db@18.1.40+cxx~docs+stl build_system=autotools patches=26090f4,b231fcc
g2bpsoz                     bzip2@1.0.8~debug~pic+shared build_system=generic
rnafwos                         diffutils@3.8 build_system=autotools
xfogkcu                             libiconv@1.16 build_system=autotools libs=shared,static
otqsxvg                     gdbm@1.23 build_system=autotools
6mvf2em                         readline@8.1.2 build_system=autotools
76b2zrq                     zlib@1.2.13+optimize+pic+shared build_system=makefile
bzm57qy         hip@5.2.0~ipo build_system=cmake build_type=Release patches=959d1fe
e5ldtkh         hsa-rocr-dev@5.2.0+image~ipo+shared build_system=cmake build_type=Release patches=71e6851
mm6mnhr         llvm-amdgpu@5.2.0~ipo~link_llvm_dylib~llvm_dylib~openmp+rocm-device-libs build_system=cmake build_type=Release patches=a08bbe1
bgpvt5g         openblas@0.3.21~bignuma~consistent_fpcsr+fortran~ilp64+locking+pic+shared build_system=makefile patches=d3d9b15 symbol_suffix=none threads=openmp
g2sf37k         rocblas@5.2.0~ipo+tensile amdgpu_target=auto build_system=cmake build_type=Release patches=81591d9
oaykapp     cray-mpich@8.1.17+wrappers build_system=generic
izppu2z     lapackpp@2022.07.00~cuda~ipo+rocm+shared amdgpu_target=gfx90a build_system=cmake build_type=RelWithDebInfo
orsl6og         rocsolver@5.2.0~ipo+optimal amdgpu_target=auto build_system=cmake build_type=Release
slate+rocm %gcc: edojdwe
terminate called after throwing an instance of 'std::out_of_range'
  what():  map::at
terminate called after throwing an instance of 'std::out_of_range'
  what():  map::at
terminate called after throwing an instance of 'std::out_of_range'
  what():  map::at
srun: error: crusher124: tasks 1-3: Aborted
srun: launch/slurm: _step_signal: Terminating StepId=230307.0
slurmstepd: error: *** STEP 230307.0 ON crusher124 CANCELLED AT 2022-12-14T18:10:38 ***
srun: error: crusher124: task 0: Terminated
srun: Force Terminated StepId=230307.0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions