ELPA 2023.05.001.rc1

marekandreas released this 22 May 09:45
· 843 commits to master since this release
  • added CITATION.cff file
  • allow test programs to be run with 1 MPI task
  • correct a memory leak in the GPU stream setup
  • better handling of GPU BLAS handles
  • implement the execution of the AMD HIP code path on NVIDIA GPUs
  • implement the execution of the SYCL GPU code path on CPUs (debugging)
  • port generalized routines to SYCL GPU
  • proof of concept (PoC) for using NVIDIA NCCL instead of MPI (not production ready)
  • some cleanup of the documentation