COSMA-v2.6.0 #114
kabicm
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This release enables COSMA to take advantage of fast GPU-to-GPU interconnects like NVLink, to efficiently utilize modern Multi-GPU Systems. This is achieved in 2 ways:
NCCL/RCCL
Libraries: by specifying-DCOSMA_WITH_NCCL=ON
cmake option.-DCOSMA_WITH_GPU_AWARE_MPI=ON
cmake option, as proposed here.See README and INSTALL for more info on how to build.
In addition, the following performance improvemets have been made:
all-gather
andreduce-scatter
collectives are improved.MPI_Reduce_scatterv
which we previously used. We padded all the blocks to be able to overcome this issue.This discussion was created from the release COSMA-v2.6.0.
Beta Was this translation helpful? Give feedback.
All reactions