-
Hello,
and using a smaller model, because I noticed in the NeMo code: Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi, MPI is used by default to bootstrap the user buffers (see the TransformerEngine documentation here). However, NCCL bootstrap should also be supported now. You can try setting |
Beta Was this translation helpful? Give feedback.
Hi, MPI is used by default to bootstrap the user buffers (see the TransformerEngine documentation here). However, NCCL bootstrap should also be supported now. You can try setting
tp_comm_bootstrap_backend="nccl"
here.