@Issacwww
Contributor

Issue #, if available:

Description of changes:
MPI5 runs into the issue below on AL2 and AL23:

[multi-node-nccl-test-launcher:00001] PMIX ERROR: PMIX_ERR_BAD_PARAM in file gds_utils.c at line 308
[multi-node-nccl-test-launcher:00001] PMIX ERROR: PMIX_ERR_BAD_PARAM in file gds_hash.c at line 496
[multi-node-nccl-test-launcher:00001] PRTE ERROR: Bad parameter in file base/odls_base_default_fns.c at line 768

MPI4 is recommended instead; see https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/efa-start-nccl.html
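
For anyone triaging similar launcher logs, here is a minimal sketch of how the error lines quoted above could be picked out of a log. It is not part of this change; the function name `parse_error_line` and the regular expression are hypothetical, and the pattern is inferred only from the three lines in this description, so other PMIx/PRTE messages may not match it.

```python
import re

# Pattern inferred from the three log lines in this PR description (an assumption,
# not a documented PMIx/PRTE log format).
ERROR_LINE = re.compile(
    r"^\[(?P<host>[^:\]]+):(?P<pid>\d+)\] "
    r"(?P<layer>PMIX|PRTE) ERROR: (?P<message>.+) "
    r"in file (?P<file>\S+) at line (?P<line>\d+)$"
)


def parse_error_line(text):
    """Return a dict describing one error line, or None if it does not match."""
    match = ERROR_LINE.match(text.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["line"] = int(fields["line"])  # source line number as an integer
    return fields


if __name__ == "__main__":
    sample = (
        "[multi-node-nccl-test-launcher:00001] PMIX ERROR: "
        "PMIX_ERR_BAD_PARAM in file gds_utils.c at line 308"
    )
    print(parse_error_line(sample))
```

Lines that do not follow this shape return `None`, so the helper can be run over a whole log without filtering first.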
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

Contributor

@ndbaker1 left a comment

lgtm when ready

@Issacwww Issacwww merged commit 17bea29 into aws:main Oct 12, 2024
5 checks passed