Skip to content

Conversation

@cartermckinnon
Copy link
Contributor

Description of changes:

Timeouts during Setup can be caused by a failure in one of:

  1. mpi-operator
  2. nvidia-device-plugin
  3. efa-device-plugin

This adds some logging to narrow things down.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@Issacwww Issacwww merged commit 9bc9c51 into main Sep 9, 2024
5 checks passed
@Issacwww Issacwww deleted the env-setup-logging branch September 9, 2024 21:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants