Feat/refactor process group #1137
Triggered via pull request
October 28, 2024 07:52
Status
Failure
Total duration
3h 40m 55s
Artifacts
–
e2e_test.yaml
on: pull_request
training_4GPU
40s
training_8GPU_ISP
35s
training_8GPU_ISP_CKPT
36s
training_8GPU_4DP2PP_ZB
41s
Matrix: training_16GPU_4DP2TP2PP_FSP
Matrix: training_16GPU_4DP2TP2PP_MSP
Matrix: training_16GPU_4DP2TP2PP_MTP
Matrix: training_8GPU_4DP2PP
Matrix: training_8GPU_4DP2TP
Matrix: training_8GPU_4DP2TPSP
Matrix: training_internlm2
Matrix: training_llama2
Annotations
18 errors and 14 warnings
training_16GPU_4DP2TP2PP_FSP (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 15 minutes.
|
training_16GPU_4DP2TP2PP_FSP (910B)
The operation was canceled.
|
training_16GPU_4DP2TP2PP_MSP (910B)
Process completed with exit code 1.
|
training_16GPU_4DP2TP2PP_MTP (910B)
Process completed with exit code 1.
|
training_8GPU_4DP2PP (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 15 minutes.
|
training_8GPU_4DP2PP (910B)
The operation was canceled.
|
training_8GPU_4DP2TP (910B)
RPC failed; curl 56 GnuTLS recv error (-110): The TLS connection was non-properly terminated.
|
training_8GPU_4DP2TP (910B)
expected 'acknowledgments'
|
training_8GPU_4DP2TP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
|
training_8GPU_4DP2TP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
|
training_8GPU_4DP2TP (910B)
The process '/usr/local/bin/git' failed with exit code 128
|
training_8GPU_4DP2TPSP (910B)
Process completed with exit code 1.
|
training_internlm2 (910B)
Process completed with exit code 1.
|
training_llama2 (910B)
Process completed with exit code 1.
|
training_4GPU
Process completed with exit code 143.
|
training_8GPU_4DP2PP_ZB
Process completed with exit code 143.
|
training_8GPU_ISP
Process completed with exit code 143.
|
training_8GPU_ISP_CKPT
Process completed with exit code 143.
|
training_16GPU_4DP2TP2PP_MSP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_16GPU_4DP2TP2PP_MTP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_8GPU_4DP2TP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_8GPU_4DP2TPSP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_internlm2 (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_llama2 (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_4GPU
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_4GPU
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
|
training_8GPU_4DP2PP_ZB
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_8GPU_4DP2PP_ZB
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
|
training_8GPU_ISP
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_8GPU_ISP
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
|
training_8GPU_ISP_CKPT
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|
training_8GPU_ISP_CKPT
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
|