Skip to content

feat(pipeline): Zero Bubble V Shape Memory Efficient Editon #1132

feat(pipeline): Zero Bubble V Shape Memory Efficient Editon

feat(pipeline): Zero Bubble V Shape Memory Efficient Editon #1132

Triggered via pull request October 28, 2024 02:01
Status Failure
Total duration 1h 25m 16s
Artifacts

e2e_test.yaml

on: pull_request
training_4GPU
25s
training_4GPU
training_8GPU_ISP
28s
training_8GPU_ISP
training_8GPU_ISP_CKPT
27s
training_8GPU_ISP_CKPT
training_8GPU_4DP2PP_ZB
27s
training_8GPU_4DP2PP_ZB
Matrix: training_16GPU_4DP2TP2PP_FSP
Matrix: training_16GPU_4DP2TP2PP_MSP
Matrix: training_16GPU_4DP2TP2PP_MTP
Matrix: training_8GPU_4DP2PP
Matrix: training_8GPU_4DP2TP
Matrix: training_8GPU_4DP2TPSP
Matrix: training_internlm2
Matrix: training_llama2
Fit to window
Zoom out
Zoom in

Annotations

25 errors and 12 warnings
training_16GPU_4DP2TP2PP_FSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
training_16GPU_4DP2TP2PP_FSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
training_16GPU_4DP2TP2PP_FSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
training_16GPU_4DP2TP2PP_FSP (910B)
The process '/usr/local/bin/git' failed with exit code 128
training_4GPU
Process completed with exit code 143.
training_8GPU_4DP2PP_ZB
Process completed with exit code 2.
training_8GPU_ISP_CKPT
Process completed with exit code 143.
training_8GPU_ISP
Process completed with exit code 2.
training_16GPU_4DP2TP2PP_MSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
training_16GPU_4DP2TP2PP_MSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
training_16GPU_4DP2TP2PP_MSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
training_16GPU_4DP2TP2PP_MSP (910B)
The process '/usr/local/bin/git' failed with exit code 128
training_16GPU_4DP2TP2PP_MTP (910B)
Process completed with exit code 1.
training_8GPU_4DP2PP (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 15 minutes.
training_8GPU_4DP2PP (910B)
The operation was canceled.
training_8GPU_4DP2TP (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 15 minutes.
training_8GPU_4DP2TP (910B)
The operation was canceled.
training_8GPU_4DP2TPSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
training_8GPU_4DP2TPSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
training_8GPU_4DP2TPSP (910B)
unable to access 'https://github.com/InternLM/InternEvo/': Failed to connect to github.com port 443: Connection timed out
training_8GPU_4DP2TPSP (910B)
The process '/usr/local/bin/git' failed with exit code 128
training_internlm2 (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 20 minutes.
training_internlm2 (910B)
The operation was canceled.
training_llama2 (910B)
The job running on runner evo_910B has exceeded the maximum execution time of 20 minutes.
training_llama2 (910B)
The operation was canceled.
training_16GPU_4DP2TP2PP_FSP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_4GPU
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_4GPU
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
training_8GPU_4DP2PP_ZB
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_8GPU_4DP2PP_ZB
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
training_8GPU_ISP_CKPT
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_8GPU_ISP_CKPT
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
training_8GPU_ISP
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_8GPU_ISP
The Actions runner will no longer support your OS version on November 1, 2024. Please upgrade to a supported version. For information, refer https://github.blog/changelog/2024-08-19-notice-of-upcoming-deprecations-and-breaking-changes-in-github-actions-runners/
training_16GPU_4DP2TP2PP_MSP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_16GPU_4DP2TP2PP_MTP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
training_8GPU_4DP2TPSP (910B)
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: actions/checkout@v3. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.