feat(refactor): improve usability #1299
Triggered via pull request
February 13, 2025 09:37
Status
Cancelled
Total duration
30m 39s
Artifacts
–
e2e_test.yaml
on: pull_request
training_4GPU
5m 33s
training_8GPU_ISP
0s
training_8GPU_ISP_CKPT
0s
training_8GPU_4DP2PP_ZB
0s
Matrix: training_16GPU_4DP2TP2PP_FSP
Matrix: training_16GPU_4DP2TP2PP_MSP
Matrix: training_16GPU_4DP2TP2PP_MTP
Matrix: training_8GPU_4DP2PP
Matrix: training_8GPU_4DP2TP
Matrix: training_8GPU_4DP2TPSP
Matrix: training_llama2
Annotations
12 errors and 4 warnings
training_16GPU_4DP2TP2PP_FSP (t_cluster)
Process completed with exit code 143.
training_16GPU_4DP2TP2PP_MSP (t_cluster)
Process completed with exit code 143.
training_16GPU_4DP2TP2PP_MTP (t_cluster)
Process completed with exit code 143.
training_8GPU_ISP
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_8GPU_ISP_CKPT
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_8GPU_4DP2TP (t_cluster)
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_8GPU_4DP2TPSP (t_cluster)
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_8GPU_4DP2PP_ZB
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_llama2 (t_cluster)
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_8GPU_4DP2PP (t_cluster)
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_4GPU
Canceling since a higher priority waiting request for 'e2e-tests-410' exists
training_4GPU
The operation was canceled.
training_16GPU_4DP2TP2PP_FSP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
training_16GPU_4DP2TP2PP_MSP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
training_16GPU_4DP2TP2PP_MTP (t_cluster)
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
training_4GPU
You are running out of disk space. The runner will stop working when the machine runs out of disk space. Free space left: 15 MB
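The exit code 143 reported for the three 16-GPU jobs follows the common POSIX shell convention of 128 plus a signal number, so it most likely corresponds to SIGTERM (signal 15). That would be consistent with the jobs being terminated when the run was cancelled rather than failing on their own. A minimal sketch of that decoding, where `describe_exit_code` is a hypothetical helper and not part of the workflow:

```python
import signal


def describe_exit_code(code: int) -> str:
    """Interpret a shell exit status (hypothetical helper for illustration).

    Assumes the usual shell convention that a status above 128
    means the process was ended by signal number (status - 128).
    """
    if code == 0:
        return "success"
    if code > 128:
        signum = code - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # Signal number not known on this platform.
            name = f"signal {signum}"
        return f"terminated by {name}"
    return f"failed with status {code}"


if __name__ == "__main__":
    print(describe_exit_code(143))
```

The convention is a property of the shell that reports the status, so this reading is an inference from the annotation text, not something the run summary states directly.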