Hello community,
We have a problem: long-running tasks are getting evicted. At least, that is our assumption based on the error code: "Error: exit status 143".
This is accompanied by a warning in the argo-workflow wait container: "Non-transient error: context canceled"
We are guessing that this is not a Python error but rather the instance shutting down.
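For context on why we read it that way: by the usual shell/container convention, an exit status above 128 means the process was killed by signal number (status - 128), so 143 would be 128 + 15, i.e. SIGTERM, which is what a pod normally receives when it is evicted or its node is drained. Below is a minimal Python sketch of that decoding; the helper name `describe_exit_status` is our own (hypothetical, not part of Argo), and it assumes the 128 + signal convention holds for the container runtime in use.

```python
import signal


def describe_exit_status(status: int) -> str:
    """Decode an exit status using the common '128 + signal number' convention.

    Hypothetical helper for illustration only; not an Argo Workflows API.
    """
    if status > 128:
        signum = status - 128
        try:
            # signal.Signals maps a signal number to its symbolic name.
            return f"terminated by {signal.Signals(signum).name}"
        except ValueError:
            return f"terminated by unknown signal {signum}"
    return f"exited with code {status}"


if __name__ == "__main__":
    print(describe_exit_status(143))
```

If that reading is right, 143 points at an external termination rather than a failure raised by our own code, which is the distinction we would like the retry logic to make.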
Now the problem is that we have two different cases:
1. Testing, or actual errors in the code or with the data
2. The dubious error 143
We do not want to retry errors that we need to fix first before running them again. Therefore my question: how can we retry only workflows that fail with this error? As far as we could find, there is no "OnNonTransientError".