-
Notifications
You must be signed in to change notification settings - Fork 674
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Persisting task on PhaseTimeout #4765
Comments
🤖 Hey @pvditt, good to see you again! Hope you're doing well. Thanks for bringing this to our attention. Based on the information provided, it appears that the task state shows up as "UNKNOWN" instead of "ABORTED" in the Flyte admin when a task fails due to timing out with respect to the node-active-deadline because the phase of the task is not being set to "ABORTED" in the code when the task times out. In the provided code, there are different phases defined for a task such as However, when a task times out, the phase of the task is not being set to To fix this issue, you would need to update the code to set the phase of the task to Please note that without the complete context or the rest of the codebase, this is the best assumption that can be made based on the provided code. You can refer to the flyteplugins/go/tasks/pluginmachinery/core/phase.go for more details on the different phases of a task. If you need further clarification or if there's anything else I can assist you with, please let me know. SourcesThis response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. If you want to continue the conversation, start your reply with @dosu-bot. |
1 similar comment
🤖 Hey @pvditt, good to see you again! Hope you're doing well. Thanks for bringing this to our attention. Based on the information provided, it appears that the task state shows up as "UNKNOWN" instead of "ABORTED" in the Flyte admin when a task fails due to timing out with respect to the node-active-deadline because the phase of the task is not being set to "ABORTED" in the code when the task times out. In the provided code, there are different phases defined for a task such as However, when a task times out, the phase of the task is not being set to To fix this issue, you would need to update the code to set the phase of the task to Please note that without the complete context or the rest of the codebase, this is the best assumption that can be made based on the provided code. You can refer to the flyteplugins/go/tasks/pluginmachinery/core/phase.go for more details on the different phases of a task. If you need further clarification or if there's anything else I can assist you with, please let me know. SourcesThis response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. If you want to continue the conversation, start your reply with @dosu-bot. |
this issue should be closed by #4766 |
Describe the bug
When a task fails due to timing out with respect to the node-active-deadline, the node is set to NodePhaseTimingOut -> node is aborted -> NodePhaseTimedOut which is a terminal phase.
Since the abort is not set as a final transition, the state does not get persisted to admin causing for the task state to show up as "UNKNOWN" instead of "ABORTED"
Expected behavior
When a task is aborted, the aborted state should get persisted to admin.
Additional context to reproduce
Set node-active-deadline to small value in Flyte config:
propeller:
node-config:
default-deadlines:
node-active-deadline: 10s
run workflow that runs longer than that deadline + don't set an execution timeout:
Screenshots
Are you sure this issue hasn't been raised already?
Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: