-
Notifications
You must be signed in to change notification settings - Fork 617
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using TasksEqualStable in dispatcher breaks global services #1291
Comments
For 1.12.1 we could work around this by treating a transition to ASSIGNED as a modification, but ignore other status changes. This seems very hacky, though. Perhaps the scheduler should be advancing the desired state of the task, not just its observed state. Global tasks would start out with |
I like this idea. What should the start out DesiredState, |
It should start as The downside to this is that previously only the orchestrator controlled desired state. Having the scheduler change it may not be a good idea, because it could fight with the orchestrator. For example, if the orchestrator decides it doesn't want that task anymore, it would set the desired state to |
How about this?
|
I like that design but I think doing batching of the desired state change might make the code complex. WDYT? |
Global orchestrator has not done much batching today. |
@aaronlehmann Since the change is temporary, should we reopen (and rename) this issue for 1.13? We may still go this way but should make the decision along with other changes. |
Sure, feel free to file a new issue that references this one. I think it should be a different issue since it's more about "find a better solution" than "fix this bug". |
Global service tasks are held at the scheduler until the scheduler determines that the node has the required resources. When it confirms this, it sets the task state (NOT desired state) to "assigned".
Because of recent changes to the dispatcher, this state change is no longer a sufficient reason to send an assignment set update to the agent. So until something else triggers an assignment set update, the task can be stuck in ASSIGNED.
This is a race that doesn't happen as much before #1287, but after #1287, it nearly always happens. I can still trigger it sometimes without #1287, though.
cc @dongluochen @aluzzardi
The text was updated successfully, but these errors were encountered: