WIP: layered/cost: move refilling budgets into dispatch #870

JakeHillion · 2024-10-31T15:58:35Z

Currently budgets are refreshed in layered_stopping as part of calling
record_cpu_cost. This attempts to bill for the usage that has already
happened, and if that takes the budget negative it refreshes from the global
budgets, refreshing them if needed to stay above 0. This means that the budget
for the previously scheduled layer will always be >=0, though not necessarily
at least one time slice, potentially making it impossible to schedule that
layer in the future (particularly bad for confined layers).

While billing in stopping makes sense for accurate attribution, refreshing
here might not. To stay fair we should only refresh our budgets when there is
nothing capable of running without refreshing the budgets.

This changes alters the logic to run in dispatch. We first attempt a full set
of dispatch loops with the existing per-CPU budgets. After this fails (there
was nothing this CPU could run within the constraints of the local budgets), we
query the global budgets and refill local budgets as best we can. If we still
can't schedule anything, we refill the global budgets and try again.

The new flow in dispatch is as follows:

Attempt dispatching with current local budgets.
Refill local budgets from global budgets without refreshing them, and attempt
dispatching with these local budgets.
Refresh global budgets, refilling local budgets to capacity at the same time,
and attempt dispatching with these local budgets.

This should defer to the local budgets in the common case, drain from the
global budgets wherever that achieves forward progress, and refill the global
budgets only when necessary (forward process cannot be made on this CPU). We
may benefit from a spin-lock on refreshing the global budgets to prevent
multiple CPUs doing it at the same time.

Test plan:

TBD

Currently budgets are refreshed in `layered_stopping` as part of calling `record_cpu_cost`. This attempts to bill for the usage that has already happened, and if that takes the budget negative it refreshes from the global budgets, refreshing them if needed to stay above 0. This means that the budget for the previously scheduled layer will always be >=0, though not necessarily at least one time slice, potentially making it impossible to schedule that layer in the future (particularly bad for confined layers). While billing in `stopping` makes sense for accurate attribution, refreshing here might not. To stay fair we should only refresh our budgets when there is nothing capable of running without refreshing the budgets. This changes alters the logic to run in `dispatch`. We first attempt a full set of dispatch loops with the existing per-CPU budgets. After this fails (there was nothing this CPU could run within the constraints of the local budgets), we query the global budgets and refill local budgets as best we can. If we still can't schedule anything, we refill the global budgets and try again. The new flow in `dispatch` is as follows: - Attempt dispatching with current local budgets. - Refill local budgets from global budgets without refreshing them, and attempt dispatching with these local budgets. - Refresh global budgets, refilling local budgets to capacity at the same time, and attempt dispatching with these local budgets. This should defer to the local budgets in the common case, drain from the global budgets wherever that achieves forward progress, and refill the global budgets only when necessary (forward process cannot be made on this CPU). We may benefit from a spin-lock on refreshing the global budgets to prevent multiple CPUs doing it at the same time. Test plan: - TBD

JakeHillion mentioned this pull request Nov 1, 2024

scx_layered: Add additional drain to fallback DSQs #874

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: layered/cost: move refilling budgets into dispatch #870

WIP: layered/cost: move refilling budgets into dispatch #870

JakeHillion commented Oct 31, 2024

WIP: layered/cost: move refilling budgets into dispatch #870

Are you sure you want to change the base?

WIP: layered/cost: move refilling budgets into dispatch #870

Conversation

JakeHillion commented Oct 31, 2024