scx_rustland: introduce fifo mode #305

arighi · 2024-05-21T16:00:56Z

This is a v2 of the FIFO mode feature, more tested and tuned, based on additional experimental results (specifically to determine the optimal USERSCHED_TIMER_NS and the conditions to automatically enter and exit to/from FIFO mode).

The idea is to give an option to automatically transition to a FIFO scheduler when the system is underutilized and switch to the user-space scheduler only when the system is over commissioned.

This allows to maximize performance during regular system use, for example gaming without additional stress tests running, while also ensuring responsiveness if a CPU-intensive workload is suddenly started.

FIFO mode can lead to less predictable performance (due to the potential transitions between the scheduling policies), therefore it is provided as an optional feature that can be disabled when performance predictability is crucial, such as in real-time audio applications or during live streaming.

Simplify the CPU idle selection logic relying on the built-in logic. If something can be improved in this logic it should be done in the backend, changing the default idle selection logic, rustland doesn't need to do anything special here for now. Signed-off-by: Andrea Righi <[email protected]>

Provide a knob in scx_rustland_core to automatically turn the scheduler into a simple FIFO when the system is underutilized. This choice is based on the assumption that, in the case of system underutilization (less tasks running than the amount of available CPUs), the best scheduling policy is FIFO. With this option enabled the scheduler starts in FIFO mode. If most of the CPUs are busy (nr_running >= num_cpus - 1), the scheduler immediately exits from FIFO mode and starts to apply the logic implemented by the user-space component. Then the scheduler can switch back to FIFO if there are no tasks waiting to be scheduled (evaluated using a moving average). This option can be enabled/disabled by the user-space scheduler using the fifo_sched parameter in BpfScheduler: if set, the BPF component will periodically check for system utilization and switch back and forth to FIFO mode based on that. This allows to improve performance of workloads that are using a small amount of the available CPUs in the system, while still maintaining the same good level of performance for interactive tasks when the system is over commissioned. In certain video games, such as Baldur's Gate 3 or Counter-Strike 2, running in "normal" system conditions, we can experience a boost in fps of approximately 4-8% with this change applied. Signed-off-by: Andrea Righi <[email protected]>

Do not always assign the maximum time slice to interactive tasks, but use the same value of the dynamic time slice for everyone. This seems to prevent potential audio cracking when the system is over commissioned. Signed-off-by: Andrea Righi <[email protected]>

The shared DSQ is typically used to prioritize tasks and dispatch them on the first CPU available, so consume from the shared DSQ before the local CPU DSQ. Signed-off-by: Andrea Righi <[email protected]>

Dispatch non-interactive tasks on the CPU selected by the built-in idle selection logic and allow interactive tasks to be dispatched on any CPU. Signed-off-by: Andrea Righi <[email protected]>

Signed-off-by: Andrea Righi <[email protected]>

htejun · 2024-05-21T20:12:04Z

rust/scx_rustland_core/assets/bpf/main.bpf.c

 	 */
 	cpu = scx_bpf_select_cpu_dfl(p, prev_cpu, wake_flags, &is_idle);
-	if (is_idle) {
+	if (is_idle && !full_user) {


So, this would mark the picked CPU as not idle and then if !full_user ignore it, which will strand the cpu for a while. It probably would make sense to test full_user before calling scx_bpf_select_cpu_dfl().

@htejun the idea here was to use the built-in logic to pick an idle CPU, but not directly dispatch it if full_user is specified, so that the task can be forced to go to the user-space scheduler.

Probably with full_user enabled it just makes more sense to simply return prev_cpu and let the user-space scheduler decide the CPU to use, according to its own idle tracking logic. At the end full_user is provided mostly for debugging purposes, so I'm not really worried about performance here.

Re-thinking more about this, I like a lot more to simply ignore the built-in idle selection logic in full-user mode, also from a design perspective --full-user means that all the scheduling decisions are delegated to the user-space scheduler, idle selection logic included.

Therefore I pushed a change on top to completely ignore the built-in idle selection logic when running in full-user mode and make this option incompatible with --builtin-idle.

arighi · 2024-05-22T06:55:00Z

@htejun sorry... I just realized that I've pushed too much stuff in this PR... but it's still something that I've been tested a lot anyway, so it's not totally bad. :) If you think it's better I can revert the additional commits and create a separate PR.

The extra changes seem to mitigate the audio cracking issues that I was getting when the system is massively overloaded.

Just for the records the extra changes are the following:

dispatch interactive tasks on the first CPU available and non-interactive tasks on the CPU selected by the idle selection logic
assign the same time slice both to interactive and non-interactive tasks
implement a second-chance migration in select_cpu(): if a task has dispatched directly do not migrate it immediately but try to keep it on prev_cpu for another round

arighi · 2024-05-22T07:02:44Z

... actually let me do things properly, I'll revert this one and will send a new one with the right commits.

This merge included additional commits that were supposed to be included in a separate pull request and have nothing to do with the fifo-mode changes. Therefore, revert the whole pull request and create a separate one with the correct list of commits required to implement this feature. Signed-off-by: Andrea Righi <[email protected]>

Andrea Righi added 6 commits May 21, 2024 17:08

scx_rustland_core: consume from the shared DSQ before local DSQ

778ee14

The shared DSQ is typically used to prioritize tasks and dispatch them on the first CPU available, so consume from the shared DSQ before the local CPU DSQ. Signed-off-by: Andrea Righi <[email protected]>

scx_rustland: dispatch interactive tasks on any CPU

f38d91b

Dispatch non-interactive tasks on the CPU selected by the built-in idle selection logic and allow interactive tasks to be dispatched on any CPU. Signed-off-by: Andrea Righi <[email protected]>

scx_rustland_core: second chance CPU migration

f27e67d

Signed-off-by: Andrea Righi <[email protected]>

htejun approved these changes May 21, 2024

View reviewed changes

arighi merged commit e79ab40 into main May 21, 2024
1 check passed

arighi deleted the rustland-fifo-mode branch May 21, 2024 22:06

arighi mentioned this pull request May 22, 2024

scx_rustland: fix fifo mode support #306

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scx_rustland: introduce fifo mode #305

scx_rustland: introduce fifo mode #305

arighi commented May 21, 2024

htejun May 21, 2024

arighi May 21, 2024

arighi May 21, 2024

arighi commented May 22, 2024 •

edited

Loading

arighi commented May 22, 2024

scx_rustland: introduce fifo mode #305

scx_rustland: introduce fifo mode #305

Conversation

arighi commented May 21, 2024

htejun May 21, 2024

Choose a reason for hiding this comment

arighi May 21, 2024

Choose a reason for hiding this comment

arighi May 21, 2024

Choose a reason for hiding this comment

arighi commented May 22, 2024 • edited Loading

arighi commented May 22, 2024

arighi commented May 22, 2024 •

edited

Loading