scx_lavd: mitigate the lock holder preemption problem #779

Merged 4 commits into sched-ext:main on Oct 12, 2024

Conversation

multics69
Contributor

@multics69 multics69 commented Oct 11, 2024

If a task holding a lock (i.e., a lock holder) is preempted for some reason, it hinders the forward progress of the system and causes unnecessary task switches. To mitigate the lock holder preemption problem, this PR does the following:

  • Track all blocking locks in the kernel and user space, so the scheduler can easily decide whether a task holds a lock or not.
  • When a task holds a lock, it should not be preempted and should not yield its execution to other tasks.
  • When a lock holder exhausts its time slice, it should be rescheduled as soon as possible so that system-wide progress can be made.

In addition to the above major changes, this PR includes a bug fix in the preemptibility test (#773).

Overall, this brings more consistent results with fewer preemptions.
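
For illustration only, here is a minimal sketch of the counting idea. The field names lock_boost/futex_boost and the helper is_lock_holder() mirror names visible in the PR, but the struct and surrounding logic are simplified stand-ins, not the actual scx_lavd code:

#include <stdbool.h>

/* Simplified stand-in for the per-task context; the real task context in
 * scx_lavd carries many more fields. */
struct task_ctx_sketch {
	int lock_boost;		/* blocking kernel locks currently held */
	int futex_boost;	/* user-space futexes currently held */
};

/* A task is treated as a lock holder while it holds at least one lock. */
static bool is_lock_holder(struct task_ctx_sketch *taskc)
{
	return taskc->lock_boost > 0 || taskc->futex_boost > 0;
}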

Changwoo Min added 3 commits October 11, 2024 17:03
Trace the acquisition and release of blocking locks in the kernel and
futexes in user space. This is necessary to boost a lock holder
task in terms of latency and time slice. We do not boost shared
lock holders (e.g., read lock in rw_semaphore) since the kernel
already prioritizes readers over writers.

Signed-off-by: Changwoo Min <[email protected]>
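
As a rough sketch of the tracing idea, lock acquisition and release can be counted with fentry/fexit probes as below. The hook points, map, and program names here are illustrative only; the actual patch hooks the relevant kernel lock and futex paths and keeps the count in the scheduler's task context, and it skips shared locks as described above:

#include "vmlinux.h"
#include <bpf/bpf_helpers.h>
#include <bpf/bpf_tracing.h>

/* Hypothetical per-task counter of blocking locks currently held. */
struct {
	__uint(type, BPF_MAP_TYPE_TASK_STORAGE);
	__uint(map_flags, BPF_F_NO_PREALLOC);
	__type(key, int);
	__type(value, int);
} lock_cnt SEC(".maps");

SEC("fexit/mutex_lock")
int BPF_PROG(on_mutex_lock, struct mutex *lock)
{
	int *cnt = bpf_task_storage_get(&lock_cnt, bpf_get_current_task_btf(),
					NULL, BPF_LOCAL_STORAGE_GET_F_CREATE);
	if (cnt)
		(*cnt)++;	/* the task now holds one more blocking lock */
	return 0;
}

SEC("fentry/mutex_unlock")
int BPF_PROG(on_mutex_unlock, struct mutex *lock)
{
	int *cnt = bpf_task_storage_get(&lock_cnt, bpf_get_current_task_btf(),
					NULL, 0);
	if (cnt && *cnt > 0)
		(*cnt)--;	/* drop the count on release, never below zero */
	return 0;
}

char LICENSE[] SEC("license") = "GPL";
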
When a lock holder exhausts its time slice, it will be re-enqueued
to a DSQ, waiting to be scheduled while still holding a lock. In this
case, boost its latency criticality proportionally, so a lock holder
is not stuck in a DSQ for a long time, improving system-wide
progress.

Signed-off-by: Changwoo Min <[email protected]>
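
Conceptually, the boost could look like the sketch below; the scaling and helper name are hypothetical, and the actual latency-criticality calculation in scx_lavd has more inputs:

typedef unsigned long long u64;

/* Hypothetical: scale latency criticality by the number of locks held,
 * so a lock holder waiting in a DSQ is dispatched sooner. */
static u64 boost_lock_holder_lat_cri(u64 lat_cri, int lock_boost)
{
	if (lock_boost > 0)
		lat_cri += lat_cri * lock_boost;
	return lat_cri;
}
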
When a task holds a lock, it should not yield its time slice, nor
should it be preempted. In this way, we can mitigate harmful
preemption of lock holders and reduce the total preemption count.

Signed-off-by: Changwoo Min <[email protected]>
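
The intent can be summarized by two guards like the sketch below; the function names are hypothetical stand-ins for the corresponding checks in the preemption and yield paths:

#include <stdbool.h>

/* Preemption path: never kick a CPU whose current task holds a lock. */
static bool may_preempt_cpu(bool cpu_runs_lock_holder)
{
	return !cpu_runs_lock_holder;
}

/* Yield path: a yield request from a lock holder is ignored so it keeps
 * its remaining time slice. */
static bool honor_yield(int lock_boost, int futex_boost)
{
	return lock_boost <= 0 && futex_boost <= 0;
}
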
@multics69 multics69 requested a review from htejun October 11, 2024 09:56
Contributor

@arighi arighi left a comment

Looks great! Thanks for doing this, especially for all the locking investigation. I'm really curious to test this out.

{
	if (taskc && cpuc) {
		taskc->lock_boost++;
		cpuc->lock_holder = is_lock_holder(taskc);
Contributor

Can we assume this is always true, since taskc->lock_boost has just been incremented?

Contributor Author

taskc can be NULL if a task is not under scx, so the check is necessary. It is also needed to satisfy the BPF verifier.

* Reset task's lock and futex boost count
* for a lock holder to be boosted only once.
*/
reset_lock_futex_boost(taskc);
Contributor

So if we reset the counter here, when the task releases the lock its counter will become negative? I'm wondering if boolean logic would be simpler and accomplish the same: acquire a lock => boost = true, release a lock => boost = false, and after using the boost => boost = false.

Contributor Author

Yes, the counter could become negative, so the decrement logic tests whether the counter is positive before decrementing it.
I think using a counter is necessary because a task can hold multiple locks.
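
For illustration, the guarded decrement described above might look like this sketch (the struct is a simplified stand-in, not the real task context):

struct task_ctx_sketch { int lock_boost; };	/* simplified stand-in */

/* Decrement only while positive: reset_lock_futex_boost() may have already
 * zeroed the counter before the matching unlock arrives, and a counter (not
 * a boolean) is needed because a task can hold several locks at once. */
static void dec_lock_boost(struct task_ctx_sketch *taskc)
{
	if (taskc->lock_boost > 0)
		taskc->lock_boost--;
}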

@multics69
Contributor Author

Thanks @arighi for the review!

@multics69 multics69 added this pull request to the merge queue Oct 12, 2024
Merged via the queue into sched-ext:main with commit 836cf9f Oct 12, 2024
25 checks passed
@multics69 multics69 deleted the lavd-futex-v2 branch October 12, 2024 02:56