fix: reduce max packet receive time during leader window #2801
Conversation
@bw-solana Could you review this simple change? I have tested very similar changes previously.
As @cavemanloverboy mentioned, we can schedule or drop all txs in our buffer and then enter a 100ms receive while leader, which can be extremely slow.
Before we get a better type for deserialization, I think this is a reasonable stop-gap solution.
The code change itself is straightforward, but since I've made the previous changes in this area, it's a bit sketch for me to approve this myself.
LGTM.
"Okay, it's bad we'll wait up to 100ms, but surely we'll hit the packet limit first in the practical case"
checks packet limit
🤡
i never added the damn ci flag. will watch this and merge
What are the implications of this change?
from OP: "In certain cases (if the transaction container is emptied during a leader's window), the scheduler controller may wait up to 100 milliseconds for incoming packets." There is a (somewhat rare) case where the scheduler will collect packets for 100 ms before it even begins scheduling. That's 1/4 of a slot... This change reduces that wait to 10 ms, so there is never a stretch where the scheduler is twiddling its thumbs waiting for packets.
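To make the mechanism concrete, here is a minimal sketch (not the actual agave code) of the kind of bounded receive loop being described, assuming a crossbeam-style channel; the names `MAX_RECEIVE_TIME` and `receive_until` are illustrative only:

```rust
use std::time::{Duration, Instant};

// Illustrative constant; the PR changes the real value from 100 ms to 10 ms.
const MAX_RECEIVE_TIME: Duration = Duration::from_millis(10);

/// Hypothetical sketch of a bounded packet-receive loop: with an empty
/// transaction container, the scheduler keeps pulling packets until it hits
/// either the packet limit or the deadline, so this timeout is the upper
/// bound on how long it can sit idle instead of scheduling.
fn receive_until(
    receiver: &crossbeam_channel::Receiver<Vec<u8>>,
    max_packets: usize,
) -> Vec<Vec<u8>> {
    let deadline = Instant::now() + MAX_RECEIVE_TIME;
    let mut packets = Vec::with_capacity(max_packets);
    while packets.len() < max_packets {
        let remaining = deadline.saturating_duration_since(Instant::now());
        match receiver.recv_timeout(remaining) {
            Ok(packet) => packets.push(packet),
            // Stop on timeout or a disconnected channel.
            Err(_) => break,
        }
    }
    packets
}
```

If traffic is light and the packet limit is never reached, the timeout is the worst-case delay before scheduling resumes, which is why shrinking it matters during the leader window.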
Backports to the stable branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc. that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule.
(cherry picked from commit 20e0df4)
fix: reduce max packet receive time during leader window (backport of #2801) (#4544): fix: reduce max packet receive time during leader window (#2801) (cherry picked from commit 20e0df4) Co-authored-by: cavemanloverboy <[email protected]>
* v2.0: Reclaims more old accounts in `clean` (backport of anza-xyz#4044) (anza-xyz#4089)
  * Reclaims more old accounts in `clean` (anza-xyz#4044) (cherry picked from commit 3d43824)
  * Conflicts in accounts-db/src/accounts_db.rs and accounts-db/src/accounts_db/tests.rs; fix merge conflicts
  * Co-authored-by: Brooks <[email protected]>
* v2.0: Fixes clean_old_storages_with_reclaims tests (backport of anza-xyz#4147) (anza-xyz#4166)
  * Fixes clean_old_storages_with_reclaims tests (anza-xyz#4147) (cherry picked from commit 4eabeed)
  * Conflicts in accounts-db/src/accounts_db/tests.rs; fix merge conflicts
  * Co-authored-by: Brooks <[email protected]>
* v2.0: blockstore: mark slot as dead on data shred merkle root conflict (backport of anza-xyz#3970) (anza-xyz#4074)
  * blockstore: mark slot as dead on data shred merkle root conflict (anza-xyz#3970) (cherry picked from commit 5564a94)
  * Conflicts in ledger/src/blockstore.rs; fix conflicts
  * Co-authored-by: Ashwin Sekar <[email protected]>
  * Co-authored-by: Ashwin Sekar <[email protected]>
* Bump version to v2.0.22 (anza-xyz#4200)
  * Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* v2.0: hardcode rust version for publish-crate (anza-xyz#4228)
* Bump version to v2.0.23 (anza-xyz#4419)
  * Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* v2.0: rolls out chained Merkle shreds to ~21% of mainnet slots (backport of anza-xyz#4431) (anza-xyz#4434)
  * rolls out chained Merkle shreds to ~21% of mainnet slots (anza-xyz#4431) (cherry picked from commit 9d09787)
  * Co-authored-by: behzad nouri <[email protected]>
* v2.0: [rpc] Fatal `getSignaturesForAddress()` when Bigtable errors (backport of anza-xyz#3700) (anza-xyz#4442)
  * Unindent code in `get_signatures_for_address`
  * Add a custom JSON-RPC error to throw when long-term storage (i.e. Bigtable) can't be reached
  * When the `before`/`until` signatures can't be found, throw `SignatureNotFound` instead of `RowNotFound`
  * Fatal `getSignaturesForAddress` calls when Bigtable must be queried but can't be reached (cherry picked from commit 52f132c)
  * Co-authored-by: Steven Luscher <[email protected]>
* v2.0: ci: bump [upload|download]-artifact to v4 (anza-xyz#4501)
* v2.0: ci: hardcode crate publishing version (anza-xyz#4515)
* Bump version to v2.0.24 (anza-xyz#4528)
  * Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* v2.0: fix: reduce max packet receive time during leader window (backport of anza-xyz#2801) (anza-xyz#4544)
  * fix: reduce max packet receive time during leader window (anza-xyz#2801) (cherry picked from commit 20e0df4)
  * Co-authored-by: cavemanloverboy <[email protected]>
* v2.0: Scheduler Frequency Fixes (backport of anza-xyz#4545) (anza-xyz#4576)
  * Change prio_graph_scheduler configurations for 1k maxs, 256 look ahead
  * Break loop on scanned transaction count
  * Make Hold decision behave same as Consume during receive
  * Receive maximum of 5_000 packets (loose max)
  * receive_completed before process_transactions
  * Co-authored-by: Andrew Fitzgerald <[email protected]>
* Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>, Brooks <[email protected]>, Ashwin Sekar <[email protected]>, Ashwin Sekar <[email protected]>, github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>, Yihau Chen <[email protected]>, behzad nouri <[email protected]>, Steven Luscher <[email protected]>, cavemanloverboy <[email protected]>, Andrew Fitzgerald <[email protected]>
Problem
In certain cases (if the transaction container is emptied during a leader's window), the scheduler controller may wait up to 100 milliseconds for incoming packets.
Summary of Changes
Reduce the constant max wait from 100 ms to 10 ms.
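For reference, the effective diff is roughly a one-line constant change; the sketch below uses an illustrative constant name rather than the exact identifier in the scheduler controller:

```rust
use std::time::Duration;

// Illustrative name, not the real identifier: this bounds how long the
// scheduler controller blocks waiting for incoming packets when its
// transaction container is empty during the leader window.
// At 100 ms the wait could eat up to ~25% of a 400 ms slot; at 10 ms the
// worst-case idle window is ~2.5% of a slot.
const MAX_PACKET_RECEIVE_TIME: Duration = Duration::from_millis(10); // previously 100
```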