SIMD-0118: Partitioned Epoch Rewards, amend/extend design #118

CriesofCarrots · 2024-02-16T22:55:45Z

This SIMD supersedes SIMD-0015 with some new design elements.
I have begun by copying in the original SIMD so that the changes can be seen more easily in subsequent commits.

#116 may be useful as a reference, as it describes the existing Labs implementation.

CriesofCarrots · 2024-02-17T18:04:46Z

@t-nelson (can't request as reviewer)

jstarry

Looks really good! Thanks for writing this up and consolidating everything.

proposals/0118-partitioned-epoch-reward-distribution.md

t-nelson

left a couple clarifying suggestions. the changes lgtm otherwise. thanks for committing the copy of the original simd with the proposed modifications, it made review very pleasant!

i did notice some other, non-technical changes that might make the document more clear, but probably couldn't justify their own simd. do we want to entertain those here?

proposals/0118-partitioned-epoch-reward-distribution.md

CriesofCarrots

i did notice some other, non-technical changes that might make the document more clear, but probably couldn't justify their own simd. do we want to entertain those here?

I say yes. Let's get this as complete and useful as possible. I will keep things separated by commit.

proposals/0118-partitioned-epoch-reward-distribution.md

t-nelson

i left quite a bit of eye twitch intact for the sake of brevity. these suggestions knock out most of the stumbling i came across that actually brought confusion

proposals/0118-partitioned-epoch-reward-distribution.md

HaoranYi · 2024-02-25T14:19:42Z

Handling the new field is much easier than recompute. And it is a one time cost. After it is implemented, you don't need to worry about it in future. However, adding another code path introduce maintaining cost in future too.

t-nelson · 2024-02-25T17:30:37Z

the snapshot shortcut is only "easier" in the short term. it is punting the problem down the road to snapshot encoding, (de)ser, storage and distribution. success requires long term thinking, not taking the easy way out.

HaoranYi · 2024-02-26T16:17:10Z

Regarding the original partitioned reward design to have the sysvar account carries the pending reward balance.

I think there was a concern raised by Anatoly that the total capital did not stay stable during the epoch.

Anatoly has an idea of merkling all the rewards at epoch boundary and depositing the rewards into a reserve account at epoch boundary. Then stake account, when withdrawing the rewards from the reserve account, will submit a merkle proof to verify the reward payment, then transfer the balance from the reserve account.

Later on, we simplified it a bit and removed merkle tree, but we still keep the reward balance in the sysvar so that the total capital is stable during the epoch to address the above concern.

CriesofCarrots · 2024-02-26T16:56:10Z

I think there was a concern raised by Anatoly that the total capital did not stay stable during the epoch.

Total capital is not stable during an epoch now, as run_incinerator is triggered on every Bank freeze and decreases the capitalization by the amount in that account.

HaoranYi · 2024-02-26T17:12:43Z

How much balance does incinerator account get per epoch? And where does it get the blance?

CriesofCarrots · 2024-02-26T18:17:35Z

How much balance does incinerator account get per epoch? And where does it get the blance?

I don't know how much it receives; the runtime isn't logging that data. I suppose we could write a geyser plugin that would report that (or an indexer might know already). Anyone can transfer lamports to the incinerator to burn them.

t-nelson · 2024-02-26T18:47:31Z

i'm not sure how total capitalization nor incinerator are relevant here? we just need to be able to recover remaining partition distributions from minimal state when loading snapshots. if we have total&distributed epoch reward lamports and total&distributed epoch credits, Delegations are in accounts (or snapshotted stakes cache? 🫠). what else do we need do this practically?

HaoranYi · 2024-02-26T20:31:36Z

One thing people may start noticing is that the Sol Supply on solana explorer will starting slowly increase at the epoch boundary.

It is more noticeable than a onetime increase at the epoch boundary. Probably, people would want an explanation for that. A slow and gradual increase in the total supply of sol per block may make people worry about inflation...

HaoranYi · 2024-02-26T20:36:01Z

Total capital is not stable during an epoch now, as run_incinerator is triggered on every Bank freeze and decreases the capitalization by the amount in that account.

The capital change due to incinerator is very minimal. It doesn't make material impact on the total sol. While reward change are much more noticeable than incinerator burning.

HaoranYi · 2024-02-26T20:53:28Z

i'm not sure how total capitalization nor incinerator are relevant here? we just need to be able to recover remaining partition distributions from minimal state when loading snapshots. if we have total&distributed epoch reward lamports and total&distributed epoch credits, Delegations are in accounts (or snapshotted stakes cache? 🫠). what else do we need do this practically?

Because, that's related to one of the main changes for this SIMD. If I read the SIMD correctly, there are two major chagnes:

sysvar no longer carries the total reward balance.
recompute rewards at restart.

CriesofCarrots · 2024-02-26T21:46:26Z

The capital change due to incinerator is very minimal. It doesn't make material impact on the total sol. While reward change are much more noticeable than incinerator burning.

Because that is user-dependent, we absolutely cannot rely on that assumption. However, it sounds like epoch-capitalization stability is not a technical concern in the first place; just a comms issue. Since there will need to be communication about things like stake withdrawals being unavailable during the rewards period anyway, we can also explain the new shape of supply increases.

t-nelson

r+ one nit.

proposals/0118-partitioned-epoch-reward-distribution.md

riptl

Looks great. I have two nits. (Sorry am on vacation so can't use my work account @ripatel-fd)

proposals/0118-partitioned-epoch-reward-distribution.md

godmodegalactus

Overall looks a very good direction

proposals/0118-partitioned-epoch-reward-distribution.md

…rds sysvar, happen before tx processing each block

topointon-jump · 2024-03-17T15:30:48Z

proposals/0118-partitioned-epoch-reward-distribution.md

+The distribution of epoch rewards at the start block of an epoch becomes a
+significant bottleneck due to the rising number of stake accounts and voting
+nodes on the network.
+
+To address this bottleneck, we propose a new approach for distributing the
+epoch rewards over multiple blocks.


Just curious - have we measured that it is the distribution (write-back) versus the calculation that is the bottleneck?

Yes, definitely, although it doesn't seem like we have that data summarized concisely in any one place. It seems to be mostly in various places in the #proj-epoch-boundary-optimization channel on discord. For instance, here's a comment about the calculation time: https://discord.com/channels/428295358100013066/960593861732884520/1096146390037561414 (64ms for 550K stake accounts)
Whereas distribution is more like 5-10s.

topointon-jump · 2024-03-17T15:44:27Z

proposals/0118-partitioned-epoch-reward-distribution.md

+When booting from a snapshot, a node must check the EpochRewards sysvar account
+to determine whether the distribution phase is active. If so, the node must
+rerun the rewards partitioning using the `EpochRewards::num_partitions` and
+`EpochRewards::parent_blockhash` sysvar fields and determining the upcoming
+partitions by comparing its current block height to
+`EpochRewards::distribution_starting_block_height`. Then the runtime must
+recalculate the remaining rewards using the `EpochRewards::total_points` and
+`EpochRewards::total_rewards` sysvar fields, as well as the `EpochStakes` in the
+snapshot. The recalculated rewards can be confirmed by comparing a sum of the
+rewards remaining (those partitions expected to not yet have been distributed)
+with the difference between the `EpochRewards::total_rewards` and
+`EpochRewards::distributed_rewards` fields. Partitions for blocks prior to the
+current block height can be discarded.


The rewards calculation assumes that the calculation is done at the epoch boundary (for example, using VoteState::epoch_credits). We need to make sure that re-calculating rewards when booting off a snapshot doesn't break the calculation, as this assumption is no longer true. I'm a bit worried that calculating rewards not at an epoch boundary will produce a different result.

I might be wrong but just wanted to flag as something to watch for when implementing this 😄

In particular, I think solana_stake_program::stake_state::calculate_stake_points_and_credits might end up being tweaked slightly, as well as anywhere which uses VoteState::epoch_credits.

Yes, this is certainly the most sensitive aspect of the implementation. However, are you asking for any SIMD changes here?

No, not in the SIMD, as I don't think this will be fully known until it's implemented. Just wanted to flag 😄

proposals/0118-partitioned-epoch-reward-distribution.md

lheeger-jump

Approved

proposals/0118-partitioned-epoch-reward-distribution.md

Co-authored-by: Trent Nelson <[email protected]>

t-nelson

jacobcreech

Looks like consensus has been reached between the Firedancer and Anza teams. Thank you @CriesofCarrots for championing this!

billythedummy · 2024-04-02T21:30:11Z

To clarify, with this change, would the various stake pool programs (spl, marinade) need to upgrade to have their epoch update crank instruction fail if EpochRewards.active is true?

CriesofCarrots · 2024-04-03T17:33:35Z

To clarify, with this change, would the various stake pool programs (spl, marinade) need to upgrade to have their epoch update crank instruction fail if EpochRewards.active is true?

@billythedummy , I'm not personally familiar with the various stake-pool programs or the instruction you reference, but if they have operations that depend on rewards distribution being complete, then the answer is yes.
With this change, the Stake Program will be updated so that only credit-only stake-account operations will succeed when EpochRewards.active, so stake-pool operations that depend on mutating stake accounts will fail during the rewards period.

billythedummy · 2024-04-04T13:39:12Z

@CriesofCarrots each stake pool program basically have crank instructions that on every epoch reads the new balances of the stake accounts it owns, which are supposed to have increased at the epoch boundary due to staking rewards, and updates the exchange rate between SOL and the pool’s LST. These instructions dont mutate stake accounts in all cases, so i think we would need the EpochRewards.active check on the code paths that only read the balances of the stake accounts.

Tagging stake pool programs maintainers here for visibility @joncinque @ochaloup

CriesofCarrots force-pushed the simd-118 branch from 60b1507 to f027d0c Compare February 16, 2024 22:57

CriesofCarrots requested review from HaoranYi and jstarry February 16, 2024 22:59

jstarry reviewed Feb 19, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Outdated Show resolved Hide resolved

proposals/0118-partitioned-epoch-reward-distribution.md Outdated Show resolved Hide resolved

jstarry mentioned this pull request Feb 19, 2024

Amend SIMD-15 with latest labs implementation details #116

Closed

HaoranYi reviewed Feb 20, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

t-nelson reviewed Feb 20, 2024

View reviewed changes

CriesofCarrots commented Feb 21, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

brooksprumo reviewed Feb 21, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

jstarry approved these changes Feb 22, 2024

View reviewed changes

t-nelson reviewed Feb 22, 2024

View reviewed changes

HaoranYi reviewed Feb 23, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

proposals/0118-partitioned-epoch-reward-distribution.md Outdated Show resolved Hide resolved

t-nelson previously approved these changes Feb 27, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Outdated Show resolved Hide resolved

t-nelson reviewed Feb 29, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

t-nelson mentioned this pull request Feb 29, 2024

SIMD-0075: Secp256r1 Precompile (Supersedes SIMD-0048) #75

Merged

riptl approved these changes Mar 1, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

proposals/0118-partitioned-epoch-reward-distribution.md Show resolved Hide resolved

godmodegalactus approved these changes Mar 5, 2024

View reviewed changes

CriesofCarrots dismissed t-nelson’s stale review via feeb48e March 12, 2024 19:44

CriesofCarrots added 4 commits March 14, 2024 13:37

Heavier hand clarifying details from original simd

95b7c59

Add link to rewards calc doc

627fb5d

Add total_points to sysvar and rework snapshot boot accordingly

8d0a7bb

Clarify what happens beyond distributions-per-block cap

394e97b

CriesofCarrots force-pushed the simd-118 branch from feeb48e to d71daa0 Compare March 14, 2024 19:37

CriesofCarrots added 2 commits March 14, 2024 13:46

Update SIMD-0015 header

3305d6b

Add offset prefixes to EpochRewards

a7f0dbf

CriesofCarrots force-pushed the simd-118 branch from d71daa0 to a7f0dbf Compare March 14, 2024 19:46

Pointless reformatting to get by line-length limit in code block

772c38f

lheeger-jump requested changes Mar 15, 2024

View reviewed changes

CriesofCarrots added 2 commits March 15, 2024 13:47

Fix typo

25490d7

Clarify that rewards distribution, and hence updates to the EpochRewa…

8ab29c5

…rds sysvar, happen before tx processing each block

topointon-jump reviewed Mar 17, 2024

View reviewed changes

CriesofCarrots added 2 commits March 18, 2024 13:32

Update sysvar offsets/alignment

5077401

Elaborate stake-account mutation restriction

a268379

lheeger-jump previously approved these changes Mar 19, 2024

View reviewed changes

CriesofCarrots requested review from t-nelson and ripatel-fd March 20, 2024 20:22

t-nelson previously approved these changes Mar 20, 2024

View reviewed changes

proposals/0118-partitioned-epoch-reward-distribution.md Outdated Show resolved Hide resolved

CriesofCarrots dismissed stale reviews from t-nelson and lheeger-jump via cdda8cd March 20, 2024 23:30

Clarify "balances"

cdda8cd

Co-authored-by: Trent Nelson <[email protected]>

t-nelson approved these changes Mar 21, 2024

View reviewed changes

jacobcreech approved these changes Mar 21, 2024

View reviewed changes

jacobcreech merged commit 2b1640f into solana-foundation:main Mar 21, 2024
2 checks passed

CriesofCarrots mentioned this pull request Mar 26, 2024

Simd 118: rekey partitioned epoch rewards feature anza-xyz/agave#427

Merged

CriesofCarrots mentioned this pull request Apr 23, 2024

Add StakeError index comments anza-xyz/agave#1012

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SIMD-0118: Partitioned Epoch Rewards, amend/extend design #118

SIMD-0118: Partitioned Epoch Rewards, amend/extend design #118

CriesofCarrots commented Feb 16, 2024

CriesofCarrots commented Feb 17, 2024

jstarry left a comment

t-nelson left a comment

CriesofCarrots left a comment

t-nelson left a comment

HaoranYi commented Feb 25, 2024

t-nelson commented Feb 25, 2024

HaoranYi commented Feb 26, 2024

CriesofCarrots commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

CriesofCarrots commented Feb 26, 2024

t-nelson commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

HaoranYi commented Feb 26, 2024 •

edited

Loading

CriesofCarrots commented Feb 26, 2024

t-nelson left a comment

riptl left a comment

godmodegalactus left a comment

topointon-jump Mar 17, 2024

CriesofCarrots Mar 19, 2024

topointon-jump Mar 17, 2024 •

edited

Loading

topointon-jump Mar 17, 2024 •

edited

Loading

CriesofCarrots Mar 18, 2024

topointon-jump Mar 19, 2024

lheeger-jump left a comment

t-nelson left a comment

jacobcreech left a comment

billythedummy commented Apr 2, 2024

CriesofCarrots commented Apr 3, 2024

billythedummy commented Apr 4, 2024

SIMD-0118: Partitioned Epoch Rewards, amend/extend design #118

SIMD-0118: Partitioned Epoch Rewards, amend/extend design #118

Conversation

CriesofCarrots commented Feb 16, 2024

CriesofCarrots commented Feb 17, 2024

jstarry left a comment

Choose a reason for hiding this comment

t-nelson left a comment

Choose a reason for hiding this comment

CriesofCarrots left a comment

Choose a reason for hiding this comment

t-nelson left a comment

Choose a reason for hiding this comment

HaoranYi commented Feb 25, 2024

t-nelson commented Feb 25, 2024

HaoranYi commented Feb 26, 2024

CriesofCarrots commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

CriesofCarrots commented Feb 26, 2024

t-nelson commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

HaoranYi commented Feb 26, 2024

HaoranYi commented Feb 26, 2024 • edited Loading

CriesofCarrots commented Feb 26, 2024

t-nelson left a comment

Choose a reason for hiding this comment

riptl left a comment

Choose a reason for hiding this comment

godmodegalactus left a comment

Choose a reason for hiding this comment

topointon-jump Mar 17, 2024

Choose a reason for hiding this comment

CriesofCarrots Mar 19, 2024

Choose a reason for hiding this comment

topointon-jump Mar 17, 2024 • edited Loading

Choose a reason for hiding this comment

topointon-jump Mar 17, 2024 • edited Loading

Choose a reason for hiding this comment

CriesofCarrots Mar 18, 2024

Choose a reason for hiding this comment

topointon-jump Mar 19, 2024

Choose a reason for hiding this comment

lheeger-jump left a comment

Choose a reason for hiding this comment

t-nelson left a comment

Choose a reason for hiding this comment

jacobcreech left a comment

Choose a reason for hiding this comment

billythedummy commented Apr 2, 2024

CriesofCarrots commented Apr 3, 2024

billythedummy commented Apr 4, 2024

HaoranYi commented Feb 26, 2024 •

edited

Loading

topointon-jump Mar 17, 2024 •

edited

Loading

topointon-jump Mar 17, 2024 •

edited

Loading