-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(store/v2): Fix PebbleDB Iteration Edge Cases #18948
Conversation
WalkthroughThe recent changes involve enhancements to the iteration mechanics in a storage system, specifically targeting the way iterators handle the progression to subsequent keys and versions. A notable update includes the safeguarding of key integrity in the Changes
Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (invoked as PR comments)
Additionally, you can add CodeRabbit Configration File (
|
@alexanderbez your pull request is missing a changelog! |
could you touch on the edge case being fixed here in the description. Helps with future readers |
Yeah, so essentially @Kbhat1 and the team have been doing extensive battle testing of the SS PebbleDB backend and came across an edge case of where iteration would go into an infinite loop. The edge case arises from a specific set of keys and the versions they're set on. @Kbhat1 could you expand more in it? |
store/storage/storage_test_suite.go
Outdated
func (s *StorageTestSuite) TestDatabaseIterator_SkipVersion() { | ||
db, err := s.NewDB(s.T().TempDir()) | ||
s.Require().NoError(err) | ||
|
||
defer db.Close() | ||
|
||
cs := store.NewChangeset(map[string]store.KVPairs{storeKey1: { | ||
{Key: []byte("keyC"), Value: []byte("value003")}, | ||
}}) | ||
s.Require().NoError(db.ApplyChangeset(58827506, cs)) | ||
|
||
cs = store.NewChangeset(map[string]store.KVPairs{storeKey1: { | ||
{Key: []byte("keyE"), Value: []byte("value000")}, | ||
}}) | ||
s.Require().NoError(db.ApplyChangeset(58827506, cs)) | ||
|
||
cs = store.NewChangeset(map[string]store.KVPairs{storeKey1: { | ||
{Key: []byte("keyF"), Value: []byte("value000")}, | ||
}}) | ||
s.Require().NoError(db.ApplyChangeset(58827506, cs)) | ||
|
||
cs = store.NewChangeset(map[string]store.KVPairs{storeKey1: { | ||
{Key: []byte("keyC"), Value: []byte("value004")}, | ||
}}) | ||
s.Require().NoError(db.ApplyChangeset(58833605, cs)) | ||
|
||
cs = store.NewChangeset(map[string]store.KVPairs{storeKey1: { | ||
{Key: []byte("keyD"), Value: []byte("value006")}, | ||
}}) | ||
s.Require().NoError(db.ApplyChangeset(58833606, cs)) | ||
|
||
itr, err := db.Iterator(storeKey1, 58831525, []byte("key"), nil) | ||
s.Require().NoError(err) | ||
defer itr.Close() | ||
|
||
count := make(map[string]struct{}) | ||
for ; itr.Valid(); itr.Next() { | ||
count[string(itr.Key())] = struct{}{} | ||
} | ||
|
||
s.Require().Equal(3, len(count)) | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The ast-grep command has provided the loop's body within the TestDatabaseIterator_SkipVersion
function. The test increments a count based on the keys returned by the iterator, but there are no assertions on the values or explicit checks for skipped versions. This confirms the points raised in the review comment about the need for additional assertions to verify the correct key-value pairs and to ensure versions are being skipped as expected.
Analysis chain
The new test TestDatabaseIterator_SkipVersion
has been added to ensure the iterator correctly skips versions. However, there are a few points to consider:
- The test applies several changesets to the same version (
58827506
). This seems unusual as typically each changeset would correspond to a unique version. Verify if this is intentional and correct. - The test does not assert the expected values of the keys, only the count of unique keys. It would be more robust to also check the values to ensure the iterator is not only returning the correct number of keys but also the correct key-value pairs.
- The test name and the keys used (
keyC
,keyE
,keyF
) suggest that it is testing the skipping of versions, but there is no explicit check that versions are being skipped. It would be beneficial to add assertions that verify the iterator is indeed skipping over certain versions as expected. - The iterator is created with a start key of
[]byte("key")
, which does not match the pattern of the keys inserted (keyC
,keyE
,keyF
). This could be intentional to test prefix iteration, but it should be clarified or corrected if it's a mistake.
The ast-grep command to verify the application of multiple changesets to the same version number 58827506
returned matches, confirming that this is indeed the case in the test. However, the second ast-grep command failed due to a syntax error. To proceed, we need to correct the pattern and rerun the command to verify the iterator's behavior in the test.
The ast-grep command has confirmed the presence of itr.Valid()
within the TestDatabaseIterator_SkipVersion
function. To understand the assertions made within the loop, we need to capture the body of the loop. Let's adjust the ast-grep pattern to do so.
The ast-grep command did not produce the expected output, which suggests a potential issue with the complexity of the pattern. To capture the body of the loop within the TestDatabaseIterator_SkipVersion
function, we need to simplify the pattern.
The attempt to use the --within
flag with ast-grep was not successful. We need to revert to a simpler pattern that directly targets the loop's body within the TestDatabaseIterator_SkipVersion
function.
Scripts Executed
The following scripts were executed for the analysis:
Script:
#!/bin/bash
# Verify that applying multiple changesets to the same version is correct.
ast-grep --lang go --pattern $'db.ApplyChangeset($_, $_)' | grep '58827506'
# Verify that the iterator is correctly skipping versions by checking the presence of specific key-value pairs.
ast-grep --lang go --pattern $'itr.Valid(); itr.Next() { $$$ }' | grep 'TestDatabaseIterator_SkipVersion'
Length of output: 921
Script:
#!/bin/bash
# Correct the ast-grep pattern to verify the iterator's behavior in the test.
ast-grep --lang go --pattern $'func ($_ *StorageTestSuite) TestDatabaseIterator_SkipVersion() { $$$ }' | grep 'itr.Valid'
Length of output: 195
Script:
#!/bin/bash
# Adjust the ast-grep pattern to capture the body of the loop within the TestDatabaseIterator_SkipVersion function.
ast-grep --lang go --pattern $'func ($_ *StorageTestSuite) TestDatabaseIterator_SkipVersion() { $$$ for ; itr.Valid(); itr.Next() { $$$ } $$$ }'
Length of output: 145
Script:
#!/bin/bash
# Simplify the ast-grep pattern to focus on the loop's body within the TestDatabaseIterator_SkipVersion function.
ast-grep --lang go --pattern $'for ; itr.Valid(); itr.Next() { $$$ }' --within $'func ($_ *StorageTestSuite) TestDatabaseIterator_SkipVersion() { $$$ }'
Length of output: 352
Script:
#!/bin/bash
# Attempt a simpler ast-grep pattern to directly target the loop's body within the TestDatabaseIterator_SkipVersion function.
ast-grep --lang go --pattern $'for ; itr.Valid(); itr.Next() { $$$ }'
Length of output: 16366
currKey, _, ok := SplitMVCCKey(itr.source.Key()) | ||
if !ok { | ||
// XXX: This should not happen as that would indicate we have a malformed | ||
// MVCC key. | ||
panic(fmt.Sprintf("invalid PebbleDB MVCC key: %s", itr.source.Key())) | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The use of panic
should be replaced with proper error handling to prevent application crashes due to malformed MVCC keys.
- panic(fmt.Sprintf("invalid PebbleDB MVCC key: %s", itr.source.Key()))
+ // Handle error properly instead of panic
+ return fmt.Errorf("invalid PebbleDB MVCC key: %s", itr.source.Key())
Committable suggestion
❗ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation.
currKey, _, ok := SplitMVCCKey(itr.source.Key()) | |
if !ok { | |
// XXX: This should not happen as that would indicate we have a malformed | |
// MVCC key. | |
panic(fmt.Sprintf("invalid PebbleDB MVCC key: %s", itr.source.Key())) | |
} | |
currKey, _, ok := SplitMVCCKey(itr.source.Key()) | |
if !ok { | |
// XXX: This should not happen as that would indicate we have a malformed | |
// MVCC key. | |
// Handle error properly instead of panic | |
return fmt.Errorf("invalid PebbleDB MVCC key: %s", itr.source.Key()) | |
} |
The panic in the Next
method when encountering a malformed MVCC key is a drastic measure that could cause the entire application to crash. Consider replacing this with proper error handling that allows the application to recover gracefully from such an error.
return | ||
} | ||
|
||
// Move the iterator to the closest version to the desired version, so we | ||
// Move the iterator to the closest version of the desired version, so we | ||
// append the current iterator key to the prefix and seek to that key. | ||
itr.valid = itr.source.SeekLT(MVCCEncode(nextKey, itr.version+1)) | ||
|
||
tmpKey, _, ok := SplitMVCCKey(itr.source.Key()) | ||
if !ok { | ||
// XXX: This should not happen as that would indicate we have a malformed | ||
// MVCC key. | ||
itr.valid = false | ||
return | ||
} | ||
|
||
// There exists cases where the SeekLT() call moved us back to the same key | ||
// we started at, so we must move to next key, i.e. two keys forward. | ||
if bytes.Equal(tmpKey, currKey) { | ||
if itr.source.NextPrefix() { | ||
itr.Next() | ||
} else { | ||
itr.valid = false | ||
return | ||
} | ||
} | ||
|
||
// The cursor might now be pointing at a key/value pair that is tombstoned. | ||
// If so, we must move the cursor. | ||
if itr.valid && itr.cursorTombstoned() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note: This review was outside the patches, so it was mapped to the patch with the greatest overlap. Original lines [135-171]
The logic for handling reverse iteration and seeking to the next prefix seems to be missing the actual implementation for reverse iteration, as the SeekLT
method is called regardless of the itr.reverse
value. This could be an oversight or incomplete implementation. The code should either implement the reverse iteration logic or remove the itr.reverse
check if reverse iteration is not supported.
// First move the iterator to the next prefix, which may not correspond to the | ||
// desired version for that key, e.g. if the key was written at a later version, | ||
// so we seek back to the latest desired version, s.t. the version is <= itr.version. | ||
// so we seek back to the latest desired version, s.t. the version <= itr.version. | ||
if next { | ||
nextKey, _, ok := SplitMVCCKey(itr.source.Key()) | ||
if !ok { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The logic for handling reverse iteration and seeking to the next prefix seems to be missing the actual implementation for reverse iteration, as the SeekLT
method is called regardless of the itr.reverse
value. This could be an oversight or incomplete implementation. The code should either implement the reverse iteration logic or remove the itr.reverse
check if reverse iteration is not supported.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
utACK, for other reviewers: bytes.Clone == Sei's utils.Clone (it was introduced in Go 1.20, they might be a version behind or something)
Description
Thanks Kartik for finding this edge case.
ref: sei-protocol/sei-db#44
Changelog
Author Checklist
All items are required. Please add a note to the item if the item is not applicable and
please add links to any relevant follow up issues.
I have...
!
in the type prefix if API or client breaking changeCHANGELOG.md
Reviewers Checklist
All items are required. Please add a note if the item is not applicable and please add
your handle next to the items reviewed if you only reviewed selected items.
I have...