[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster (backport #48832) #49170

mergify · 2024-07-30T14:54:16Z

Why I'm doing:

In previous implementation, there is no memory limit for metadata in shared-nothing cluster. So if there are too many tablets and segment files, BE will OOM. We need a controllable metadata memory strategy.

What I'm doing:

Add a lru cache for metadata, its capacity is controlled by be.conf metadata_cache_memory_limit_percent.
When Rowset performs a load action, it adds the Rowset to the lru cache, and if the lru cache memory exceeds the limit, it selectively eliminates the loaded Rowset, ultimately realizing the goal of controllable metadata memory.
This strategy is only support non-pk table now.

There will be two cases when evict the Rowset:

There is no reference hold by other user, Rowset's state will change from ROWSET_LOADED to ROWSET_UNLOADED , and then release the memory.
There are reference hold by other user (e.g. compaction or query), Rowset's state will change from ROWSET_LOADED to ROWSET_UNLOADING. Memory will be release after compaction or query finish.

Test Result

Before turning on LRU cache control strategy, metadata continues to increase without limit.
After enabling LRU cache control strategy (with limit set to 1GB), metadata memory is stabilized at 1GB.

What type of PR is this:

Does this PR entail a change in behavior?

Yes, this PR will result in a change in behavior.
No, this PR will not result in a change in behavior.

If yes, please specify the type of change:

Interface/UI changes: syntax, type conversion, expression evaluation, display information
Parameter changes: default values, similar parameters but with different default values
Policy changes: use new policy to replace old one, functionality automatically enabled
Feature removed
Miscellaneous: upgrade & downgrade compatibility, etc.

Checklist:

I have added test cases for my bug fix or my new feature
This pr needs user documentation (for new or modified features or behaviors)
- I have added documentation for my new feature or new function
This is a backport pr

Bugfix cherry-pick branch check:

This is an automatic backport of pull request #48832 done by [Mergify](https://mergify.com). ## Why I'm doing: In previous implementation, there is no memory limit for `metadata` in shared-nothing cluster. So if there are too many tablets and segment files, BE will OOM. We need a controllable `metadata` memory strategy.

What I'm doing:

Add a lru cache for metadata, its capacity is controlled by be.conf metadata_cache_memory_limit_percent.
When Rowset performs a load action, it adds the Rowset to the lru cache, and if the lru cache memory exceeds the limit, it selectively eliminates the loaded Rowset, ultimately realizing the goal of controllable metadata memory.
This strategy is only support non-pk table now.

There will be two cases when evict the Rowset:

There is no reference hold by other user, Rowset's state will change from ROWSET_LOADED to ROWSET_UNLOADED , and then release the memory.
There are reference hold by other user (e.g. compaction or query), Rowset's state will change from ROWSET_LOADED to ROWSET_UNLOADING. Memory will be release after compaction or query finish.

Test Result

Before turning on LRU cache control strategy, metadata continues to increase without limit.
After enabling LRU cache control strategy (with limit set to 1GB), metadata memory is stabilized at 1GB.

What type of PR is this:

Does this PR entail a change in behavior?

Yes, this PR will result in a change in behavior.
No, this PR will not result in a change in behavior.

If yes, please specify the type of change:

Interface/UI changes: syntax, type conversion, expression evaluation, display information
Parameter changes: default values, similar parameters but with different default values
Policy changes: use new policy to replace old one, functionality automatically enabled
Feature removed
Miscellaneous: upgrade & downgrade compatibility, etc.

Checklist:

I have added test cases for my bug fix or my new feature
This pr needs user documentation (for new or modified features or behaviors)
- I have added documentation for my new feature or new function
This is a backport pr

…thing cluster (#48832) Signed-off-by: luohaha <[email protected]> (cherry picked from commit 0f639ad) # Conflicts: # be/src/storage/CMakeLists.txt

mergify · 2024-07-30T14:54:17Z

Cherry-pick of 0f639ad has failed:

On branch mergify/bp/branch-3.2/pr-48832
Your branch is up to date with 'origin/branch-3.2'.

You are currently cherry-picking commit 0f639ad566.
  (fix conflicts and run "git cherry-pick --continue")
  (use "git cherry-pick --skip" to skip this patch)
  (use "git cherry-pick --abort" to cancel the cherry-pick operation)

Changes to be committed:
	modified:   be/src/common/config.h
	new file:   be/src/storage/rowset/metadata_cache.cpp
	new file:   be/src/storage/rowset/metadata_cache.h
	modified:   be/src/storage/rowset/rowset.cpp
	modified:   be/src/storage/rowset/rowset.h
	modified:   be/src/storage/storage_engine.cpp
	modified:   be/src/storage/tablet_manager.cpp
	modified:   be/src/storage/tablet_manager.h
	modified:   be/src/util/starrocks_metrics.h
	modified:   be/test/CMakeLists.txt
	new file:   be/test/storage/rowset/metadata_cache_test.cpp
	modified:   be/test/storage/tablet_mgr_test.cpp

Unmerged paths:
  (use "git add <file>..." to mark resolution)
	both modified:   be/src/storage/CMakeLists.txt

To fix up this pull request, you can check it out locally. See documentation: https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/reviewing-changes-in-pull-requests/checking-out-pull-requests-locally

mergify · 2024-07-30T14:54:55Z

@mergify[bot]: Backport conflict, please reslove the conflict and resubmit the pr

Signed-off-by: Yixin Luo <[email protected]>

[Enhancement] metadata support LRU memory evict strategy in shared-no…

8183884

…thing cluster (#48832) Signed-off-by: luohaha <[email protected]> (cherry picked from commit 0f639ad) # Conflicts: # be/src/storage/CMakeLists.txt

mergify bot added the conflicts label Jul 30, 2024

mergify bot mentioned this pull request Jul 30, 2024

[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster #48832

Merged

24 tasks

github-actions bot assigned luohaha Jul 30, 2024

mergify bot closed this Jul 30, 2024

github-actions bot added automerge behavior_changed labels Jul 30, 2024

mergify bot deleted the mergify/bp/branch-3.2/pr-48832 branch July 30, 2024 14:55

luohaha restored the mergify/bp/branch-3.2/pr-48832 branch July 30, 2024 15:19

luohaha reopened this Jul 30, 2024

wanpengfei-git enabled auto-merge (squash) July 30, 2024 15:20

resolve

86c1a0e

Signed-off-by: Yixin Luo <[email protected]>

luohaha approved these changes Jul 30, 2024

View reviewed changes

wanpengfei-git merged commit 4e55c60 into branch-3.2 Jul 30, 2024
28 of 29 checks passed

wanpengfei-git deleted the mergify/bp/branch-3.2/pr-48832 branch July 30, 2024 16:00

github-actions bot added the version:3.2.10 label Jul 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster (backport #48832) #49170

[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster (backport #48832) #49170

mergify bot commented Jul 30, 2024 •

edited by wanpengfei-git

Loading

mergify bot commented Jul 30, 2024

mergify bot commented Jul 30, 2024

[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster (backport #48832) #49170

[Enhancement] metadata support LRU memory evict strategy in shared-nothing cluster (backport #48832) #49170

Conversation

mergify bot commented Jul 30, 2024 • edited by wanpengfei-git Loading

Why I'm doing:

What I'm doing:

Test Result

What type of PR is this:

Checklist:

Bugfix cherry-pick branch check:

What I'm doing:

Test Result

What type of PR is this:

Checklist:

mergify bot commented Jul 30, 2024

mergify bot commented Jul 30, 2024

mergify bot commented Jul 30, 2024 •

edited by wanpengfei-git

Loading