
Pauses bulk imports into tablets w/ too many files #5104

Merged
merged 1 commit on Dec 6, 2024

Conversation

keith-turner (Contributor) commented Nov 24, 2024

Bulk imports can add files to a tablet faster than compactions can shrink the number of files. Several situations can cause this:

  • Compactors are all busy when new bulk imports arrive.
  • Many processes bulk import a few files each to a single tablet at around the same time.
  • A single process bulk imports a lot of files to a single tablet.

When a tablet has too many files it can eventually cause cascading problems for compactions and scans. This change adds two properties to help avoid the problem.

The first property is table.file.pause. This property pauses bulk imports, and eventually minor compactions, when a tablet's current file count exceeds the configured value. The default is unlimited, so by default nothing ever pauses.

The second property is table.bulk.max.tablet.files. This property determines the maximum number of files a bulk import can add to a single tablet. When this limit is exceeded, the bulk import operation fails w/o making changes to any tablets.
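As a concrete illustration, both limits can be set per table from the Accumulo shell. This is a sketch: mytable is a placeholder table name, and the values simply mirror the walkthrough that follows.

```shell
# Hypothetical example values: pause bulk imports once a tablet in
# mytable has more than 30 files.
config -t mytable -s table.file.pause=30

# Fail any bulk import that would add more than 100 files to one tablet.
config -t mytable -s table.bulk.max.tablet.files=100
```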

Below is an example of how these properties behave.

  1. Set table.file.pause=30
  2. Set table.bulk.max.tablet.files=100
  3. Import 20 files into tablet A, this causes tablet A to have 20 files
  4. Import 20 files into tablet A, this causes tablet A to have 40 files
  5. Import 20 files into tablet A. Because the tablet currently has 40 files and the pause limit is 30, this bulk import will pause.
  6. Tablet A compacts 10 files, this causes tablet A to have 31 files. It is still above the pause limit so the bulk import does not progress.
  7. Tablet A compacts 10 files, this causes tablet A to have 22 files.
  8. The paused bulk import proceeds, this causes tablet A to have 42 files.
  9. Import 200 files into tablet B and one file into tablet A. This operation fails w/o changing tablet A or B because 200 exceeds the value of table.bulk.max.tablet.files.
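The interaction of the two properties in the walkthrough above can be sketched as a tiny standalone simulation. This is a hedged illustration with hypothetical names, not Accumulo's actual bulk import code; the constants stand in for the two table properties.

```python
PAUSE_LIMIT = 30      # stands in for table.file.pause
MAX_BULK_FILES = 100  # stands in for table.bulk.max.tablet.files

def bulk_import_decision(current_files: int, new_files: int) -> str:
    """Return what a bulk import into a single tablet would do."""
    if new_files > MAX_BULK_FILES:
        # Exceeds table.bulk.max.tablet.files: the whole operation
        # fails before changing any tablet.
        return "fail"
    if current_files > PAUSE_LIMIT:
        # Tablet already has too many files: wait for compactions.
        return "pause"
    return "proceed"

# Steps 3-5 of the walkthrough:
assert bulk_import_decision(0, 20) == "proceed"   # tablet A -> 20 files
assert bulk_import_decision(20, 20) == "proceed"  # tablet A -> 40 files
assert bulk_import_decision(40, 20) == "pause"    # 40 > 30, import pauses
# Step 6: a compaction leaves 31 files, still above the pause limit.
assert bulk_import_decision(31, 20) == "pause"
# Steps 7-8: at 22 files the paused import proceeds -> 42 files.
assert bulk_import_decision(22, 20) == "proceed"
# Step 9: 200 files exceeds the max, so the operation fails outright.
assert bulk_import_decision(0, 200) == "fail"
```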

While making this change I ran into two preexisting problems. The first was with bulk import setting time. For the case of multiple files, the behavior of setting time was incorrect and inconsistent depending on the table time type and on whether the tablet was hosted. The behavior is now consistent for hosted and unhosted tablets and for both table time types: a single timestamp is allocated for all files in every case, where the code previously allocated a different number of timestamps in each of the four cases. The old behavior was causing tablet refresh and these changes to fail, so the issue is fixed here since progress could not be made without fixing it. The new test in this PR that adds lots of files to a single tablet and requests that bulk import set time uncovered the problem.

The second problem was that the existing code had handling for the case of a subset of files being added to a tablet by bulk import. This should never happen because files are added via a single mutation: either the entire mutation goes through or nothing does. That handling was removed, and the code now throws an exception if any subset other than the empty set or the full set is seen. This change greatly simplified implementing the feature.
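The simplified invariant can be sketched as follows. This is a hypothetical illustration, not the actual Accumulo code: after the mutation, the files added must be all of what was requested or none of it, and anything in between indicates a bug.

```python
def verify_mutation_outcome(requested: frozenset, added: frozenset) -> bool:
    """Return True if the whole file set went in, False if none did.

    Files are added to a tablet in a single mutation, so a non-empty
    proper subset should be impossible; seeing one raises, mirroring
    the exception the changed code throws.
    """
    if added == requested:
        return True   # the entire mutation went through
    if not added:
        return False  # nothing went through (mutation rejected)
    raise ValueError(f"partial bulk import detected: {sorted(added)}")

files = frozenset({"f1.rf", "f2.rf", "f3.rf"})
assert verify_mutation_outcome(files, files) is True
assert verify_mutation_outcome(files, frozenset()) is False

# A proper non-empty subset is treated as a bug.
raised = False
try:
    verify_mutation_outcome(files, frozenset({"f1.rf"}))
except ValueError:
    raised = True
assert raised
```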

fixes #5023

keith-turner added this to the 4.0.0 milestone Nov 24, 2024
keith-turner (Author):

Before closing this I plan to open a follow on issue about making minor compactions pause also.

dlmarion (Contributor):

#5105 (comment) mentions this PR.

cshannon (Contributor) left a comment

I took a look at this and didn't spot any issues, just from the small-scale unit tests and reviewing the code. Overall it seems like a nice change to pause the bulk import FATE ops until compactions can catch up. I am curious to see how this change performs on a real cluster under load testing, and whether we discover anything that needs to be tweaked as a follow-on.

keith-turner merged commit 3787f1b into apache:main on Dec 6, 2024
8 checks passed
keith-turner deleted the accumulo-5023 branch December 6, 2024 21:24
dlmarion commented Dec 7, 2024

New failure in SplitIT after this change, which may be due to it:

org.opentest4j.AssertionFailedError: expected: <0> but was: <1>
	at org.junit.jupiter.api.AssertionFailureBuilder.build(AssertionFailureBuilder.java:151)
	at org.junit.jupiter.api.AssertionFailureBuilder.buildAndThrow(AssertionFailureBuilder.java:132)
	at org.junit.jupiter.api.AssertEquals.failNotEqual(AssertEquals.java:197)
	at org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:150)
	at org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:145)
	at org.junit.jupiter.api.Assertions.assertEquals(Assertions.java:531)
	at org.apache.accumulo.test.functional.SplitIT.bulkImportThatCantSplitHangsCompaction(SplitIT.java:515)
	at java.base/java.lang.reflect.Method.invoke(Method.java:569)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
	at java.base/java.lang.Thread.run(Thread.java:840)

keith-turner (Author):

Opened #5148 to fix that test issue; it seems the test had a preexisting problem and was just flaky.

keith-turner added a commit to keith-turner/accumulo that referenced this pull request Dec 9, 2024
After the changes in apache#5104, bulk import would read a tablet's files, examine them, and then require the set of files to be the same in order to update the tablet. The code only cared that the count of files was below a specified limit, yet requiring the set of files not to change was causing the conditional mutation to fail on a busy tablet. This change modifies the condition to check the count of files, which avoids contention on a busy tablet where the set of files is constantly changing.
keith-turner added a commit that referenced this pull request Dec 12, 2024
…5159)

After the changes in #5104, bulk import would read a tablet's files, examine them, and then require the set of files to be the same in order to update the tablet. The code only cared that the count of files was below a specified limit, yet requiring the set of files not to change was causing the conditional mutation to fail on a busy tablet. This change modifies the condition to check the count of files, which avoids contention on a busy tablet where the set of files is constantly changing.
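The difference between the two condition styles in that commit message can be shown with a toy model. This is a hypothetical sketch, not Accumulo's conditional mutation API: requiring the exact file set to be unchanged fails whenever a compaction rewrites files between the read and the write, while checking only the count tolerates that churn.

```python
PAUSE_LIMIT = 30  # example value for table.file.pause

def set_condition_holds(files_at_read: frozenset,
                        files_at_write: frozenset) -> bool:
    """Old condition: the tablet's file set must be exactly what was read."""
    return files_at_write == files_at_read

def count_condition_holds(files_at_write: frozenset) -> bool:
    """New condition: only the number of files matters."""
    return len(files_at_write) <= PAUSE_LIMIT

# A busy tablet: between read and write, a compaction replaced two files
# with one, so the set changed even though the count stayed low.
at_read = frozenset({"a.rf", "b.rf", "c.rf"})
at_write = frozenset({"ab.rf", "c.rf"})

assert not set_condition_holds(at_read, at_write)  # old: spurious failure
assert count_condition_holds(at_write)             # new: import proceeds
```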
Successfully merging this pull request may close these issues.

Add backpressure option for bulk import