[BugFix] fix scan blocked by unreleased tokens #36836
Conversation
Signed-off-by: Zhuhe Fang <[email protected]>
@@ -409,6 +419,8 @@ Status ScanOperator::_trigger_next_scan(RuntimeState* state, int chunk_source_index
     int64_t prev_scan_bytes = chunk_source->get_scan_bytes();
     auto status = chunk_source->buffer_next_batch_chunks_blocking(state, kIOTaskBatchSize, _workgroup.get());
     if (!status.ok() && !status.is_end_of_file()) {
+        LOG(ERROR) << "scan fragment " << print_id(state->fragment_instance_id()) << " driver "
+                   << get_driver_sequence() << " Scan tasks error: " << status.to_string();
         _set_scan_status(status);
     }
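For readers skimming the diff: the two added lines follow a simple pattern, namely logging the fragment and driver identity on any non-EOF error before recording the status. Below is a minimal, self-contained sketch of that pattern; the `Status` type and the `handle_scan_status` helper are simplified stand-ins for illustration, not StarRocks' real implementation.

```cpp
#include <iostream>
#include <string>

// Simplified stand-in for StarRocks' Status; the real type lives in common/.
struct Status {
    bool ok_flag = true;
    bool eof = false;
    std::string msg;
    bool ok() const { return ok_flag; }
    bool is_end_of_file() const { return eof; }
    std::string to_string() const { return msg; }
};

// Log non-EOF errors with enough identity (fragment, driver) to trace
// which scan task failed, mirroring the added LOG(ERROR) lines above.
void handle_scan_status(const Status& status, const std::string& fragment_id, int driver_seq) {
    if (!status.ok() && !status.is_end_of_file()) {
        std::cerr << "scan fragment " << fragment_id << " driver " << driver_seq
                  << " Scan tasks error: " << status.to_string() << std::endl;
    }
}

int main() {
    Status err;
    err.ok_flag = false;
    err.msg = "io error";
    handle_scan_status(err, "frag-1", 0);
}
```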
The most risky bug in this code is:
Concurrent modification of shared resources without proper synchronization may lead to undefined behavior or data races.
You can modify the code like this:
void ScanOperator::close(RuntimeState* state) {
std::lock_guard guard(_task_mutex); // Acquire the lock before modifying shared resources
set_buffer_finished();
// For the running io task, we close its chunk sources in ~ScanOperator not in ScanOperator::close.
for (size_t i = 0; i < _chunk_sources.size(); i++) {
// std::lock_guard guard(_task_mutex); // This line is commented out as we've already acquired the lock above
...
Note: The provided code snippet seems to be a diff patch from a version control system. However, the context around the modifications suggests there is thread-sensitive logic in the management of IO tasks and buffer state. Moving `set_buffer_finished()` outside of the `close` method without locking (`_task_mutex`) could result in data races if another thread is simultaneously working with the chunk sources or related counters such as `_num_running_io_tasks` or `_submit_task_counter`.
Acquiring the mutex lock at the start of the `ScanOperator::close` method helps ensure these shared resources are protected against concurrent access, preventing data races and keeping the runtime state consistent.
[FE Incremental Coverage Report] ✅ pass : 0 / 0 (0%)
[BE Incremental Coverage Report] ❌ fail : 2 / 10 (20.00%) file detail
@Mergifyio backport branch-3.2
@Mergifyio backport branch-3.1
@Mergifyio backport branch-3.0
@Mergifyio backport branch-2.5
✅ Backports have been created
✅ Backports have been created
✅ Backports have been created
✅ Backports have been created
Signed-off-by: Zhuhe Fang <[email protected]> (cherry picked from commit d3621d0)
Co-authored-by: Zhuhe Fang <[email protected]>
Why I'm doing:
Some scan operators are blocked because they can't get tokens, which are held by others. This happens when a fragment contains scan + limit: some drivers run faster and reach the limit, while the scan's has_output() keeps returning a full state, so the incoming drivers can't be issued and the fragment is blocked from finishing.
What I'm doing:
Let a finishing scan operator release its tokens in time (a sketch of the idea follows below).
Besides, add some key logs for when a scan is blocked, to help diagnose similar issues in the future.
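As a rough illustration of the fix, the idea is that an operator entering the finishing state (e.g. because a downstream LIMIT is satisfied) returns its token immediately instead of holding it until close, so other drivers waiting on tokens can be scheduled. All names below are hypothetical; the real token accounting lives in StarRocks' pipeline scheduler.

```cpp
#include <condition_variable>
#include <mutex>

// Hedged sketch: a semaphore-like token pool shared by scan operators.
class ScanTokenPool {
public:
    explicit ScanTokenPool(int tokens) : _available(tokens) {}

    // Block until a token is available, then take it.
    void acquire() {
        std::unique_lock<std::mutex> lk(_mu);
        _cv.wait(lk, [this] { return _available > 0; });
        --_available;
    }

    // Return a token; wakes one waiting operator.
    void release() {
        {
            std::lock_guard<std::mutex> lk(_mu);
            ++_available;
        }
        _cv.notify_one();
    }

private:
    std::mutex _mu;
    std::condition_variable _cv;
    int _available;
};

// The essence of the fix: when an operator transitions to "finishing",
// release its token right away rather than waiting for close().
void on_set_finishing(ScanTokenPool& pool, bool holds_token) {
    if (holds_token) {
        pool.release();
    }
}

int main() {
    ScanTokenPool pool(1);
    pool.acquire();                // a scan driver takes the only token
    on_set_finishing(pool, true);  // limit reached: token goes back immediately
    pool.acquire();                // another driver can now proceed
    pool.release();
}
```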
Fixes #issue
What type of PR is this:
Does this PR entail a change in behavior?
If yes, please specify the type of change:
Checklist:
Bugfix cherry-pick branch check: