Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more metrics to async task dispatch #6202

Merged
merged 1 commit into from
Jul 31, 2024

Conversation

Shaddoll
Copy link
Member

What changed?
Add more metrics to async task dispatch

Why?
Improve observability

How did you test it?
manual tests

Potential risks

Release notes

Documentation Changes

Copy link

codecov bot commented Jul 30, 2024

Codecov Report

Attention: Patch coverage is 80.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 72.91%. Comparing base (9a7a8a4) to head (3544e2d).
Report is 2 commits behind head on master.

Files Patch % Lines
service/matching/tasklist/matcher.go 80.00% 3 Missing ⚠️
Additional details and impacted files
Files Coverage Δ
service/matching/tasklist/matcher.go 82.15% <80.00%> (-0.23%) ⬇️

... and 8 files with indirect coverage changes


Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9a7a8a4...3544e2d. Read the comment docs.

return nil
case <-ctx.Done():
return fmt.Errorf("context done when trying to forward local task: %w", ctx.Err())
default:
}

attempt := 0
Copy link
Member

@davidporter-id-au davidporter-id-au Jul 31, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My memory is slightly hazy here, but I'd be careful interpreting this value as the actual number of retries per task, if I recall correctly this times out (in pathological cases) and then ends up looping from dispatchSingleTaskFromBufferWithRetries or above, I dont' recall exactly, so the actual number of attempts per task may be much higher.

I don't think its bad to include this metric like this, but I just think it may only capture part of the issue, and possibly adding an attempt/timer higher up might be worthwhile to more fully capture the wait time (maybe dispatchSingleTaskFromBufferWithRetries)?

@Shaddoll Shaddoll merged commit 38c295d into cadence-workflow:master Jul 31, 2024
20 checks passed
@Shaddoll Shaddoll deleted the async-metrics branch July 31, 2024 16:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants