You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
When trying to count the number of spans using count_over_time the actual count gets doubled randomly. Query used was {} | count_over_time() by (name)
Start generating traffic using curl command that can be found here. Make sure to send traffic without a delay (wanted to replicate our production scenario hence chose to skip delay) and I triggered traffic from command line as follows
while true
do
curl -s http://localhost:8081/rolldice
done
After waiting for almost an hour, go to Grafana and choose Tempo data source and run {name="roll"} | count_over_time() by (name) . Please make sure step is set to 1m and time range is last 1 hour - this is really important as issue happens only when range is set to last 1 hour, if you choose an hour window in the past query works just fine. It will show a time series with a huge spike at a random point (if you don't see any spike at once just try hitting Run Query every minute ). Looking the value you could see that its exactly twice the actual value and the next minute the doubling behaviour moves to next point in time.
The following snap shot show the buggy behaviour
At 16:26 you could see that the count is 4.87K which is the actual value
After a moment later the count is changed to 9.74K which exactly the double of the actual value.
And then it continuously slides through the window.
Actual Count at 16:27 is 4.73K
Couple of minutes later it is 9.45K
Expected behavior
The count should n't get doubled when looking at the last 1 hour span count Environment:
Infrastructure: k8s, docker-compose
Deployment tool: helm, docker
The text was updated successfully, but these errors were encountered:
Unfortunately I have not had time to dig into this yet, but my belief is that spans are occasionally being double counted on the border between the metrics generators and the backend.
and attempts to cleanly divide what is requested from the generators and backend. Perhaps if the timing of the query is aligned in a specific way we actually double count.
Describe the bug
When trying to count the number of spans using
count_over_time
the actual count gets doubled randomly. Query used was{} | count_over_time() by (name)
To Reproduce
Steps to reproduce the behavior:
{name="roll"} | count_over_time() by (name)
. Please make sure step is set to1m
and time range is last 1 hour - this is really important as issue happens only when range is set to last 1 hour, if you choose an hour window in the past query works just fine. It will show a time series with a huge spike at a random point(if you don't see any spike at once just try hitting Run Query every minute )
. Looking the value you could see that its exactly twice the actual value and the next minute the doubling behaviour moves to next point in time.At 16:26 you could see that the count is 4.87K which is the actual value
After a moment later the count is changed to 9.74K which exactly the double of the actual value.
And then it continuously slides through the window.
Actual Count at 16:27 is 4.73K
Couple of minutes later it is 9.45K
Expected behavior
The count should n't get doubled when looking at the last 1 hour span count
Environment:
The text was updated successfully, but these errors were encountered: