-
Notifications
You must be signed in to change notification settings - Fork 335
Lag error on ks_test #43
Comments
The error means that there is not enough datapoints for the test. What resolution (interval between observations) do you use? |
I use a 10 second resolution, with lots of variation in overall sample size. Is there a hard number on the minimum datapoints needed for this statistic? |
Yes, there is a hard limit of 10 datapoints in reference part (between hour and 10 minutes ago). |
Ah, I see - yeah, I think a conditional there would be safer. On Mon, Aug 19, 2013 at 8:29 AM, Anton Lebedevich
Abe Stanway |
I've added conditional. As a side note there might be some confusion in the way algorithms select data range to check for anomalies. Checking last N datapoints gives different results on metrics with different resolutions. If anomaly is detected on 1 last datapoint or even 3 last datapoints on a metric with 2 seconds resolution that anomaly might disappear in 10 seconds. If metric has resolution of 5 minutes then there is quite a lot of time for human to notice detected anomaly. Checking last N minutes would not provide enough datapoints for some algorithms (like ks_test) on low resolution metrics. |
That is correct. Perhaps a new setting is needed - TAIL_AVERAGE_SIZE? On Aug 20, 2013, at 4:15 AM, Anton Lebedevich [email protected] wrote:
|
Metrics with a different resolutions might be present in the same environment so single size won't fit them all. Maybe it's better to use time to cut tail off the sequence (TAIL_TIME)? |
I'm going to close this out, but can you please raise another issue with a case for TAIL_TIME and pragmatic resolution checking? |
I occasionally get this error:
Any clues? cc @mabrek
Re: f886000
The text was updated successfully, but these errors were encountered: