-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support more than one statistic for summarizing time lists #127
Conversation
6c40950
to
1e2ab07
Compare
8650250
to
4b67615
Compare
Not yet configurable, but a first run through the code of where changes would be needed to move to more than one statistic. At a first glance, the table doesn't look that bad. At least me, I always look at more than one statistic (I believe I started that back when I discovered the possibility to create "describe" stats with R when starting to look at series of numbers). The question here is which stats exactly to look at. As for the state of the change:
|
In the latest commit I replaced references to averages where in reality they already are a combination of median, average, 90th percentile. |
This commit switches code for computing averages to using numpy. This is a first step towards supporting more than one statistic for summarizing lists of times for state change of issues. Averages are easy to compute but sensitive to outliers. In particular when thinking of SLAs for times for first responses it is often more helpful to look at median, quartiles or even 90th percentile.
Currently we are only looking at averages. This adds median and 90th percentile to issues stats. Those are much less sensitive to outliers. (Think issue cleanup sessions where very old issues get closed out but then dominate the issue stats at the end of the month).
3536281
to
aabb02f
Compare
In previous commits pure averages were replace by a dict of avg, median, 90th percentile. This makes that switch visible in docs, method names, variable names.
aabb02f
to
6eeef9b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Tested this out and its working great! Love the new additional information. Thanks for the improvements @MaineC !
This commit switches code for computing averages to using numpy.
This is a first step towards supporting more than one statistic for summarizing lists of times for state change of issues. #84
Averages are easy to compute but sensitive to outliers. In particular when thinking of SLAs for times for first responses it is often more helpful to look at median, quartiles or even 90th percentile.
Author/Contributor
make lint
and fix any issues that you have introducedmake test
and ensure you have test coverage for the lines you are introducingbug
,documentation
,enhancement
,infrastructure
, orbreaking