-
Notifications
You must be signed in to change notification settings - Fork 220
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[maintainance] Collect OWNERship metrics for every repository #340
Comments
Issue-Label Bot is automatically applying the labels:
Please mark this comment with 👍 or 👎 to give our bot feedback! |
Comments from @animeshsingh
|
@jlewi What would be a good identifier for ownership of files? Do you know if GitHub API provides a way to do this? |
We have http://devstats.kubeflow.org/ but we cannot get the statistics for repositories. We also have https://www.stackalytics.com/unaffiliated?project_type=kubeflow-group&release=all&metric=commits but it does not have all contributions information and it only collects the PR metrics. We know that people like @terrytangyuan help the community to do the release, these kinds of contributions cannot be collected in devstats or stackalytics. |
We probably want to use pure GitHub API to retrieve all information that we need. Copying my reply on another email thread here in case anyone is interested:
|
We still have the problem, some contributions made in GDoc or community meeting cannot be recorded. But it will be better to use devstats or stackalytics. @terrytangyuan Are you interested in helping the community to do it? |
Those types of contributions are probably non-trivial to capture. I am uncertain whether I’ll have bandwidth to do it recently though. I’ll definitely keep you posted here if I do. |
@terrytangyuan OWNERs files will be named "OWNERs". I believe the GitHub data in BigQuery lists files so you could probably easily do a BigQuery to identify all the OWNERs files and then fetch them with "curl" or some other way. Although you could also just check out all the repositories.
@gaocegege I don't want to measure contributions. I want to measure accountability. At the simplest level, OWNERs files list who can approve PRs. So the OWNERs files are pretty good indication of who understands a piece of code and is actively involved in a project. There are incentives to keep OWNERs files up to date because if people in an OWNERs files are no longer active it slows everyone down having to reassign PRs to someone else. For example, suppose someone has been very active in a project but then they switch jobs and are no longer involved in KF. At this point, they would likely be removed from the OWNERs files indicating they are no longer active. So the OWNERs file but not contribution metrics would reflect the fact that they will no longer be actively involved going forward. |
Make sense. |
Updated OWNERS for Katib, tf-operator and pytorch-operator. |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Ref https://groups.google.com/d/msgid/kubeflow-discuss/858f1953-1808-4ebc-9a8e-1661895522ac%40Spark?utm_medium=email&utm_source=footer
From @jlewi
I also think it is necessary, thus open an issue in the community for it.
The text was updated successfully, but these errors were encountered: