-
Notifications
You must be signed in to change notification settings - Fork 222
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TEP-0006: New (sub-)metrics #146
Conversation
/assign afrittoli |
I believe that moving to OpenTelemetry should be a goal if we are attempting to put metrics on the correct path for the future as that is the direction of OpenCensus. |
I agree. But at the moment we use the Knative reporting libraries. We could alternatively wait for them to migrate and just rely on their migration as I mentioned in this comment. What do you think? |
I would assume that it would be the goal of Tekton to have its own metrics/reporting (exporter/observability config) code and not use Knative's in order to have the ability to move independently of Knative. If Knative were a dedicated monitoring/metrics project that intended other projects to directly reuse its metrics-related libraries, I would feel differently, but instead believe that Knative's mission is not to provide their metrics libs. as first-class, supported code for downstream projects to use and rely upon. |
That's a very good point. I now am more inclined to indeed have our own reporting system after your comment and decouple it from the Knative at some point. One quick problem is that OpenTelemetry APIs are in alpha (and recently some in beta). So I'm not sure it is at the stage that we accept using in this project as is. I defer this decision to you and others (cc @afrittoli @vdemeester) who have more experience and I will follow your guidance. Let me know. |
So I am not fully sure I agree with that. We are using
That's a good question 🙃. I see |
cc @evankanderson has been poking at OpenCensus + OpenTelemetry stuff in the context of knative/pkg. |
My suggestion would be to use the upstream opencensus libraries directly with the opencensus collector, and plan at some point to move to opentelemetry when their metrics support is production quality. The knative/pkg metrics package as currently implemented addresses a number of problems which Tekton probably doesn't need to handle. (The one part which may be useful is the support for exporting metrics for multiple Resources from a single process. If that is useful, it may be better to upstream it.) The capabilities that make the package a poor fit are:
|
@vdemeester @mrutkows based on what Evan said are we okay at least with the (non-)goals and other sections so far? |
/assign vdemeester |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
based on what Evan said are we okay at least with the (non-)goals and other sections so far?
@NavidZ I think we are. Let's focus this TEP around the metrics we want to gather/report 👍
teps/000X-tekton-metrics.md
Outdated
|
||
--> | ||
|
||
# TEP-NNNN: More granular metrics |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We will need to assign a number 👼
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Done. I picked 0006 as that doesn't seem to exist yet in the repo.
This is a barebone TEP for adding new metrics as suggested by the TEP template. At this point it has only the summary, motivation, and a testing plan to get aggreement on the general path. I will iterate more to add the proposed submetrics and what they measure and where should they be measured in the code in subsequent PRs.
/assign @afrittoli |
ping @afrittoli |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good to me as a starting point.
I like the idea of describing the general intention in the 'proposed' state and add the bulk of the content in the next iteration.
/lgtm
@afrittoli: GitHub didn't allow me to request PR reviews from the following users: mrutkows. Note that only tektoncd members and repo collaborators can review this PR, and authors cannot review their own PRs. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
About using Something worth noting is that some part of the controller code are generated using |
/hold cancel |
/kind tep |
Please let us know the status and any blockers? @NavidZ We are looking for examples and types of metrics, thanks |
@pritidesai do we want to land this and I'll follow up with more PRs regarding the details? |
Had a brief discussion in the API working group today, @NavidZ suggested merging this with less details fleshed out as proposed, then incrementally adding more; we'd want to all be satisfied with the level of detail before moving to implmentable In the meeting @imjasonh took an action to take a look as well |
Sorry for the delay in getting to this. This TEP as proposed lgtm, as others have said we'll definitely want to add more detail in future revisions before we get to /lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: afrittoli The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Signed-off-by: Meredith Lancaster <[email protected]> Signed-off-by: Meredith Lancaster <[email protected]>
This is a barebone TEP for adding new metrics as
suggested by the TEP template. At this point it
has only the problem statement and a testing plan.
I will iterate more to add the actual (sub-)metrics
and what they measure and where should they be
measured in the code in subsequent PRs.
tektoncd/pipeline#2814