cAdvisor-less, CRI-full Container and Pod Stats #2371

Open
10 of 17 tasks
haircommander opened this issue Jan 29, 2021 · 121 comments
Assignees
Labels
sig/node Categorizes an issue or PR as relevant to SIG Node. sig/windows Categorizes an issue or PR as relevant to SIG Windows. stage/alpha Denotes an issue tracking an enhancement targeted for Alpha status

Comments

@haircommander
Contributor

haircommander commented Jan 29, 2021

Enhancement Description

@k8s-ci-robot k8s-ci-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Jan 29, 2021
@haircommander
Contributor Author

/sig node

@k8s-ci-robot k8s-ci-robot added sig/node Categorizes an issue or PR as relevant to SIG Node. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jan 29, 2021
@ehashman
Member

ehashman commented Feb 2, 2021

/milestone v1.21

@k8s-ci-robot k8s-ci-robot added this to the v1.21 milestone Feb 2, 2021
@annajung annajung added stage/alpha Denotes an issue tracking an enhancement targeted for Alpha status tracked/yes Denotes an enhancement issue is actively being tracked by the Release Team labels Feb 3, 2021
@JamesLaverack
Member

Hey @haircommander and @bobbypage, enhancements 1.21 shadow here,

Enhancements Freeze is 2 days away, Feb 9th EOD PST

The enhancements team is aware that the KEP update is currently in progress (PR #2364). Please make sure to work on the PRR questionnaire and requirements and get the PR merged before the freeze. For PRR-related questions or to boost the PR for PRR review, please reach out in Slack on the #prod-readiness channel.

Any enhancements that do not complete the following requirements by the freeze will require an exception.

  • [IN PROGRESS] The KEP must be merged in an implementable state
  • [IN PROGRESS] The KEP must have test plans
  • [IN PROGRESS] The KEP must have graduation criteria
  • [IN PROGRESS] The KEP must have a production readiness review

@annajung
Contributor

Hi @haircommander and @bobbypage, 1.21 Enhancements Lead here.

Enhancements Freeze is now in effect.

Unfortunately, your KEP needed to be updated and the PR has not yet merged. If you wish to be included in the 1.21 Release, please submit an Exception Request as soon as possible.

/milestone clear

@k8s-ci-robot k8s-ci-robot removed this from the v1.21 milestone Feb 10, 2021
@annajung annajung added tracked/no Denotes an enhancement issue is NOT actively being tracked by the Release Team and removed tracked/yes Denotes an enhancement issue is actively being tracked by the Release Team labels Feb 10, 2021
@ehashman
Member

ehashman commented May 4, 2021

/milestone v1.22

@k8s-ci-robot k8s-ci-robot added this to the v1.22 milestone May 4, 2021
@JamesLaverack JamesLaverack added tracked/yes Denotes an enhancement issue is actively being tracked by the Release Team and removed tracked/no Denotes an enhancement issue is NOT actively being tracked by the Release Team labels May 5, 2021
@salaxander
Contributor

Hey @haircommander and @bobbypage - 1.22 enhancements team here! Just a reminder that enhancements freeze is coming up on 5/13. We'll need the KEP merged before then to get this included in the 1.22 release.

Let us know if there's anything we can do to help before then!

@salaxander
Contributor

@haircommander @bobbypage #2364 merged so we've got you tracked for 1.22 :)

@PI-Victor
Member

Hello @haircommander @bobbypage 👋, 1.22 Docs release lead here.
This enhancement is marked as ‘Needs Docs’ for 1.22 release.

Please follow the steps detailed in the documentation to open a PR against dev-1.22 branch in the k/website repo. This PR can be just a placeholder at this time and must be created before Fri July 9, 11:59 PM PDT.
Also, take a look at Documenting for a release to familiarize yourself with the docs requirements for the release.

Thank you!

@haircommander
Contributor Author

thanks for the heads up @PI-Victor !

@ehashman
Member

xref kubernetes/kubernetes#102789

@haircommander
Contributor Author

xref initial kubelet implementation kubernetes/kubernetes#103095

@salaxander
Contributor

Hey @bobbypage and @haircommander - Just checking in as we're about 2 weeks away from 1.22 code freeze. I've got kubernetes/kubernetes#103095 and kubernetes/kubernetes#103095 tracked as the open k/k PRs. Are there any other open or merged PRs we should be tracking? Thanks!

@haircommander
Contributor Author

none opened yet! I will post them here if we do. Thanks for your work @salaxander

@salaxander
Contributor

Hey @haircommander - One more check-in as we're a week out from 1.22 code freeze. Any updates on if you expect kubernetes/kubernetes#103095 and kubernetes/kubernetes#103095 to merge before the deadline?

Thanks!

@haircommander
Contributor Author

Thanks for checking! I do expect them to merge. We're waiting on an e2e POC which I'm working on :)

@salaxander
Contributor

Hi @haircommander - One last ping (sorry!). Code freeze is tomorrow evening (PST), so those two open PRs will need to merge before then for this to be included in 1.22. Let me know if there's anything we can do to help :)

@haircommander
Contributor Author

As a note: I think we are deferring this (again 🙃 ) from 1.30 until containerd and cri-o have support. The cri-o support is actively being worked on, and containerd is being taken by @adisky . I am hoping to make forward progress in 1.31

@ialidzhikov

> yeah containerd needs to be extended to pass the required container and pod metrics through to the kubelet. That said, for metrics that weren't formerly exposed by cadvisor (like the pull metrics that this comment was done on), it may be more efficient to continue to read them directly from containerd

Hi @haircommander,

Just to make sure I understand correctly: this KEP won't help with exposing containerd metrics (like the image pull ones above) via the kubelet, right? So if we need the containerd metrics for image pulls, we need to collect/scrape them from containerd directly, right?

@haircommander
Contributor Author

yes correct!
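
For readers who need those runtime-only metrics today, a minimal sketch of scraping containerd directly might look like the following. It assumes containerd's Prometheus endpoint has been enabled through the `[metrics]` `address` setting in its config and is served under `/v1/metrics`; the address, the `needle` filter, and the function names are illustrative assumptions, not something taken from this thread.

```python
from urllib.request import urlopen


def sample_names(exposition_text):
    """Return the set of metric names in Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # The metric name ends at the first '{' (labels) or whitespace (value).
        end = len(line)
        for sep in ("{", " ", "\t"):
            idx = line.find(sep)
            if idx != -1:
                end = min(end, idx)
        names.add(line[:end])
    return names


def filter_names(exposition_text, needle):
    """Return the sorted metric names that contain the substring `needle`."""
    return sorted(n for n in sample_names(exposition_text) if needle in n)


def scrape(url, needle):
    """Fetch a metrics endpoint and list metric names containing `needle`.

    `url` is an assumption, for example "http://127.0.0.1:1338/v1/metrics"
    if that is the address configured for containerd on the node.
    """
    with urlopen(url, timeout=5) as resp:
        text = resp.read().decode("utf-8")
    return filter_names(text, needle)
```

Calling `scrape(url, "pull")` against a node would list whatever pull-related metric names that containerd build actually exposes; the sketch deliberately does not hard-code any metric names.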

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 26, 2024
@bitoku

bitoku commented Jun 8, 2024

/assign

@pacoxu
Member

pacoxu commented Jul 1, 2024

@haircommander
I updated the issue description according to the KEP and PRs. Please check whether I have misunderstood the current status of this feature.

@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 31, 2024
@HirazawaUi
Contributor

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jul 31, 2024
@kannon92
Contributor

kannon92 commented Aug 20, 2024

@haircommander Can you give an update on your plans for 1.32? Do you want this to be included in KEP feature planning for 1.32?

@haircommander
Contributor Author

I'd love to make progress in 1.32. it in part relies on containerd support but maybe we can decouple those. let's plan on it for 1.32

@HirazawaUi
Contributor

HirazawaUi commented Aug 26, 2024

@haircommander I'm curious whether we need to ensure consistent behavior when enabling or disabling the feature gate. If it's not necessary, then we can push kubernetes/kubernetes#126006 forward.

@akhilerm
Member

> I'd love to make progress in 1.32. it in part relies on containerd support but maybe we can decouple those. let's plan on it for 1.32

@haircommander Is ListPodSandboxMetrics the only pending piece from containerd? I should be able to get that in by 1.32.

@haircommander
Contributor Author

my hero! let's try to push this to beta in 1.32 then

@haircommander haircommander moved this from Proposed for consideration to Triage in SIG Node 1.32 KEPs planning Sep 17, 2024
@pacoxu
Member

pacoxu commented Sep 23, 2024

Xref WIP KEP #4792 which may be related.

@zvonkok

zvonkok commented Dec 16, 2024

@haircommander Is this still on your agenda? What is the plan here?

@haircommander
Contributor Author

yup mostly waiting on containerd/containerd#10691 which @akhilerm is working on. Once containerd has an implementation we will push this to beta

@zvonkok

zvonkok commented Dec 20, 2024

@haircommander The original KEP says under Goals: "Enhance CRI implementations to provide metrics analogous to the existing metrics provided by /metrics/cadvisor.". Comparing https://github.com/google/cadvisor/blob/master/docs/storage/prometheus.md and https://github.com/containerd/containerd/pull/10691/files shows me we're missing some metrics in the PR. Do we have a reference list of which metrics need to be available to move this forward?
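
One way to make that comparison concrete, sketched under assumptions: collect the metric names from each source into a list and take the set difference. The names below are well-known cAdvisor metric names used purely as sample input; how they are split between the two lists is invented for illustration and says nothing about what the containerd PR actually covers.

```python
def missing_metrics(reference, implemented):
    """Return the metric names in `reference` that `implemented` lacks, sorted."""
    return sorted(set(reference) - set(implemented))


# Illustrative input only: the split between the lists is made up.
cadvisor_names = [
    "container_cpu_usage_seconds_total",
    "container_memory_working_set_bytes",
    "container_fs_usage_bytes",
]
cri_names = [
    "container_cpu_usage_seconds_total",
    "container_memory_working_set_bytes",
]

for name in missing_metrics(cadvisor_names, cri_names):
    print("missing:", name)
```

Run against real name lists from both sources, the output could serve as a starting point for the reference list asked about above.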
