Refactor the way resources are injected into the reconcilers #3305

pierretasci · 2020-09-30T04:03:54Z

Feature request

The current assumption in Tekton is that most of the things it needs to look up to reconcile a TaskRun or PipelineRun exist within the cluster. Until now, this was a fair assumption, and it makes the reconcile loop rather straightforward.

Going forward, with the introduction of Tekton Bundles and, soon, other forms of external dependencies, there is a missing concept in the Tekton controllers: that of "pre-work". That is, reconciliation should not block, but reaching out to external services like an image registry is expensive and error prone. For future expansion, we should add another step in the process, between when the object is created and when it starts running, to fetch and prepare any "pre-work". This will likely require a redesign of how the controllers reconcile.

vdemeester · 2020-10-08T09:33:24Z

/assign

pritidesai · 2020-11-04T19:20:24Z

Hey @vdemeester, I am moving this out of 0.18 (to 0.19) as per our discussion in the General WG: "moving bundle behind a feature flag given that it's still in alpha".

(see PR #3492 for more discussion)

ghost · 2021-01-25T16:40:22Z

Pushing this one back to 0.22 since it sounds like there are a number of connected pieces that need to be pulled together to get this into good shape.

ghost · 2021-03-23T16:38:20Z

Pushing this one back again to 0.24.

ghost · 2021-05-18T16:37:45Z

We could spend some effort taking some measurements in our dogfooding clusters to see how a change here could have an impact before we commit to refactoring?

@pierretasci do you have an idea of how you'd like to proceed on this one? Is it still worth keeping in the milestones?

pierretasci · 2021-05-18T17:54:04Z

I think this is going to need to happen for remote resource resolution anyways, so perhaps we refocus this on what is needed to get that to work

ghost · 2021-05-21T12:35:31Z

Makes sense, I will remove this from milestones for now on the basis that remote resolution is still in the design stages.

ghost · 2021-07-16T21:04:00Z

/assign

I've started working on this in a branch. The way I've structured it at the moment, I've introduced a new reconciler whose sole responsibility is resolving. Once the specs have been resolved into the status field of the taskrun / pipelinerun then the "original" reconcilers take over again and actuate as normal.
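To make the split described above concrete, here is a minimal sketch of the two-reconciler structure. It is not the code from the branch: `TaskRun`, `TaskSpec`, `RunStatus`, `ResolveReconciler`, `RunReconciler`, and the `Lookup` function are all simplified, hypothetical stand-ins for the real Tekton types.

```go
package main

import "errors"

// TaskSpec, RunStatus and TaskRun are simplified, hypothetical stand-ins
// for the real Tekton API types; only the fields this sketch needs exist.
type TaskSpec struct {
	Steps []string
}

type RunStatus struct {
	ResolvedSpec *TaskSpec // nil until resolution has finished
	Started      bool
}

type TaskRun struct {
	TaskRef string // name of the Task to resolve
	Status  RunStatus
}

// ErrNotResolved signals that the run is still waiting on resolution.
var ErrNotResolved = errors.New("taskrun spec not resolved yet")

// ResolveReconciler only resolves: it looks up the referenced spec and
// records it in the run's status. It never starts any work.
type ResolveReconciler struct {
	Lookup func(ref string) (*TaskSpec, error) // hypothetical fetch function
}

func (r *ResolveReconciler) Reconcile(tr *TaskRun) error {
	if tr.Status.ResolvedSpec != nil {
		return nil // already resolved; reconciling again is a no-op
	}
	spec, err := r.Lookup(tr.TaskRef)
	if err != nil {
		return err
	}
	tr.Status.ResolvedSpec = spec
	return nil
}

// RunReconciler plays the role of the "original" reconciler: it actuates
// only once the resolved spec is present in status.
type RunReconciler struct{}

func (r *RunReconciler) Reconcile(tr *TaskRun) error {
	if tr.Status.ResolvedSpec == nil {
		return ErrNotResolved
	}
	tr.Status.Started = true
	return nil
}
```

In this sketch the hand-off between the two reconcilers is just the presence of `Status.ResolvedSpec`, which also means both reconcilers write to the same status.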

ghost · 2021-07-19T14:52:14Z

One interesting wrinkle during this refactor: our resolution logic is pretty directly tied to the way we update annotations and labels on taskruns/pipelineruns. As soon as we've resolved the task or pipeline we copy the labels and annotations over to the run. So at the moment the "Resolver Reconciler" I've introduced is responsible not only for resolving resources but also copying those annotations/labels over too.

At some point we'll probably want to change this. I think ideally the resolution layer shouldn't also be responsible for deciding which labels and annotations are copied to runs.
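One way to picture that separation is to keep the copy decision in a small policy function that the resolution layer does not own. The sketch below is hypothetical: `CopyPolicy` and `Propagate` are not Tekton names, and the rule that the run's existing values win on conflict is an assumption made for illustration.

```go
package main

// CopyPolicy decides whether a key from the resolved definition should
// be copied onto the run. Hypothetical; not a Tekton type.
type CopyPolicy func(key string) bool

// Propagate returns a new map holding the run's entries plus every
// definition entry the policy allows. Assumption for this sketch: an
// entry already set on the run wins over the definition's value.
func Propagate(run, def map[string]string, allow CopyPolicy) map[string]string {
	out := make(map[string]string, len(run)+len(def))
	for k, v := range def {
		if allow(k) {
			out[k] = v
		}
	}
	for k, v := range run {
		out[k] = v // run values overwrite anything copied above
	}
	return out
}
```

With something like this, the resolver would only hand back the resolved object and its metadata, and whichever component owns the policy would call `Propagate` for labels and again for annotations.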

mattmoor · 2021-09-05T22:29:29Z

> That is, reconciliation should not block but reaching out to external services like an image registry is expensive and error prone

I thought I'd chime in here, since I noticed an experiment that's got multiple reconcilers partying on *Run status, which is kind of an anti-pattern.

So we have a number of places we've dealt with problems very similar to this in Knative with a fair degree of success. Two that are top-of-mind for me are:

- probing the dataplane for kingress readiness,
- tag to digest resolution.

Both of these expose idempotent interfaces to the core reconciler, use a workqueue for the offloaded work, and have a callback that enqueues the original resource when the job is done. This pattern has served us very well, and I'd definitely recommend it as a way of offloading work from the main reconciler.

That said, I think the main thing I'd change with these is to actually use a child resource as the API, and leverage a proper reconciler to manage the workqueuing (and OwnerRefs to handle queuing the parent on completion).
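For readers unfamiliar with the pattern, a rough sketch of its shape follows: an idempotent, non-blocking call for the reconciler, a queue drained by a worker, and a callback that re-enqueues the owning resource when the result is ready. This is not the Knative code. `Offloader`, `fetch`, and `enqueue` are hypothetical names, a buffered channel stands in for a real workqueue (such as the one in `k8s.io/client-go/util/workqueue`), and error handling and retries are left out.

```go
package main

import "sync"

// Offloader moves slow lookups off the reconciler's path. Hypothetical
// type for illustration only.
type Offloader struct {
	mu      sync.Mutex
	results map[string]string   // finished lookups, keyed by reference
	waiting map[string][]string // owners to notify, keyed by in-flight reference
	work    chan string         // stand-in for a real workqueue
	fetch   func(ref string) string
	enqueue func(owner string)
}

func NewOffloader(fetch func(ref string) string, enqueue func(owner string)) *Offloader {
	o := &Offloader{
		results: map[string]string{},
		waiting: map[string][]string{},
		work:    make(chan string, 16),
		fetch:   fetch,
		enqueue: enqueue,
	}
	go o.worker()
	return o
}

// Resolve is idempotent: it returns the cached result if there is one,
// and otherwise registers the owner and schedules the fetch at most once
// per reference. It does not wait for the fetch itself.
func (o *Offloader) Resolve(ref, owner string) (string, bool) {
	o.mu.Lock()
	if v, ok := o.results[ref]; ok {
		o.mu.Unlock()
		return v, true
	}
	_, inflight := o.waiting[ref]
	o.waiting[ref] = append(o.waiting[ref], owner)
	o.mu.Unlock()
	if !inflight {
		o.work <- ref // sent outside the lock so the worker can make progress
	}
	return "", false
}

// worker performs the slow fetch, stores the result, and then calls
// back so every waiting owner is reconciled again.
func (o *Offloader) worker() {
	for ref := range o.work {
		val := o.fetch(ref)
		o.mu.Lock()
		o.results[ref] = val
		owners := o.waiting[ref]
		delete(o.waiting, ref)
		o.mu.Unlock()
		for _, owner := range owners {
			o.enqueue(owner)
		}
	}
}
```

A reconciler using this would call `Resolve` on every pass and simply return when the second value is false, relying on the callback to trigger the next pass. The child-resource variant suggested above would replace the in-memory maps with a real object and its own reconciler.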

It is notable that this is effectively the direction that y'all are heading, except instead of reconciling a child resource you are splitting the reconciliation of a single resource across two reconcilers (which I would strongly discourage).

I'd be happy to jump on a call to discuss this more.

ghost · 2021-09-13T13:22:32Z

This sounds like the Tekton Resource Request CRD alternative captured in the resolution TEP. That proposal introduces a new resource type (child resource?) and reconciler as you describe. Good to know this is a preferred approach.

tekton-robot · 2021-12-12T13:47:28Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale with a justification.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close with a justification.
If this issue should be exempted, mark the issue as frozen with /lifecycle frozen with a justification.

/lifecycle stale

Send feedback to tektoncd/plumbing.

ghost · 2021-12-13T12:14:44Z

/lifecycle frozen

ghost · 2021-12-13T12:15:26Z

/remove-lifecycle stale

justification: this may still happen if TEP-0060 gets approved and implemented.

ghost · 2022-03-08T13:05:26Z

In #4596 we're adding support for pipelineRefs via remote resolution. This makes the resolution bit non-blocking to a PipelineRun's reconcile. However, the code still maintains the PipelineRun's existing resolution mechanics alongside the new stuff and could still use a refactor, perhaps if/when remote resolution moves out of alpha. One option would be to move every implementation, including Tekton's built-in one, behind the remote.Resolver interface that bundles introduced, so that Pipeline's reconcilers could just speak to that.
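As a rough illustration of that last option, the sketch below puts an in-cluster lookup and a bundle lookup behind one interface so that the calling code does not care which one it is given. The `Resolver` interface here is deliberately simplified and hypothetical; it is not the actual signature of Tekton's `remote.Resolver`, and `clusterResolver`, `bundleResolver`, and `resolvePipeline` are made-up names.

```go
package main

import "fmt"

// Resolver is a simplified, hypothetical interface for this sketch. It
// does not reproduce the real remote.Resolver signature.
type Resolver interface {
	Get(kind, name string) (string, error)
}

// clusterResolver stands in for the built-in, in-cluster lookup.
type clusterResolver struct {
	objects map[string]string
}

func (c clusterResolver) Get(kind, name string) (string, error) {
	if v, ok := c.objects[kind+"/"+name]; ok {
		return v, nil
	}
	return "", fmt.Errorf("%s %q not found in cluster", kind, name)
}

// bundleResolver stands in for a lookup against a bundle image.
type bundleResolver struct {
	image    string
	contents map[string]string
}

func (b bundleResolver) Get(kind, name string) (string, error) {
	if v, ok := b.contents[kind+"/"+name]; ok {
		return v, nil
	}
	return "", fmt.Errorf("%s %q not found in bundle %s", kind, name, b.image)
}

// resolvePipeline is what a reconciler would call: it only speaks to
// the interface and never branches on where the definition lives.
func resolvePipeline(r Resolver, name string) (string, error) {
	return r.Get("pipeline", name)
}
```

The design point is that the choice of implementation happens once, when the resolver is constructed, rather than inside the reconcile loop.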

pierretasci added the kind/feature Categorizes issue or PR as related to a new feature. label Sep 30, 2020

pierretasci mentioned this issue Sep 30, 2020

Introduces Tekton bundles: take 2 #3142

Merged

4 tasks

vdemeester added this to the Pipelines v0.18 milestone Oct 8, 2020

tekton-robot assigned vdemeester Oct 8, 2020

pritidesai modified the milestones: Pipelines v0.18, Pipelines v0.19 Nov 4, 2020

vdemeester modified the milestones: Pipelines v0.19, Pipelines v0.20 Nov 30, 2020

dibyom modified the milestones: Pipelines v0.20, Pipelines 0.21 Jan 11, 2021

ghost modified the milestones: Pipelines 0.21, Pipelines 0.22 Jan 25, 2021

dibyom modified the milestones: Pipelines 0.22, Pipelines 0.23 Mar 9, 2021

ghost modified the milestones: Pipelines 0.23, Pipelines 0.24 Mar 23, 2021

dibyom modified the milestones: Pipelines 0.24, Pipelines v0.25 May 4, 2021

vdemeester mentioned this issue May 6, 2021

Run vs Definition "race" and "source of truth": PipelineRun errors out after Pipeline changed #3916

Closed

ghost mentioned this issue May 18, 2021

Measurements of resource injection overheads tektoncd/plumbing#840

Closed

ghost removed this from the Pipelines v0.25 milestone May 21, 2021

tekton-robot assigned ghost Jul 16, 2021

ghost mentioned this issue Jul 20, 2021

WIP Refactor ref resolution #4108

Closed

5 tasks

ghost mentioned this issue Aug 17, 2021

Add controller flag to turn off built-in resolution #4168

Merged

3 tasks

tekton-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 12, 2021

tekton-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 13, 2021

ghost mentioned this issue Mar 8, 2022

Add pipelineRef remote resolution #4596

Merged

5 tasks

vdemeester removed their assignment Apr 6, 2022

xchapter7x added this to Tekton Community Roadmap Sep 20, 2022

xchapter7x moved this to Todo in Tekton Community Roadmap Sep 20, 2022
