Get simple PipelineRun implementation working #128

bobcatfish · 2018-10-11T16:55:28Z

Sorry this is a big one, with even more refactoring 😅 The net effect is that when we create a PipelineRun, it actually creates and executes the required TaskRuns! 🎉

I tried to break it up into separate commits, so it might be easier to review that way:

Combine logic to grab Tasks and their Runs for a PipelineRun

It turns out that we need to look at TaskRuns for a few reasons, including 1) figuring out what to run next and 2) determining the status of the PipelineRun, so I've refactored the logic that grabs these to collect a bunch of related state that can be reused. When the graph becomes more sophisticated, we will need to make this structure more than just a list.

Check status of TaskRuns when finding TaskRun to start

Added logic to check statuses of other TaskRuns when deciding if a new one should be started, for #61 (Implement simple PipelineRun).

Add condition status to PipelineRun

PipelineRun status will be based on the condition of the TaskRuns which it has created, for #61 (Implement simple PipelineRun). If any TaskRuns have failed, the PipelineRun has failed. If all are successful, it is successful. If any are in progress, it is in progress. This assumes a linear Pipeline; we will have to tweak it a bit when we implement the graph (for #65, Pipeline uses passedConstraints to provide correct inputs and outputs).

Create TaskRun from PipelineRun that runs a Task

Added the Task reference to the TaskRun so that when a PipelineRun creates a TaskRun, it actually executes! (For #61, Implement simple PipelineRun.)

While running the integration test, I noticed that the PipelineRuns weren't getting reconciled quickly enough. Adding a tracker which invokes reconcile when the created TaskRuns are updated fixed this. However, it still took > 1 minute to create 3 helloworld TaskRuns and wait for them to complete, so since 3 was arbitrary, I reduced it to 2. Also cleaned up the TaskRun controller a bit: it now uses the Logger object on the controller/reconciler itself, and the log messages are a bit more descriptive.
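The status rule and the "what to run next" check described above can be sketched in a few lines of Go. This is only an illustration under stated assumptions, not the code in this PR: `Status`, `TaskState`, `PipelineRunStatus` and `NextTaskToStart` are hypothetical names, the Kubernetes condition is reduced to a plain string, and the Pipeline is assumed to be linear.

```go
package main

import "fmt"

// Status is a simplified stand-in for a Kubernetes-style condition status:
// Unknown while in progress, True on success, False on failure.
type Status string

const (
	StatusUnknown Status = "Unknown"
	StatusTrue    Status = "True"
	StatusFalse   Status = "False"
)

// TaskState pairs one Pipeline task with the status of its TaskRun.
// The type and field names are hypothetical, chosen for this sketch.
type TaskState struct {
	TaskName string
	// RunStatus is nil when no TaskRun has been created for the task yet.
	RunStatus *Status
}

// PipelineRunStatus folds the TaskRun states into one status: any failure
// means failed, all successful means successful, anything else is in progress.
// An empty list is vacuously successful in this sketch.
func PipelineRunStatus(states []TaskState) Status {
	allDone := true
	for _, s := range states {
		if s.RunStatus == nil {
			allDone = false
			continue
		}
		switch *s.RunStatus {
		case StatusFalse:
			return StatusFalse
		case StatusTrue:
			// This task is finished; keep looking at the rest.
		default:
			allDone = false
		}
	}
	if allDone {
		return StatusTrue
	}
	return StatusUnknown
}

// NextTaskToStart returns the first task without a TaskRun, but only if every
// task before it has succeeded (linear Pipeline assumption).
func NextTaskToStart(states []TaskState) (name string, ok bool) {
	for _, s := range states {
		if s.RunStatus == nil {
			return s.TaskName, true
		}
		if *s.RunStatus != StatusTrue {
			return "", false
		}
	}
	return "", false
}

func main() {
	done := StatusTrue
	states := []TaskState{
		{TaskName: "helloworld-task-1", RunStatus: &done},
		{TaskName: "helloworld-task-2"},
	}
	fmt.Println(PipelineRunStatus(states))
	fmt.Println(NextTaskToStart(states))
}
```

Keeping the per-task state in one slice lets both questions be answered from the same data, which is the reuse the first commit is after. A graph-shaped Pipeline would need more than a slice.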

Fixes #61

knative-prow-robot · 2018-10-11T16:55:42Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bobcatfish

Needs approval from an approver in each of these files:

~~OWNERS~~ [bobcatfish]


tejal29

Lots of Code and Lots of Test!!! Yey!!!

pkg/apis/pipeline/v1alpha1/pipelinerun_types.go

tejal29 · 2018-10-11T16:58:42Z

pkg/reconciler/v1alpha1/pipelinerun/pipelinerun.go

@@ -91,6 +95,11 @@ func NewController(
 		UpdateFunc: controller.PassNew(impl.Enqueue),
 		DeleteFunc: impl.Enqueue,
 	})
+
+	r.tracker = tracker.New(impl.EnqueueKey, 30*time.Minute)


30 seconds?

tejal29 · 2018-10-11T17:04:10Z

pkg/reconciler/v1alpha1/pipelinerun/pipelinerun.go

 	if err != nil {
-		return fmt.Errorf("error getting next TaskRun to create for PipelineRun %s: %s", pr.Name, err)
+		return fmt.Errorf("error getting Tasks for Pipeline %s, Pipeline may be invalid!: %s", p.Name, err)


s/error getting Tasks for Pipeline/error getting Tasks or TaskRuns for Pipeline/

tejal29 · 2018-10-11T17:07:59Z

pkg/reconciler/v1alpha1/pipelinerun/resources/pipelinestate.go

+type GetTask func(namespace, name string) (*v1alpha1.Task, error)
+
+// GetTaskRun is a function that will retrieve the TaskRun name from namespace.
+type GetTaskRun func(namespace, name string) (*v1alpha1.TaskRun, error)


Not your fault, but this is not rendered properly :) i wonder if its a formatting issue

yeah im not sure why the syntax highlighting is having a problem here 🤔

last time I saw this happen it was b/c @jonjohnsonjr decided to replace a e with an ℯ or something like that

I think the syntax highlighter that GitHub uses has a hard time with func if it isn't followed by { ... }.

ah that could be it, thanks @jonjohnsonjr :D

and seriously that 𝛾 was pretty sweet, gonna be talking about that one for years to come

knative/serving#1460 (comment)

🤣

oh it just keeps getting better, i'm crying

well that was distracting

tejal29 · 2018-10-11T17:15:48Z

pkg/reconciler/v1alpha1/pipelinerun/resources/pipelinestate.go

+		pt := p.Spec.Tasks[i]
+		t, err := getTask(p.Namespace, pt.TaskRef.Name)
+		if err != nil {
+			return nil, fmt.Errorf("failed to get tasks for Pipeline %q: Error getting task %q : %s",


Now, this is a case where we need to check whether the Task does not exist.
If it does not exist, then we should quit reconciling this pipeline, since it's invalid and nothing can be done.
For that to happen, we should return nil from pipelinerun.Reconciler.reconcile.

However, if there was some other error while fetching the task, then we keep trying.
Does that make sense?

I would say we can create an InvalidPipelineError and return that from here when a Task is not found.
In the reconcile method, we should check whether err is an InvalidPipelineError: return nil if it is, and return the error otherwise.

yep that makes sense! okay ill update this so that if we can't find a task, we return nil from the reconcile loop and stop reconciling.

thanks for catching this and the detailed explanation!
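A rough sketch of the suggestion in this thread, assuming a typed error that the reconcile loop can recognise. Everything here is illustrative rather than the code that was merged: `ErrNotFound` and `errUnavailable` stand in for real API errors (a real controller would presumably use the Kubernetes not-found check instead), `GetTask` is simplified to return a name instead of a `*v1alpha1.Task`, and `getTasks` and `reconcile` are hypothetical helpers.

```go
package main

import (
	"errors"
	"fmt"
)

// ErrNotFound stands in for the API server's "not found" error (assumption).
var ErrNotFound = errors.New("not found")

// errUnavailable stands in for any transient failure worth retrying (assumption).
var errUnavailable = errors.New("API server unavailable")

// GetTask has the same shape as the function type in the diff, with the Task
// reduced to its name to keep the sketch self-contained.
type GetTask func(namespace, name string) (string, error)

// InvalidPipelineError marks a Pipeline that can never be reconciled because
// it references a Task that does not exist.
type InvalidPipelineError struct {
	Pipeline string
	Task     string
}

func (e *InvalidPipelineError) Error() string {
	return fmt.Sprintf("Pipeline %q is invalid: Task %q does not exist", e.Pipeline, e.Task)
}

// getTasks fetches every Task, turning "not found" into InvalidPipelineError
// and wrapping every other error unchanged in kind.
func getTasks(getTask GetTask, namespace, pipeline string, names []string) ([]string, error) {
	tasks := make([]string, 0, len(names))
	for _, name := range names {
		t, err := getTask(namespace, name)
		if errors.Is(err, ErrNotFound) {
			return nil, &InvalidPipelineError{Pipeline: pipeline, Task: name}
		}
		if err != nil {
			return nil, fmt.Errorf("error getting Task %q for Pipeline %q: %w", name, pipeline, err)
		}
		tasks = append(tasks, t)
	}
	return tasks, nil
}

// reconcile returns nil for an invalid Pipeline so the key is not requeued,
// and returns any other error so the caller retries.
func reconcile(getTask GetTask, namespace, pipeline string, names []string) error {
	_, err := getTasks(getTask, namespace, pipeline, names)
	var invalid *InvalidPipelineError
	if errors.As(err, &invalid) {
		fmt.Println("giving up:", invalid)
		return nil
	}
	return err
}

func main() {
	missing := func(namespace, name string) (string, error) { return "", ErrNotFound }
	flaky := func(namespace, name string) (string, error) { return "", errUnavailable }

	fmt.Println("missing Task:", reconcile(missing, "default", "helloworld-pipeline", []string{"helloworld-task-1"}))
	fmt.Println("transient error:", reconcile(flaky, "default", "helloworld-pipeline", []string{"helloworld-task-1"}))
}
```

The typed error keeps the decision about retrying in one place: lookup code only classifies the failure, and the reconcile loop alone decides whether it is permanent.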

bobcatfish · 2018-10-11T18:55:14Z

test/pipelinerun_test.go

+	logger.Infof("Making sure the expected TaskRuns were created")
+	expectedTaskRuns := []string{
+		hwPipelineName + hwPipelineTaskName1,
+		hwPipelineName + hwPipelineTaskName2,


whoops somehow forgot to include a fix for this

bobcatfish · 2018-10-11T23:53:47Z

Ready for another look @tejal29 !

nader-ziada · 2018-10-12T13:37:58Z

pkg/reconciler/v1alpha1/pipelinerun/pipelinerun.go

@@ -91,6 +95,11 @@ func NewController(
 		UpdateFunc: controller.PassNew(impl.Enqueue),
 		DeleteFunc: impl.Enqueue,
 	})
+
+	r.tracker = tracker.New(impl.EnqueueKey, 30*time.Second)


I added a better way to get the tracker lease as a function of the resync period in #120 which I can update here once this is merged, no point in both of us making the same change

nice!! thanks @pivotal-nader-ziada :D if you merge first ill pickup your change, ill leave this at 30 min for now then
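For readers unfamiliar with the tracker being discussed, the idea can be sketched with only the standard library: a PipelineRun registers interest in a TaskRun for a lease period, and a change to the TaskRun re-enqueues the PipelineRun while the lease is live. This is not the knative tracker package; `Tracker`, `NewTracker`, `Track` and `OnChanged` below are hypothetical stand-ins keyed by plain strings.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// Tracker remembers which parent keys should be re-enqueued when a child
// object changes. Each registration expires after the lease duration.
type Tracker struct {
	mu      sync.Mutex
	lease   time.Duration
	enqueue func(parentKey string)
	// expiry maps child key -> parent key -> time the lease runs out.
	expiry map[string]map[string]time.Time
}

// NewTracker builds a Tracker that calls enqueue for live registrations.
func NewTracker(enqueue func(parentKey string), lease time.Duration) *Tracker {
	return &Tracker{
		lease:   lease,
		enqueue: enqueue,
		expiry:  make(map[string]map[string]time.Time),
	}
}

// Track registers (or renews) the parent's interest in the child.
func (t *Tracker) Track(childKey, parentKey string) {
	t.mu.Lock()
	defer t.mu.Unlock()
	parents, ok := t.expiry[childKey]
	if !ok {
		parents = make(map[string]time.Time)
		t.expiry[childKey] = parents
	}
	parents[parentKey] = time.Now().Add(t.lease)
}

// OnChanged enqueues every parent with a live lease on the child and drops
// registrations whose lease has run out.
func (t *Tracker) OnChanged(childKey string) {
	now := time.Now()
	var live []string
	t.mu.Lock()
	for parent, until := range t.expiry[childKey] {
		if now.Before(until) {
			live = append(live, parent)
		} else {
			delete(t.expiry[childKey], parent)
		}
	}
	t.mu.Unlock()
	// Call back outside the lock so enqueue may itself use the tracker.
	for _, parent := range live {
		t.enqueue(parent)
	}
}

func main() {
	var queue []string
	t := NewTracker(func(key string) { queue = append(queue, key) }, 30*time.Minute)

	// The PipelineRun reconciler would call Track each time it reconciles.
	t.Track("default/helloworld-pipelinerun-helloworld-task-1", "default/helloworld-pipelinerun")

	// The TaskRun informer's update handler would call OnChanged.
	t.OnChanged("default/helloworld-pipelinerun-helloworld-task-1")
	fmt.Println(queue)
}
```

Because each reconcile renews the registration, the lease only has to outlive the gap between reconciles, which is presumably why tying it to the resync period is attractive.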

bobcatfish · 2018-10-12T15:06:54Z

oh dear

I1012 01:29:35.122] default                  8s          8s           1         pvc-284c1005-cdbe-11e8-a9ba-42010a800231.155cb8422d54da43                         PersistentVolume                                                Normal    VolumeDelete              persistentvolume-controller                                           googleapi: Error 400: The disk resource 'projects/knative-boskos-10/zones/us-central1-a/disks/gke-kbuild-pipeline-e2-pvc-284c1005-cdbe-11e8-a9ba-42010a800231' is already being used by 'projects/knative-boskos-10/zones/us-central1-a/instances/gke-kbuild-pipeline-e2e--default-pool-a626184b-8p1m', resourceInUseByAnotherResource

bobcatfish · 2018-10-12T15:33:13Z

oh i guess that volume error is about deletion, maybe a red herring 🤔

bobcatfish · 2018-10-12T17:46:03Z

I1012 01:29:21.113] --- FAIL: TestPipelineRun (240.28s)
I1012 01:29:21.113] 	pipelinerun_test.go:63: Error waiting for PipelineRun helloworld-run to finish: timed out waiting for the condition
I1012 01:29:21.113] 	pipelinerun_test.go:77: Expected TaskRun helloworld-pipelinerun-helloworld-task-1 to have succeeded but Status is Unknown
I1012 01:29:21.113] 	pipelinerun_test.go:73: Couldn't get expected TaskRun helloworld-pipelinerun-helloworld-task-2: taskruns.pipeline.knative.dev "helloworld-pipelinerun-helloworld-task-2" not found
I1012 01:29:21.114] 	crd.go:237: Error waiting for Pod helloworld-validation-busybox to finish: timed out waiting for the condition

yyyyyyyyyyyyyyyyyyyyyyyy

tejal29 · 2018-10-12T19:02:52Z

/lgtm

bobcatfish · 2018-10-12T22:22:02Z

oh man i just keep making these tests better and better XD

I1012 16:40:42.177] --- FAIL: TestTaskRun (0.06s)
I1012 16:40:42.178] panic: runtime error: invalid memory address or nil pointer dereference [recovered]
I1012 16:40:42.178] 	panic: runtime error: invalid memory address or nil pointer dereference
I1012 16:40:42.178] [signal SIGSEGV: segmentation violation code=0x1 addr=0x18 pc=0x152ab9b]
I1012 16:40:42.178] 
I1012 16:40:42.178] goroutine 166 [running]:
I1012 16:40:42.178] testing.tRunner.func1(0xc4200ac2d0)
I1012 16:40:42.178] 	/usr/local/go/src/testing/testing.go:742 +0x567
I1012 16:40:42.178] panic(0x163b020, 0x20b1ce0)
I1012 16:40:42.178] 	/usr/local/go/src/runtime/panic.go:502 +0x24a
I1012 16:40:42.178] github.com/knative/build-pipeline/test.TestTaskRun.func2(0xc4202886c0, 0x17789c3, 0xe, 0x0)
I1012 16:40:42.179] 	/go/src/github.com/knative/build-pipeline/test/taskrun_test.go:56 +0x7b
I1012 16:40:42.179] github.com/knative/build-pipeline/test.WaitForTaskRunState.func1(0x94e27f, 0x1, 0xc4205a6800)
I1012 16:40:42.179] 	/go/src/github.com/knative/build-pipeline/test/crd_checks.go:54 +0x152
I1012 16:40:42.179] github.com/knative/build-pipeline/vendor/k8s.io/apimachinery/pkg/util/wait.pollImmediateInternal(0xc4205a6800, 0xc4202262d0, 0xc4205a6800, 0x16af140)
I1012 16:40:42.179] 	/go/src/github.com/knative/build-pipeline/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:245 +0x39
I1012 16:40:42.179] github.com/knative/build-pipeline/vendor/k8s.io/apimachinery/pkg/util/wait.PollImmediate(0x3b9aca00, 0x1bf08eb000, 0xc4202262d0, 0x31, 0x0)
I1012 16:40:42.179] 	/go/src/github.com/knative/build-pipeline/vendor/k8s.io/apimachinery/pkg/util/wait/wait.go:241 +0x5f
I1012 16:40:42.180] github.com/knative/build-pipeline/test.WaitForTaskRunState(0xc42011f260, 0x17789c3, 0xe, 0x17ed5a0, 0x177881f, 0xe, 0x0, 0x0)
I1012 16:40:42.180] 	/go/src/github.com/knative/build-pipeline/test/crd_checks.go:49 +0x320
I1012 16:40:42.180] github.com/knative/build-pipeline/test.TestTaskRun(0xc4200ac2d0)
I1012 16:40:42.180] 	/go/src/github.com/knative/build-pipeline/test/taskrun_test.go:54 +0xf37
I1012 16:40:42.180] testing.tRunner(0xc4200ac2d0, 0x17ed5a8)
I1012 16:40:42.180] 	/usr/local/go/src/testing/testing.go:777 +0x16e
I1012 16:40:42.180] created by testing.(*T).Run
I1012 16:40:42.180] 	/usr/local/go/src/testing/testing.go:824 +0x565
I1012 16:40:42.181] FAIL	github.com/knative/build-pipeline/test	118.710s

tejal29 · 2018-10-12T22:56:31Z

/lgtm

bobcatfish · 2018-10-12T23:08:48Z

😩

tejal29 · 2018-10-12T23:11:44Z

/lgtm

bobcatfish · 2018-10-12T23:16:32Z

🤞🤞🤞

🤞🤞🤞

Something has gone wrong with one of the integration tests on my PR and I don't know what, so I'm trying to add more info. Added Builds to the dumped CRDs, and also moved the step that deploys the examples to after the integration tests b/c it produces a lot of errors in the logs (hahaha...) and makes it harder to debug integration test failures.

I think it's reasonable for only one of our eventually many integration tests to verify the build output, especially when it involves adding a volume mount to the pile of things that could go wrong in the test. Refactored the test a bit, so we don't assert inside the test, and we output some logs before polling. Removed dumping of CRDs in test script b/c each test runs in its own namespace and cleans up after itself, so there is never anything to dump (see tektoncd#145). Updated condition checking so that if the Run fails, we bail immediately instead of continuing to hope it will succeed.
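The "bail immediately" behaviour can be sketched as a polling loop whose condition function returns an error on failure. The stack trace earlier in this thread shows the tests polling via `wait.PollImmediate`, where a non-nil error from the condition function ends the poll; the version below is a standard-library stand-in. `Run`, `Condition`, `runDone` and `pollUntil` are hypothetical names, and the nil check is an assumption about the kind of guard such a callback needs while the controller has not yet set a condition.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// Condition is a simplified status condition: "True", "False" or "Unknown".
type Condition struct {
	Status string
}

// Run is a stand-in for a TaskRun or PipelineRun as seen by a test.
type Run struct {
	Name string
	// Succeeded stays nil until the controller first reports a condition.
	Succeeded *Condition
}

// runDone reports whether polling can stop. A missing condition means
// "not yet", and a failed Run is returned as an error so the caller stops
// waiting instead of polling until the timeout.
func runDone(r *Run) (bool, error) {
	if r == nil || r.Succeeded == nil {
		return false, nil
	}
	switch r.Succeeded.Status {
	case "True":
		return true, nil
	case "False":
		return false, fmt.Errorf("run %s failed", r.Name)
	default:
		return false, nil
	}
}

var errTimeout = errors.New("timed out waiting for the condition")

// pollUntil checks immediately, then once per interval until the Run
// finishes, the condition function errors, or the timeout passes.
func pollUntil(interval, timeout time.Duration, get func() *Run) error {
	deadline := time.Now().Add(timeout)
	for {
		done, err := runDone(get())
		if err != nil {
			return err
		}
		if done {
			return nil
		}
		if time.Now().After(deadline) {
			return errTimeout
		}
		time.Sleep(interval)
	}
}

func main() {
	// Simulated observations: no condition yet, in progress, then failed.
	states := []*Run{
		{Name: "helloworld-run"},
		{Name: "helloworld-run", Succeeded: &Condition{Status: "Unknown"}},
		{Name: "helloworld-run", Succeeded: &Condition{Status: "False"}},
	}
	i := 0
	get := func() *Run {
		r := states[i]
		if i < len(states)-1 {
			i++
		}
		return r
	}
	fmt.Println("poll result:", pollUntil(time.Millisecond, time.Second, get))
}
```

Returning the failure from the condition function, rather than asserting inside it, keeps the assertion in the test body and lets the poll report why it stopped.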

bobcatfish · 2018-10-12T23:35:19Z

😩

bobcatfish · 2018-10-13T00:05:32Z

/lgtm

WHAT ITS WORTH A TRY

knative-prow-robot · 2018-10-13T00:05:33Z

@bobcatfish: you cannot LGTM your own PR.

In response to this:

/lgtm

WHAT ITS WORTH A TRY

bobcatfish · 2018-10-13T00:06:23Z

nader-ziada · 2018-10-13T00:07:23Z

/lgtm

knative-prow-robot added the size/XL label (denotes a PR that changes 500-999 lines, ignoring generated files) Oct 11, 2018

knative-prow-robot requested review from imjasonh and nader-ziada October 11, 2018 16:55

knative-prow-robot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) Oct 11, 2018

tejal29 reviewed Oct 11, 2018

tejal29 added this to the Mid October Demo milestone Oct 11, 2018

bobcatfish commented Oct 11, 2018

bobcatfish force-pushed the pipelinerun_status_status branch 2 times, most recently from 5a02ce8 to 3283605 Compare October 11, 2018 20:31

bobcatfish mentioned this pull request Oct 11, 2018

Implement working kaniko build Task #62 #120

Merged

knative-prow-robot added the size/XXL label (denotes a PR that changes 1000+ lines, ignoring generated files) and removed the size/XL label Oct 11, 2018

bobcatfish force-pushed the pipelinerun_status_status branch from ab1dc80 to 33036c9 Compare October 11, 2018 23:53

nader-ziada reviewed Oct 12, 2018

bobcatfish removed this from the Mid October Demo milestone Oct 12, 2018

bobcatfish force-pushed the pipelinerun_status_status branch from 43bbb0a to bb6e0f8 Compare October 12, 2018 16:32

knative-prow-robot assigned tejal29 Oct 12, 2018

knative-prow-robot added the lgtm label (indicates that a PR is ready to be merged) Oct 12, 2018

bobcatfish added 5 commits October 12, 2018 15:34

Check status of TaskRuns when finding TaskRun to start

e7ad309

Added logic to check statuses of other TaskRuns when deciding if a new one should be started for #61

Stop reconciling invalid PipelineRun

16c56d1

If a PipelineRun references a Pipeline that uses Tasks which don't exist, we should immediately stop trying to Reconcile it. To fix this, the user/trigger should create a new PipelineRun after creating the Tasks needed.

bobcatfish force-pushed the pipelinerun_status_status branch from bb6e0f8 to 87d6926 Compare October 12, 2018 22:51

knative-prow-robot removed the lgtm label Oct 12, 2018

knative-prow-robot added the lgtm label Oct 12, 2018

bobcatfish force-pushed the pipelinerun_status_status branch from 87d6926 to 3a4cad3 Compare October 12, 2018 23:10

knative-prow-robot removed the lgtm label Oct 12, 2018

knative-prow-robot added the lgtm label Oct 12, 2018

bobcatfish added 2 commits October 12, 2018 16:34

bobcatfish force-pushed the pipelinerun_status_status branch from 3a4cad3 to 360d282 Compare October 12, 2018 23:35

knative-prow-robot removed the lgtm label Oct 12, 2018

knative-prow-robot assigned nader-ziada Oct 13, 2018

knative-prow-robot added the lgtm label Oct 13, 2018

knative-prow-robot merged commit 162c2f5 into tektoncd:master Oct 13, 2018

mchmarny unassigned tejal29 and nader-ziada Mar 7, 2019
