Affinity assistant deadlock during node maintenance #6586

Closed
lbernick opened this issue Apr 26, 2023 · 1 comment · Fixed by #6596
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@lbernick
Member

lbernick commented Apr 26, 2023

Expected Behavior

If a node is cordoned (marked as unschedulable) for maintenance, any PipelineRuns with TaskRuns running on that node should run to completion.

(Out of scope: nodes going down or pods being evicted.)

Actual Behavior

This situation can result in deadlock when the affinity assistant is enabled. Subsequent TaskRun pods have affinity for the placeholder pod, which is on an unschedulable node. These pods cannot be scheduled and do not trigger cluster autoscaler scale-up, so they remain pending until the TaskRuns time out. (Reported by @skaegi and @pritidesai.)

Related: #4699
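
For context, the deadlock comes from the required pod affinity that the affinity assistant injects into each TaskRun pod, pointing at its placeholder pod. The injected term looks roughly like the sketch below (written from memory, so the exact label keys and values may differ); because the placeholder sits on the cordoned node, no schedulable node can satisfy it:

affinity:
  podAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
    - labelSelector:
        matchLabels:
          app.kubernetes.io/component: affinity-assistant
          app.kubernetes.io/instance: affinity-assistant-6d8794b076
      topologyKey: kubernetes.io/hostname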

With the affinity assistant disabled, this is not a problem: you can cordon a node and wait for the existing TaskRuns to finish before deleting any pods, and the cluster autoscaler will then trigger a scale-up, creating a new node that matches the node affinity terms of the original PV. Subsequent TaskRun pods are scheduled on the new node and the PipelineRun completes successfully.
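
Roughly, the maintenance flow that works in that case looks like this (node name is illustrative):

$ kubectl cordon <node-name>
$ kubectl get pods --field-selector spec.nodeName=<node-name>   # wait until the TaskRun pods on this node have completed
# subsequent TaskRun pods are then scheduled onto another (possibly newly scaled-up) node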

Steps to Reproduce the Problem

  1. Enable the affinity assistant (a sketch of the relevant feature flag follows):
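If I have the flag name right, the affinity assistant is governed by the disable-affinity-assistant entry in the feature-flags ConfigMap and is enabled when that value is "false"; something like the following should turn it on, assuming the default tekton-pipelines namespace:
$ kubectl patch configmap feature-flags -n tekton-pipelines \
    -p '{"data":{"disable-affinity-assistant":"false"}}'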
  2. Create the following PipelineRun (sequential tasks sharing the same PVC workspace):
apiVersion: tekton.dev/v1
kind: PipelineRun
metadata:
  generateName: good-morning-run-
spec:
  workspaces:
  - name: source
    volumeClaimTemplate:
      spec:
        accessModes:
          - ReadWriteOnce
        resources:
          requests:
            storage: 10Mi
  pipelineSpec:
    workspaces:
    - name: source
    tasks:
    - name: first
      taskSpec:
        workspaces:
        - name: source
        steps:
        - image: busybox
          script: |
            echo $(workspaces.source.path)
            sleep 60
      workspaces:
      - name: source
    - name: last
      taskSpec:
        workspaces:
        - name: source
        steps:
        - image: busybox
          script: |
            echo $(workspaces.source.path)
            sleep 60
      runAfter: ["first"]
      workspaces:
      - name: source
  3. Determine what node the affinity assistant pod is running on:
$ kubectl get pods -l app.kubernetes.io/component=affinity-assistant -o=custom-columns=NAME:.metadata.name,NODE:.spec.nodeName
NAME                              NODE
affinity-assistant-6d8794b076-0   gke-test-cluster-default-pool-2b351b27-vrl5
  4. Cordon the node:
$ kubectl cordon gke-test-cluster-default-pool-2b351b27-vrl5
node/gke-test-cluster-default-pool-2b351b27-vrl5 cordoned
  5. When the second TaskRun is created, its pod is stuck in Pending status:
$ kubectl get po
NAME                               READY   STATUS      RESTARTS   AGE
affinity-assistant-6d8794b076-0    1/1     Running     0          117s
good-morning-run-kcsr4-first-pod   0/1     Completed   0          117s
good-morning-run-kcsr4-last-pod    0/1     Pending     0          26s

$ kubectl get events -n default --field-selector involvedObject.name=good-morning-run-kcsr4-last-pod
LAST SEEN   TYPE      REASON              OBJECT                                MESSAGE
60s         Warning   FailedScheduling    pod/good-morning-run-kcsr4-last-pod   0/4 nodes are available: 1 node(s) didn't match pod affinity rules, 1 node(s) were unschedulable, 2 node(s) had volume node affinity conflict. preemption: 0/4 nodes are available: 4 Preemption is not helpful for scheduling.
60s         Normal    NotTriggerScaleUp   pod/good-morning-run-kcsr4-last-pod   pod didn't trigger scale-up: 2 node(s) had volume node affinity conflict, 1 node(s) didn't match pod affinity rules
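
(For completeness: in this reproduction the pending pod can schedule again once the node is uncordoned, assuming the node is still present:)

$ kubectl uncordon gke-test-cluster-default-pool-2b351b27-vrl5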

Additional Info

  • Kubernetes version:
Client Version: v1.25.4
Kustomize Version: v4.5.7
Server Version: v1.24.10-gke.2300
  • Tekton Pipeline version: main

@lbernick lbernick added the kind/bug Categorizes issue or PR as related to a bug. label Apr 26, 2023
@pritidesai
Member

Thanks a bunch @lbernick for creating this issue, appreciate it 🙏
