-
Notifications
You must be signed in to change notification settings - Fork 14.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add kubeadm upgrade docs for 1.11 #9089
Changes from 1 commit
e3663c5
a986d34
b3c10bb
d370c63
78de6e6
f765ee1
f2a309a
451ce77
68bcbc2
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,303 @@ | ||
--- | ||
reviewers: | ||
- pipejakob | ||
- luxas | ||
- roberthbailey | ||
- jbeda | ||
title: Upgrading kubeadm clusters from v1.10 to v1.11 | ||
content_template: templates/task | ||
--- | ||
|
||
{{% capture overview %}} | ||
|
||
This guide is for upgrading `kubeadm` clusters from version 1.10.x to 1.11.x, as well as 1.10.x to 1.10.y and 1.11.x to 1.11.y where `y > x`. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We removed the possibility for kubeadm v1.11.0 to upgrade from v1.10.3 to v1.10.4 right b/c of the config stuff right? |
||
|
||
{{% /capture %}} | ||
|
||
{{% capture prerequisites %}} | ||
|
||
Before proceeding: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. we should talk about what kind of setup we are covering here. It's not enough to say a For example we will not be supporting upgrades for clusters using an external etcd There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We should definitely fix upgrading external etcd in a patch release of v1.11 IMO |
||
|
||
- You need to have a functional `kubeadm` Kubernetes cluster running version 1.10.0 or higher in order to use the process described here. Swap also needs to be disabled. | ||
- Make sure you read the [release notes](https://github.com/kubernetes/kubernetes/blob/master/CHANGELOG-1.9.md) carefully. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Shouldn't this be 1.11.md |
||
- `kubeadm upgrade` now allows you to upgrade etcd. `kubeadm upgrade` will also upgrade of etcd to 3.1.10 as part of upgrading from v1.8 to v1.9 by default. This is due to the fact that etcd 3.1.10 is the officially validated etcd version for Kubernetes v1.9. The upgrade is handled automatically by kubeadm for you. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. nix this whole statement, not needed and not correct. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Shouldn't we at least say kubeadm will upgrade your etcd if it's hosted locally and There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I don't think we really need to mention this since we will show this in the plan output. |
||
- Note that `kubeadm upgrade` will not touch any of your workloads, only Kubernetes-internal components. As a best-practice you should back up what's important to you. For example, any app-level state, such as a database an app might depend on (like MySQL or MongoDB) must be backed up beforehand. | ||
|
||
{{< caution >}} | ||
**Caution:** All the containers will get restarted after the upgrade, due to container spec hash value being changed. | ||
{{< /caution >}} | ||
|
||
Also, note that only one minor version upgrade is supported. For example, you can only upgrade from 1.10 to 1.11, not from 1.9 to 1.11. | ||
|
||
{{% /capture %}} | ||
|
||
{{% capture steps %}} | ||
|
||
## Upgrading your control plane | ||
|
||
Execute these commands on your master node: | ||
|
||
1. Install the most recent version of `kubeadm` using `curl` like so: | ||
|
||
```shell | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. this whole section under the numbered list will have to be indented 4 or 5 spaces (i have seen both /shrug) to preserve the correct numbering on the numbered list. The preview shows all the numbered items as "1" https://deploy-preview-9089--kubernetes-io-master-staging.netlify.com/docs/tasks/administer-cluster/upgrade-downgrade/kubeadm-upgrade-1-11/ |
||
sudo apt-get upgdate && sudo apt-get upgrade kubeadm | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. update
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. You need to use curl here. Otherwise the kubeadm deb will pull in the new dropin and restart the old kubelet 😱. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @luxas I tested this myself, it works just fine on Ubuntu xenial? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. s/upgdate/update/ There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @lizto I haven't tested specifically for the v1.11 upgrades, but with previous versions installing the kubeadm deb caused a service restart of the kubelet. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. https://asciinema.org/a/PEvquZxQzHpjJWh1dRUfcBCIE proof is in the pudding There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. #fancy_ascii_vods There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. After speaking with @liztio about this yesterday we discovered that part of the diff between the kubeadm packages built using bazel from k/k and the release tooling is the postinstall kubelet restart 😢 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @detiber yes, hence my comment. We should fix the bazel packages. Can any of you open an issue about that? |
||
``` | ||
|
||
`kubeadm` is only needed on individual (non-master) nodes for joining the cluster. | ||
It is not necessary to update kubeadm on nodes | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. These statements read a bit awkward and can probably just be removed, since this section is about updating the control plane and does not talk about taking actions on nodes. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I found this confusing while I was working on this, so I think it's worth including. Open to wording suggestions There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Actually, going back to think about this, since we rely on the updated systemd drop-in file, we may actually require an updated kubeadm package on the worker nodes 😭 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Also this is false with v1.11, that requires the user to do |
||
|
||
Verify that this download of kubeadm works and has the expected version: | ||
|
||
```shell | ||
kubeadm version | ||
``` | ||
|
||
2. On the master node, run the following: | ||
|
||
```shell | ||
kubeadm upgrade plan | ||
``` | ||
|
||
You should see output similar to this: | ||
|
||
<!-- TODO: copy-paste actual output once new version is stable --> | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Now it should kinda be from master There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What do you mean by this? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. You can now paste the "right" output from master/the release candidate and it's gonna be correct, no need to wait anymore There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Everything is still suffixed with |
||
|
||
```shell | ||
[preflight] Running pre-flight checks | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This whole section needs to match v1.11 text. |
||
[upgrade] Making sure the cluster is healthy: | ||
[upgrade/health] Checking API Server health: Healthy | ||
[upgrade/health] Checking Node health: All Nodes are healthy | ||
[upgrade/health] Checking Static Pod manifests exists on disk: All manifests exist on disk | ||
[upgrade/config] Making sure the configuration is correct: | ||
[upgrade/config] Reading configuration from the cluster... | ||
[upgrade/config] FYI: You can look at this config file with 'kubectl -n kube-system get cm kubeadm-config -o yaml' | ||
[upgrade] Fetching available versions to upgrade to: | ||
[upgrade/versions] Cluster version: v1.10.1 | ||
[upgrade/versions] kubeadm version: v1.10.0 | ||
[upgrade/versions] Latest stable version: v1.9.0 | ||
[upgrade/versions] Latest version in the v1.8 series: v1.8.6 | ||
|
||
Components that must be upgraded manually after you've upgraded the control plane with 'kubeadm upgrade apply': | ||
COMPONENT CURRENT AVAILABLE | ||
Kubelet 1 x v1.8.1 v1.8.6 | ||
|
||
Upgrade to the latest version in the v1.8 series: | ||
|
||
COMPONENT CURRENT AVAILABLE | ||
API Server v1.8.1 v1.8.6 | ||
Controller Manager v1.8.1 v1.8.6 | ||
Scheduler v1.8.1 v1.8.6 | ||
Kube Proxy v1.8.1 v1.8.6 | ||
Kube DNS 1.14.4 1.14.5 | ||
|
||
You can now apply the upgrade by executing the following command: | ||
|
||
kubeadm upgrade apply v1.8.6 | ||
|
||
_____________________________________________________________________ | ||
|
||
Components that must be upgraded manually after you've upgraded the control plane with 'kubeadm upgrade apply': | ||
COMPONENT CURRENT AVAILABLE | ||
Kubelet 1 x v1.8.1 v1.9.0 | ||
|
||
Upgrade to the latest stable version: | ||
|
||
COMPONENT CURRENT AVAILABLE | ||
API Server v1.8.1 v1.9.0 | ||
Controller Manager v1.8.1 v1.9.0 | ||
Scheduler v1.8.1 v1.9.0 | ||
Kube Proxy v1.8.1 v1.9.0 | ||
Kube DNS 1.14.5 1.14.7 | ||
|
||
You can now apply the upgrade by executing the following command: | ||
|
||
kubeadm upgrade apply v1.9.0 | ||
|
||
Note: Before you do can perform this upgrade, you have to update kubeadm to v1.9.0 | ||
|
||
_____________________________________________________________________ | ||
``` | ||
|
||
The `kubeadm upgrade plan` checks that your cluster is upgradeable and fetches the versions available to upgrade to in an user-friendly way. | ||
|
||
3. Pick a version to upgrade to and run. For example: | ||
|
||
```shell | ||
kubeadm upgrade apply v1.11.0 | ||
``` | ||
|
||
You should see output similar to this: | ||
|
||
<!-- TODO: output from stable --> | ||
|
||
```shell | ||
[preflight] Running pre-flight checks. | ||
[upgrade] Making sure the cluster is healthy: | ||
[upgrade/config] Making sure the configuration is correct: | ||
[upgrade/config] Reading configuration from the cluster... | ||
[upgrade/config] FYI: You can look at this config file with 'kubectl -n kube-system get cm kubeadm-config -oyaml' | ||
I0614 20:56:08.320369 30918 feature_gate.go:230] feature gates: &{map[]} | ||
[upgrade/apply] Respecting the --cri-socket flag that is set with higher priority than the config file. | ||
[upgrade/version] You have chosen to change the cluster version to "v1.11.0-beta.2.78+e0b33dbc2bde88" | ||
[upgrade/versions] Cluster version: v1.10.4 | ||
[upgrade/versions] kubeadm version: v1.11.0-beta.2.78+e0b33dbc2bde88 | ||
[upgrade/confirm] Are you sure you want to proceed with the upgrade? [y/N]: y | ||
[upgrade/prepull] Will prepull images for components [kube-apiserver kube-controller-manager kube-scheduler etcd] | ||
[upgrade/apply] Upgrading your Static Pod-hosted control plane to version "v1.11.0-beta.2.78+e0b33dbc2bde88"... | ||
Static pod: kube-apiserver-ip-172-31-85-18 hash: 7a329408b21bc0c44d7b3b78ff8187bf | ||
Static pod: kube-controller-manager-ip-172-31-85-18 hash: 24fd3157627c7567b687968967c6a5e8 | ||
Static pod: kube-scheduler-ip-172-31-85-18 hash: 5179266fb24d4c1834814c4f69486371 | ||
Static pod: etcd-ip-172-31-85-18 hash: 9dfc197f444be11fcc70ab1467b030b8 | ||
[etcd] Wrote Static Pod manifest for a local etcd instance to "/etc/kubernetes/tmp/kubeadm-upgraded-manifests089436939/etcd.yaml" | ||
[certificates] Using the existing etcd/ca certificate and key. | ||
[certificates] Using the existing etcd/server certificate and key. | ||
[certificates] Using the existing etcd/peer certificate and key. | ||
[certificates] Using the existing etcd/healthcheck-client certificate and key. | ||
[upgrade/staticpods] Moved new manifest to "/etc/kubernetes/manifests/etcd.yaml" and backed up old manifest to "/etc/kubernetes/tmp/kubeadm-backup-manifests-2018-06-14-20-56-11/etcd.yaml" | ||
[upgrade/staticpods] Waiting for the kubelet to restart the component | ||
Static pod: etcd-ip-172-31-85-18 hash: 9dfc197f444be11fcc70ab1467b030b8 | ||
< snip > | ||
[apiclient] Found 1 Pods for label selector component=etcd | ||
[upgrade/staticpods] Component "etcd" upgraded successfully! | ||
[upgrade/etcd] Waiting for etcd to become available | ||
[util/etcd] Waiting 0s for initial delay | ||
[util/etcd] Attempting to see if all cluster endpoints are available 1/10 | ||
[upgrade/staticpods] Writing new Static Pod manifests to "/etc/kubernetes/tmp/kubeadm-upgraded-manifests089436939" | ||
[controlplane] wrote Static Pod manifest for component kube-apiserver to "/etc/kubernetes/tmp/kubeadm-upgraded-manifests089436939/kube-apiserver.yaml" | ||
[controlplane] wrote Static Pod manifest for component kube-controller-manager to "/etc/kubernetes/tmp/kubeadm-upgraded-manifests089436939/kube-controller-manager.yaml" | ||
[controlplane] wrote Static Pod manifest for component kube-scheduler to "/etc/kubernetes/tmp/kubeadm-upgraded-manifests089436939/kube-scheduler.yaml" | ||
[certificates] Using the existing etcd/ca certificate and key. | ||
[certificates] Using the existing apiserver-etcd-client certificate and key. | ||
[upgrade/staticpods] Moved new manifest to "/etc/kubernetes/manifests/kube-apiserver.yaml" and backed up old manifest to "/etc/kubernetes/tmp/kubeadm-backup-manifests-2018-06-14-20-56-11/kube-apiserver.yaml" | ||
[upgrade/staticpods] Waiting for the kubelet to restart the component | ||
Static pod: kube-apiserver-ip-172-31-85-18 hash: 7a329408b21bc0c44d7b3b78ff8187bf | ||
< snip > | ||
[apiclient] Found 1 Pods for label selector component=kube-apiserver | ||
[upgrade/staticpods] Component "kube-apiserver" upgraded successfully! | ||
[upgrade/staticpods] Moved new manifest to "/etc/kubernetes/manifests/kube-controller-manager.yaml" and backed up old manifest to "/etc/kubernetes/tmp/kubeadm-backup-manifests-2018-06-14-20-56-11/kube-controller-manager.yaml" | ||
[upgrade/staticpods] Waiting for the kubelet to restart the component | ||
Static pod: kube-controller-manager-ip-172-31-85-18 hash: 24fd3157627c7567b687968967c6a5e8 | ||
Static pod: kube-controller-manager-ip-172-31-85-18 hash: 63992ff14733dcb9dcfa6ac0a3b8031a | ||
[apiclient] Found 1 Pods for label selector component=kube-controller-manager | ||
[upgrade/staticpods] Component "kube-controller-manager" upgraded successfully! | ||
[upgrade/staticpods] Moved new manifest to "/etc/kubernetes/manifests/kube-scheduler.yaml" and backed up old manifest to "/etc/kubernetes/tmp/kubeadm-backup-manifests-2018-06-14-20-56-11/kube-scheduler.yaml" | ||
[upgrade/staticpods] Waiting for the kubelet to restart the component | ||
Static pod: kube-scheduler-ip-172-31-85-18 hash: 5179266fb24d4c1834814c4f69486371 | ||
Static pod: kube-scheduler-ip-172-31-85-18 hash: 831e4b9425f758e572392976311e56d9 | ||
[apiclient] Found 1 Pods for label selector component=kube-scheduler | ||
[upgrade/staticpods] Component "kube-scheduler" upgraded successfully! | ||
[uploadconfig] storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace | ||
[kubelet] Creating a ConfigMap "kubelet-config-1.11" in namespace kube-system with the configuration for the kubelets in the cluster | ||
[kubelet] Downloading configuration for the kubelet from the "kubelet-config-1.11" ConfigMap in the kube-system namespace | ||
[kubelet] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml" | ||
[patchnode] Uploading the CRI Socket information "/var/run/dockershim.sock" to the Node API object "ip-172-31-85-18" as an annotation | ||
[bootstraptoken] configured RBAC rules to allow Node Bootstrap tokens to post CSRs in order for nodes to get long term certificate credentials | ||
[bootstraptoken] configured RBAC rules to allow the csrapprover controller automatically approve CSRs from a Node Bootstrap Token | ||
[bootstraptoken] configured RBAC rules to allow certificate rotation for all node client certificates in the cluster | ||
[addons] Applied essential addon: CoreDNS | ||
[addons] Applied essential addon: kube-proxy | ||
|
||
[upgrade/successful] SUCCESS! Your cluster was upgraded to "v1.11.0-beta.2.78+e0b33dbc2bde88". Enjoy! | ||
|
||
[upgrade/kubelet] Now that your control plane is upgraded, please proceed with upgrading your kubelets if you haven't already done so. | ||
``` | ||
|
||
To upgrade the cluster with CoreDNS as the default internal DNS, invoke `kubeadm upgrade apply` with the `--feature-gates=CoreDNS=true` flag. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Say that now CoreDNS is default, and tell the user if they don't wanna switch do X, Y and Z There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It should probably be called out at the top, and this section should probably be updated to show how to keep kube-dns rather than coredns. |
||
`kubeadm upgrade apply` does the following: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Move this to a reference, FYI section below? |
||
|
||
- Checks that your cluster is in an upgradeable state: | ||
- The API server is reachable, | ||
- All nodes are in the `Ready` state | ||
- The control plane is healthy | ||
- Enforces the version skew policies. | ||
- Makes sure the control plane images are available or available to pull to the machine. | ||
- Upgrades the control plane components or rollbacks if any of them fails to come up. | ||
- Applies the new `kube-dns` and `kube-proxy` manifests and enforces that all necessary RBAC rules are created. | ||
- Creates new certificate and key files of apiserver and backs up old files if they're about to expire in 180 days. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. should we |
||
|
||
4. Manually upgrade your Software Defined Network (SDN). | ||
|
||
Your Container Network Interface (CNI) provider may have its own upgrade instructions to follow. | ||
Check the [addons](/docs/concepts/cluster-administration/addons/) page to | ||
find your CNI provider and see if there are additional upgrade steps | ||
necessary. | ||
|
||
## Upgrading your master and node packages | ||
|
||
For each host (referred to as `$HOST` below) in your cluster, upgrade `kubelet` by executing the following commands: | ||
|
||
1. Prepare the host for maintenance, marking it unschedulable and evicting the workload: | ||
|
||
```shell | ||
kubectl drain $HOST --ignore-daemonsets | ||
``` | ||
|
||
When running this command against the master host, `--ignore-daemonsets` is required: | ||
|
||
```shell | ||
kubectl drain ip-172-31-85-18 | ||
node "ip-172-31-85-18" cordoned | ||
error: unable to drain node "ip-172-31-85-18", aborting command... | ||
|
||
There are pending nodes to be drained: | ||
ip-172-31-85-18 | ||
error: DaemonSet-managed pods (use --ignore-daemonsets to ignore): calico-node-5798d, kube-proxy-thjp9 | ||
``` | ||
|
||
``` | ||
kubectl drain ip-172-31-85-18 --ignore-daemonsets | ||
node "ip-172-31-85-18" already cordoned | ||
WARNING: Ignoring DaemonSet-managed pods: calico-node-5798d, kube-proxy-thjp9 | ||
node "ip-172-31-85-18" drained | ||
``` | ||
|
||
2. Upgrade the Kubernetes package versions on the `$HOST` node by using a Linux distribution-specific package manager: | ||
|
||
If the host is running a Debian-based distro such as Ubuntu, run: | ||
|
||
```shell | ||
apt-get update | ||
apt-get upgrade | ||
``` | ||
|
||
If the host is running CentOS or the like, run: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Can you do tabs like the installing kubeadm page has for ubuntu/centos? |
||
|
||
```shell | ||
yum update | ||
``` | ||
|
||
3. Restart the kubectl process with | ||
```shell | ||
sudo systemctl restart kubelet | ||
``` | ||
|
||
Now the new version of the `kubelet` should be running on the host. Verify this using the following command on `$HOST`: | ||
|
||
```shell | ||
systemctl status kubelet | ||
``` | ||
|
||
4. Bring the host back online by marking it schedulable: | ||
|
||
```shell | ||
kubectl uncordon $HOST | ||
``` | ||
|
||
5. After upgrading `kubelet` on each host in your cluster, verify that all nodes are available again by executing the following (from anywhere, for example, from outside the cluster): | ||
|
||
```shell | ||
kubectl get nodes | ||
``` | ||
|
||
If the `STATUS` column of the above command shows `Ready` for all of your hosts and the version you expect, you are done. | ||
|
||
## Recovering from a failure state | ||
|
||
If `kubeadm upgrade` somehow fails and fails to roll back, for example due to an unexpected shutdown during execution, | ||
you can run `kubeadm upgrade` again as it is idempotent and should eventually make sure the actual state is the desired state you are declaring. | ||
|
||
You can use `kubeadm upgrade` to change a running cluster with `x.x.x --> x.x.x` with `--force`, which can be used to recover from a bad state. | ||
|
||
{{% /capture %}} | ||
|
||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nix all these and use the alias
- sig-cluster-lifecycle
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I really should get #8506 in after all the docs flux of the release