-
Notifications
You must be signed in to change notification settings - Fork 716
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Kubeadm reset command whether remove etcd member on the master #1211
Comments
@pytimer could you kindly provide more info about your cluster (how it was created, the kubeadm-gce-master.yaml) |
@fabianofranz sorry, some things during this time. I use 1.13.0-beta.2 on the virtual machine to test this issue. I init and join control plane successfully. But when i run Etcd container on the first init node always restart, and i found logs output this etcd still connect reset node etcd member. etcd logs:
reproduce
kubeadm init yaml:apiVersion: kubeadm.k8s.io/v1beta1
kind: InitConfiguration
bootstrapTokens:
- groups:
- system:bootstrappers:kubeadm:default-node-token
ttl: 24h0m0s
usages:
- signing
- authentication
localAPIEndpoint:
advertiseAddress: 0.0.0.0
bindPort: 6443
nodeRegistration:
criSocket: /var/run/dockershim.sock
name: master212
taints:
- effect: NoSchedule
key: node-role.kubernetes.io/master
---
apiServer:
timeoutForControlPlane: 4m0s
apiVersion: kubeadm.k8s.io/v1beta1
kind: ClusterConfiguration
certificatesDir: /etc/kubernetes/pki
clusterName: kubernetes
controlPlaneEndpoint: "10.33.46.215"
controllerManager: {}
dns:
type: CoreDNS
etcd:
local:
serverCertSANs:
- "10.33.46.215"
extraArgs:
cipher-suites: TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256
dataDir: /var/lib/etcd
imageRepository: k8s.gcr.io
kubernetesVersion: v1.13.0-beta.2
networking:
dnsDomain: cluster.local
podSubnet: "10.244.0.0/16"
serviceSubnet: 10.96.0.0/12 kubeadm join yaml:apiVersion: kubeadm.k8s.io/v1beta1
kind: JoinConfiguration
caCertPath: /etc/kubernetes/pki/ca.crt
discovery:
bootstrapToken:
apiServerEndpoint: 10.33.46.215:6443
token: 1jvhzl.37osma939vn5q1uh
unsafeSkipCAVerification: true
timeout: 5m0s
tlsBootstrapToken: 1jvhzl.37osma939vn5q1uh
controlPlane:
localAPIEndpoint:
advertiseAddress: 0.0.0.0
bindPort: 6443
nodeRegistration:
criSocket: /var/run/dockershim.sock
name: master213 |
@pytimer The reason behind that is that the etcd cluster loses quorum @timothysc opinions about if/how to handle this use case? |
Yes, you said right. So i think if run |
we should try to make the remaining etcd nodes to not get stuck. edit: also having related tests in the future would be great. |
I add remove etcd member feature when reset the control plane node, it's works for me. I am not sure if this workflow should join the |
/remove-priority awaiting-more-evidence |
Is this a BUG REPORT or FEATURE REQUEST?
FEATURE REQUEST
Versions
kubeadm version (use
kubeadm version
): kubeadm master branchEnvironment:
kubectl version
):uname -a
):Linux master1 3.10.0-693.el7.x86_64 #1 SMP Tue Aug 22 21:09:27 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
What happened?
Hi, i use
kubeadm reset
command on the one of masters, it not remove etcd member in the etcd cluster. I use local etcd in the init.What you expected to happen?
I look at the master branch code, but i'm not find about this. I hope if reset on the master, kubeadm can remove etcd member.
How to reproduce it (as minimally and precisely as possible)?
Anything else we need to know?
The text was updated successfully, but these errors were encountered: