-
Notifications
You must be signed in to change notification settings - Fork 743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New pods failing to start with FailedCreatePodSandBox
warning for CNI versions 1.7.x with Cilium
#1265
Comments
Hi @YesemKebede Can you please confirm if you have set I checked IPAMD logs and I see IP allocation seems fine on the first look. We will further investigate the issue. Thanks. |
@jayanthvn |
Thanks @YesemKebede . We will look into it asap. |
Hi @YesemKebede Can also please confirm how you upgraded from 1.6.3 to 1.7.X? Thank you! |
@jayanthvn I followed this Doc |
I upgraded from 1.6.3 to 1.7.5 and had the same problem. No new pod could be started and they had that same error. I had both AWS_VPC_K8S_CNI_LOG_FILE and AWS_VPC_K8S_PLUGIN_LOG_FILE set to stdout and had this same problem. Removing them so they get sent to files as per their default config solved the issue for me. Google led me here: Azure/azure-container-networking#195 (comment) |
@sophomeric Yes setting AWS_VPC_K8S_PLUGIN_LOG_FILE to stdout will cause a similar issue(#1251). But here it wasn't set. |
We are experiencing the same issue on newly provisioned clusters with the following difference in versions:
If it helps, although I am not 100% sure about the Kubernetes version being exactly the same patch version, that same configuration was working last week on a different cluster with the same characteristics. |
Hi @Aggouri Can you please confirm the CNI version for the two clusters?
Thanks. |
The cluster was provisioned a few hours ago: $ kubectl describe daemonset aws-node --namespace kube-system | grep Image | cut -d "/" -f 2
amazon-k8s-cni-init:v1.7.5-eksbuild.1
amazon-k8s-cni:v1.7.5-eksbuild.1 Sadly, I am unable to provide the CNI plugin version of the previous cluster as it was already torn down. If it helps, I know it was provisioned at the beginning of last week and used the defaults EKS came with for Kubernetes version |
Thanks for conforming @Aggouri . We are actively looking into the issue. Will update asap. |
@jayanthvn upgraded from 1.6.3 to 1.7.5 having a problem.
|
Hi, We have found the RC, for now please add
I was able to repro and below is the o/p after fixing the conflist -
Thank you! |
FailedCreatePodSandBox
warning for CNI versions 1.7.xFailedCreatePodSandBox
warning for CNI versions 1.7.x with Cilium
#1275 is merged so closing this issue. |
is there any ETA for a new release including this fix? |
I tried the workaround suggested in #1265 (comment) 👍 After this, coredns is RUNNING but NOT READY. This from the 2 pods.
Thoughts? |
Hi @pankajmt Which image version are you using since Rel 1.7.9 has the fix for #1265 - https://github.com/aws/amazon-vpc-cni-k8s/releases/tag/v1.7.9. |
Can we point to a particular set of EKS releases in the Cilium docs somehow? What versions of EKS will ship with 1.7.9? |
I am on aws cni 1.7.8.
So looks like there is hope assuming the EKS version we need is GA in our region. While docs improve, someone knows the EKS version I should be looking for? Many thanks, |
Hi, Yeah that would be great if Cilium docs can point to EKS CNI versions and if there is any known issue it would be easy for Cx to fallback or look for new versions. Currently EKS default CNI version is 1.7.5 with new clusters. Will keep you updated if we plan to make 1.7.9 or later versions default for EKS. Thank you! |
So looks like then ours is a custom install of the EKS CNI. I will figure out how it was done and how can I upgrade it to 1.7.9. |
worked well with aws cni 1.7.9. many thanks. |
What happened:
New pods started failing to come up after upgrading to eks CNI v1.7.0 from v1.6.0. I was able to upgrade to v1.6.3 without any issue. I started to see the errors when I upgraded to 1.7.0. I also tried to upgrade to other version ( v1.7.2 and v1.7.5) but I am seeing the same issue.
Here is the error I am seeing.
Here is the cni log
Anything else we need to know?:
Environment:
v1.17.9-eks-4c6976
5.4.58-27.104.amzn2.x86_64
The text was updated successfully, but these errors were encountered: