
Upgrade to coreDNS 1.5 in node-cache #306

Closed
prameshj opened this issue May 30, 2019 · 21 comments · Fixed by #328

@prameshj
Contributor

This is needed to pick up the bugfix in coredns/coredns#2636.

@woopstar
Member

@prameshj Can you test whether our new image has the same cleanup problems that caused the revert:

pasientskyhosting/k8s-dns-node-cache-amd64:1.15.4-2-g903339e-coredns1.6.2-dirty

@prameshj
Contributor Author

prameshj commented Sep 2, 2019

> @prameshj Can you test whether our new image has the same cleanup problems that caused the revert:
>
> pasientskyhosting/k8s-dns-node-cache-amd64:1.15.4-2-g903339e-coredns1.6.2-dirty

It still does not do the cleanup. This is how I test it:

  1. Pick one of the node-local-dns pods and run "watch -n 1 'iptables -t raw -nvL'" on its node.
     This command watches the iptables rules that the node-local-dns pod configures.
  2. Delete that specific node-local-dns pod with "kubectl delete pods <pod-name>".
  3. Check the output of step 1 to see whether the rules are still there.

I see that the rules are still there and only get cleaned up when the next pod comes up. This is a problem when we disable nodelocaldns entirely, since the rules and interface are then never cleaned up.

Last I checked, it seemed like the shutdown callbacks in the forward plugin were taking too long or not completing before the pod shut down. The node-cache cleanup would only be called after all the plugin shutdown callbacks. @johnbelamaric, any ideas on how to debug this further?
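
For context, here is a rough sketch of the ordering I am describing (paraphrased from the shape of caddy v1's signal handling, not the vendored source; names are illustrative): the plugin shutdown callbacks run first, and the process-exit hooks where node-cache registers its cleanup only run after every callback has returned, so a callback that hangs blocks the cleanup entirely.

    package sketch

    import (
        "log"
        "os"
    )

    // onProcessExit stands in for caddy's exported OnProcessExit slice,
    // which is where node-cache registers its iptables/interface cleanup.
    var onProcessExit []func()

    // handleSigterm paraphrases the shape of caddy v1's signal handler.
    func handleSigterm(shutdownCallbacks []func() error) {
        for i, cb := range shutdownCallbacks {
            // A callback that never returns (e.g. the forward plugin's)
            // blocks right here, before any exit hook can run.
            if err := cb(); err != nil {
                log.Printf("shutdown callback %d failed: %v", i, err)
            }
        }
        for _, f := range onProcessExit {
            f() // node-cache's cleanup would finally run here
        }
        os.Exit(0)
    }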

@jordips

jordips commented Sep 4, 2019

OK, I tested it with the image, and the PREROUTING rules are not deleted, as you said:

Chain PREROUTING (policy ACCEPT 4239 packets, 952K bytes)
    4   374 CT         udp  --  *      *       0.0.0.0/0            169.254.25.10        udp dpt:53 NOTRACK
    0     0 CT         tcp  --  *      *       0.0.0.0/0            169.254.25.10        tcp dpt:53 NOTRACK

I will try to debug a little bit more.

@jordips

jordips commented Sep 4, 2019

> Last I checked, it seemed like the shutdown callbacks in the forward plugin were taking too long or not completing before the pod shut down.

@prameshj How did you debug this part? I'm trying to see the logs when the pod is deleted, but I can't access them with the --previous flag.

Thank you

@prameshj
Contributor Author

prameshj commented Sep 5, 2019

I think I added logs to the different plugin shutdown callbacks and checked whether they were invoked.
I started with logs here:

exitCode := executeShutdownCallbacks("SIGTERM")

Then I checked which callbacks completed; I recall the forward plugin callback did not complete.

I also modified the nodelocaldns yaml to include a terminationGracePeriodSeconds value of 60, but that did not help.
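
In case it helps anyone reproduce this, the instrumentation I mean is roughly the following (a sketch only; since caddy v1 runs the callbacks inside its own signal handler, doing this for real means editing the vendored caddy code):

    package sketch

    import "log"

    // instrumentShutdownCallbacks wraps each shutdown callback with log
    // lines, so we can see which callbacks start and which never return.
    func instrumentShutdownCallbacks(callbacks []func() error) []func() error {
        wrapped := make([]func() error, len(callbacks))
        for i, cb := range callbacks {
            i, cb := i, cb // capture loop variables for the closures
            wrapped[i] = func() error {
                log.Printf("shutdown callback %d: starting", i)
                err := cb()
                log.Printf("shutdown callback %d: done (err=%v)", i, err)
                return err
            }
        }
        return wrapped
    }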

@prameshj
Contributor Author

prameshj commented Sep 6, 2019

@jordips Is it possible for you to build this image with the latest CoreDNS, 1.6.3? https://github.com/coredns/coredns/releases

I talked to @johnbelamaric; he suggested we try the latest version, and if it still shows the same behavior, we can open an issue on the coredns repo.

@jordips

jordips commented Sep 9, 2019

@prameshj Sure! I will try it later and let you know.

@jordips

jordips commented Sep 9, 2019

@woopstar or @prameshj Is there any special procedure for managing packages with dep? I changed the coreDNS version in Gopkg.toml, but running "dep ensure" fails with missing packages from k8s.io (the ones inside "pkg" and "cmd").
Thanks

@woopstar
Member

woopstar commented Sep 9, 2019

@chad-jones is the one who made our image and got it working with 1.6.2. I can ask him to rebuild with 1.6.3 too.

@jordips

jordips commented Sep 9, 2019

@woopstar that would be awesome! :) Or, if @chad-jones explains the rebuild steps, I can do it myself. I think the procedure is:

  • change Gopkg.toml (coredns version)
  • run "dep ensure" (or something like it) to pull the new source into the vendor folder
  • run "make containers" to build the new image

But I'm having problems with the dep dependencies... I'm still working on it.

@woopstar
Member

woopstar commented Sep 9, 2019

@chad-jones mentioned to me that it was a dep nightmare when he did 1.6.2 :) I'll let him post here whatever he decides to do :)

@jordips

jordips commented Sep 9, 2019

@woopstar Perfect, thank you for the info. Good to know I'm not the only one who thinks it's a nightmare 👍 :D

@chad-jones

CoreDNS - 1.6.3
pasientskyhosting/k8s-dns-node-cache-amd64:1.15.4-coredns1.6.3-dirty

@johnbelamaric
Member

k/k and CoreDNS have both moved to Go modules; is it worth making that change here?

@prameshj
Contributor Author

> @woopstar Perfect, thank you for the info. Good to know I'm not the only one who thinks it's a nightmare 👍 :D

I think the biggest issue is the gap between the prometheus client version that skydns uses and the one CoreDNS requires. When I tried updating to a newer CoreDNS, I had to manually edit the skydns vendor directory to use a newer prometheus client. Was this the issue you faced too?

I am not sure if moving to Go modules will fix that particular issue, but it is worth making the change.

@prameshj
Contributor Author

I will try out the cleanup test with the new image and report back in a day or so. Thanks for sharing the image.

@axot

axot commented Sep 18, 2019

I also tried upgrading to 1.5.3 earlier and did not face the skydns issue. One more change, I think, was that caddy was renamed from github.com/mholt/caddy to github.com/caddyserver/caddy.

@prameshj
Contributor Author

> pasientskyhosting/k8s-dns-node-cache-amd64:1.15.4-coredns1.6.3-dirty

I just tried this. The cleanup code is not invoked in the latest version either.

@prameshj
Contributor Author

prameshj commented Oct 4, 2019

@woopstar @chad-jones I am trying to debug the cleanup issue by adding logs and building a custom image. I am having a hard time getting the image to build.

I changed Gopkg.toml to point to coredns 1.6.4 and client_golang to 1.1.0, which coredns needs. However, that dependency resolution fails when I do "dep ensure". Could you submit a PR with the changes to upgrade to 1.6? I can debug it further. Thanks!
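
For reference, the Gopkg.toml changes I am describing look roughly like this (a sketch, not the exact file; dep's [[override]] is what forces the newer prometheus client even where other dependencies pin an older one):

    [[constraint]]
      name = "github.com/coredns/coredns"
      version = "=1.6.4"

    # Force the newer prometheus client that coredns needs, even where
    # other dependencies (e.g. skydns) pin an older version.
    [[override]]
      name = "github.com/prometheus/client_golang"
      version = "=1.1.0"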

@prameshj
Contributor Author

I found the issue: coreDNS changed the import path for caddy in coredns/coredns#2961, but node-cache was still using the old import path when registering the cleanup function via OnProcessExit. I changed it to use the right path, and it is working fine now.
Many thanks to @chad-jones for creating the PR with the dependency updates. I have submitted a PR with the changes and hope to merge it soon.
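
The gist of the fix, as a sketch (the teardown function name here is illustrative, not the actual node-cache source): because Go treats github.com/mholt/caddy and github.com/caddyserver/caddy as two different packages, node-cache was appending its cleanup hook to an OnProcessExit slice that the vendored CoreDNS's signal handler never iterated. Importing caddy from the new path registers the hook on the copy that actually runs:

    package sketch

    import (
        // old import, no longer the package CoreDNS uses:
        //   "github.com/mholt/caddy"
        "github.com/caddyserver/caddy"
    )

    // teardownNetworking is a stand-in for node-cache's real cleanup
    // (removing the dummy interface and the NOTRACK iptables rules).
    func teardownNetworking() { /* ... */ }

    func init() {
        // Appended to the same caddy package CoreDNS now imports, the
        // hook actually fires when the signal handler exits the process.
        caddy.OnProcessExit = append(caddy.OnProcessExit, teardownNetworking)
    }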

@woopstar
Member

woopstar commented Dec 3, 2019

We are glad to help out with this issue! Thank you, @prameshj, for driving an official solution.
