Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ipv6 instructions broken #6075

Closed
eylvisaker opened this issue Feb 25, 2018 · 43 comments
Closed

ipv6 instructions broken #6075

eylvisaker opened this issue Feb 25, 2018 · 43 comments
Labels
area/compose Relates to docker-compose.yml spec or docker-compose binary lifecycle/locked

Comments

@eylvisaker
Copy link

File: config/daemon/ipv6.md, CC @johndmulhausen

If you follow instructions as written, it will prevent the docker daemon from starting. The instructions need to be updated to include the "fixed-cidr-v6" key with a valid ipv6 cidr value.

@justinclift
Copy link

justinclift commented Aug 10, 2018

Having also hit this today, when looking into enabling IPv6 for a swarm setup... it seems the IPv6 instructions used to be more comprehensive, but at some point were reduced to the current non-working single line version.

That single line approach seems like it might work, but only for specialised (non-default) configurations.

Some further info here: moby/moby#29443 (comment)

The IPv6 instructions should probably be made to work "out of the box" for default configurations, or at least be expanded out again to explain how to set up a non-default configuration that does support them.

In the meantime, people should probably use the older docs version that's more comprehensive. 😉

@justinclift
Copy link

justinclift commented Aug 10, 2018

Ugh. docker/compose#4958 ☹️

nolim1t added a commit to lncm/dockerfiles that referenced this issue Sep 16, 2018
@ghost
Copy link

ghost commented Jan 2, 2019

Closing this ticket due to its age, and the impending refactor. If you think this is in error, feel free to reopen. Thanks!

@ghost ghost closed this as completed Jan 2, 2019
@nogweii
Copy link

nogweii commented Jan 2, 2019

Where can we follow for news about the refactor?

@justinclift
Copy link

and the impending refactor.

Please don't. Would be better to close it after the refactor is in place and known to work. 😄

If you think this is in error, feel free to reopen.

Doesn't look like I have permission to reopen this, as the button isn't showing up.

@L-Hudson Would you be ok to re-open for us? 😄

@jedahan
Copy link

jedahan commented Feb 23, 2019

Just ran into this issue today when I only want ipv6 link-local addresses.

@paigehargrave paigehargrave reopened this Feb 24, 2019
@bermudezmt bermudezmt added the area/compose Relates to docker-compose.yml spec or docker-compose binary label Feb 27, 2019
@hr1232
Copy link

hr1232 commented May 24, 2019

As there are several tickets written by people claming that IPv6 is broken, here is the solution how to configure it corretly. Someone should change the documentation accordingly.

Docker IPv6 is working perfectly, it's just the documentation that doesn't tell people what to do in order to set it up correctly.

Docker will not do SLAAC (Stateless Address Autoconfiguration), nor will it use DHCPv6. As it also won't do NATv6 (thank god!), you will have to assign ipv6 prefixes to your networks manually. The default bridge network has to be assigned a network, you can do this by adding the following to daemon.json:

{
"ipv6": true,
"fixed-cidr-v6": "2001:db8::/64",
}

For any other network you might have, you can decide wether or not it will be ipv6 capable. To create an IPv6 capable network, do the following:
docker network create --ipv6 --subnet "2001:db8:1::/64" testnet

Also remember, that this is not NATv6 (thank god!). Therefor you will have to have a route from your router to your docker host!

@justinclift
Copy link

@hr1232 That sounds potentially really good. 😄

Which version of Docker have you tried this on, so it's "known good" with?

@hr1232
Copy link

hr1232 commented May 25, 2019

I'm on Debian Stretch and use the most current version in the repositories.

# docker version
Client:
 Version:           18.09.5
 API version:       1.39
 Go version:        go1.10.8
 Git commit:        e8ff056dbc
 Built:             Thu Apr 11 04:44:28 2019
 OS/Arch:           linux/amd64
 Experimental:      false

@justinclift
Copy link

Thanks @hr1232, that's good info. There's now a reference point for people to work from. 😄

@hr1232
Copy link

hr1232 commented May 26, 2019

@justinclift There actually is a reference, but it is wrong. According to the documentation, you only have to add the ipv6 parameter and that is just wrong.

As long as Docker won't do SLAAC, DHCPv6 or NATv6 people need to know that they have to do more than just slapping on a simple parameter. The second parameter fixed-cidr-v6 is mandatory, so is the manual calculation of IPv6 subnets. Sadly that isn't mentioned in the documentation and leads people the the error message Error starting daemon: Error initializing network controller: Error creating default "bridge" network: could not find an available, non-overlapping IPv6 address pool among the defaults to assign to the network

@justinclift
Copy link

justinclift commented May 26, 2019

Yeah, personally speaking I pretty much marked it off as "not fit for purpose" and moved on to other things. 😉

@pandruszkow
Copy link

Why in the world do we have to specify link-local IPv6 addresses by hand over a year after this issue was opened? What is this, the 1990s?

IMO, if Docker is smart enough to figure out and assign a valid loopback IPv4 address, it's more than capable of doing the same for IPv6 addresses.

@hr1232
Copy link

hr1232 commented Jul 12, 2019

@pandruszkow

First of all, the IPv6 addresses you assign are not link local. Docker will do that on its own, so will all containers. You do, however, have to assign a prefix from either the Unique Local Unicast range or the Global Unicast range for every Docker network.

Second of all, Docker does also not assign a lookback IPv4 address. It assigns a private IPv4 subnet and does NAT, except if you assign a routed IPv4 subnet manually.

You do have to do a little bit more with IPv6 because there is no NAT available in IPv6 if it is used correctly. Therefor Docker cannot just assign any Unique Local Unicast address and to NAT. Docker is not able to know which prefix you routed towords the Docker host. Therefor it cannot do that job for you - at least not without SLAAC support. And even if it had SLAAC, you would need a prefix with less then 64 bits in your host network to use SLAAC but most ISPs do only provide a 64 bit prefix to their customers and therefor SLAAC won't work. Providing 64 bit subnets to customers is against RFC (customers should get 48-56 bit subnets), but ISPs are not bound to follow RFCs and most don't.

That said, there is nothing to fix, except the documentation. Of course, Docker could be made SLAAC-capable and thereby easy the configuration in some cases where a large enough IPv6 subnet is available. However, that will do nothing for most users and they will still need to configure manually.

Either way, as you used the terms link local address and loopback address completely wrong, you obviously should read up on both protocols before trolling around and making such nonsensical comments.

@justinclift
Copy link

You do have to do a little bit more with IPv6 because there is no NAT available in IPv6 if it is used correctly.

A bold statement. 😉

@hr1232
Copy link

hr1232 commented Jul 12, 2019

Not a "bold statement" but reality. NAT64 is broken crap and nobody wants to use it because of all the problems NAT is causing in IPv4. If you are talking about the missing SLAAC capability, the answer to why that won't help anybody is also stated above.

Also, the discussion is about the crappy documentation and not about wether or not @justinclift understands that NAT64 is nonsense, invented by people that don't understand IPv6 and try to stay in their broken IPv4 conventions. So stop aguing about things this ticket is not about.
@justinclift Also, please send my a link to where you think that the IPv6 RFC is talking abount a meaningful and correct use of NAT in IPv6 networks. Im waiting for your PM

@scyto
Copy link

scyto commented Sep 16, 2019

@hr1232 I agree the documentation is highly confusing. I have been unable to get traffic to egress from a user created bridge. I am sure it is my new-ness to linux and docker routing that is the issue but after 10 hours on this i am kind of read up and researched out.

For example the comments in this thread appear to suggest that the following only applies to the default bridge and not user created bridges?

{
"ipv6": true,
"fixed-cidr-v6": "2001:db8:1::/64"
}

(note by default my docker platform doesn't have this enabled, but i have played with having it on and off)

If that is so then then doc needs to clearly state it as i enabled this and was messing around for hours on user created bridge.

On my user created ipv6 bridge, the container gets an IP at the beginning of the range and can ping the gateway defined inside the container. Heck i even get full ipv6 name resolution for external IPv6 addresses and full ICC IPv6 name resolution AND traffic flowing between multiple containers on the same bridge.

On the default bridge i enabled the ipv6 section in dockerd.json and got stuck in exactly the same place.

What i don't have is any form of routing out or back into the container. Or what i am supposed to do about it!? It isn't clear to me which commands are executed on the host and which in the container.

It is also possible i messed up on IPv6 itself, my PD issued subnet is:
2601:xxxx:xxxx:7c80::/64

My subnet i am using the containers is 2601:xxxx:xxxx:7c80::500/120

I assume this is ok as there is nothing to say don't do this... i tried and failed to figure out what the IPv6 equivalent is of 192.168.x.x. is... and what the notation would be.. aka fd00, fc00 - maybe an example showing how to use these AND get routing but given there is no ipv6 NAT i assume these are useless for egress to internet?

While i don't expect steps to tell me how to do it on my router - i do expect the basic sketch of the procedure..

For example i would expect see a default ipv6 route in the containers routing table but i don't... and it won't let me add one, or rather if it can i can't figure out the exact command.

My point being the documentation is incomplete in that it should either say ipv6 is only for internal traffic on a user created bridge OR 'here is the outline on how to get it; routing.

root@debian-ipv6:/# ip route
default via 172.20.0.1 dev eth0
172.20.0.0/16 dev eth0 proto kernel scope link src 172.20.0.2

root@debian-ipv6:/# ip addr
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: sit0@NONE: mtu 1480 qdisc noop state DOWN group default qlen 1
link/sit 0.0.0.0 brd 0.0.0.0
137: eth0@if138: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default
link/ether 02:42:ac:14:00:02 brd ff:ff:ff:ff:ff:ff link-netnsid 0
inet 172.20.0.2/16 brd 172.20.255.255 scope global eth0
valid_lft forever preferred_lft forever
inet6 2601:600:8880:7c80::502/120 scope global nodad
valid_lft forever preferred_lft forever
inet6 fe80::42:acff:fe14:2/64 scope link
valid_lft forever preferred_lft forever

I think i will be using macvlan for now as that allows a single container to talk full ipv6 with ease!

@hr1232
Copy link

hr1232 commented Sep 16, 2019

@hr1232 I agree the documentation is highly confusing. I have been unable to get traffic to egress from a user created bridge,

You have to assign a gateway and subnet to you docker network manually. You have to make sure, the subnet ist routed from your router to the docker host. You have to create the route manually on your router because Docker will not use SLAAC!

For example the comments here appear to suggest that the following only applies to the default bridge and not user created bridges.
{
"ipv6": true,
"fixed-cidr-v6": "2001:db8:1::/64"
}

ipv6: true; applies to all bridges.
fixed-cidr-v6 only applies to the default bridge but is necessary for IPv6 to work.

(note by default my docker platform doesn't have this enabled, but i have played with having it on and off)

In that case you will never be able to get packets out because without ipv6: true; there will be no ipv6 routing available.

If that is so then then doc needs to clearly state it, it doesn't today.

Correct

Secondly i have configured a ipv6 bridge, the container gets an IP at the beginning of the range and can ping the gateway defined inside the container. Heck i even get full ipv6 name resolution for external IPv6 addresses and full ICC IPv6 name resolution AND traffic flowing between multiple containers on the same bridge.

If you don't have ipv6: true; set, this ist exactly what you will see. Without IPv6 routing activated you will not have IPv6 routing. Of course containers on your virtual switch will still be able to communicate amongst each other because switches don't care about layer 3 protocols.Name resolution is done through IPv4 (from the docker host towords the Internet). Note that you don't need an IPv6 capable nameserver to query AAAA records.

What i don't have is any form of routing out or back into the container.

See above. You have to activate IPv6 routing (ipv6: true;) to have IPv6 routing.

Someone mentioned i need to have a route on my router?

Your router will have to know where to send answering packets. This ist neither NAT, nor dynamic routing.

While i don't expect steps to tell me how to do it on my router - i do expect the basic sketch of the procedure..

See above in my earlier comment. You should know how to subnet your home network without creating conflicts within your home network. You than have to route the subnet you created towards your docker host. Then you have to do further subnetting to get one subnet for every docker network, including the default bridge, and assign them manually. Default bridge MUST have a subnet assigned, all other networks are optional. Also, you have to activate IPv6 on your manual networks during creation (--ipv6). This ist all very basic network knowlegde. It is not the job of docker docs to tell you that. It is the job of Wikipedia. Docker docs just need to tell you that Docker won't so SLAAC.

For example i would expect see a default ipv6 route in the containers routing table but i don't... and it won't let me add one, or rather if it can i can't figure out the exact command.

Again, if you don't activate IPv6 routing, you won't have IPv6 routing. As soon as you activate it, your containers will be taken care of by DHCPv6.

My point being the documentation is incomplete in that it should either say ipv6 is only for internal traffic on a user created bridge OR 'here is the outline on how to get it; routing.

If you follow the instructions you will have egress capabilities. You did not activate IPv6 routing, so what do you expect? Take an old IPv4-only router and connect your PC. Will you then have IPv6 in you PC? I think Not. Will you be able to communicate via IPv6 with another PC through the router's internal Switch? Of course! Will your stoneage-router give your PC an egress route or will it Just magically appear in your PC's routing table? Probably not, but don't give up hope....

root@debian-ipv6:/# ip route
default via 172.20.0.1 dev eth0
172.20.0.0/16 dev eth0 proto kernel scope link src 172.20.0.2

Result as expected. Please consult either man ip or man route.

root@debian-ipv6:/# ip addr
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: sit0@NONE: mtu 1480 qdisc noop state DOWN group default qlen 1
link/sit 0.0.0.0 brd 0.0.0.0
137: eth0@if138: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default
link/ether 02:42:ac:14:00:02 brd ff:ff:ff:ff:ff:ff link-netnsid 0
inet 172.20.0.2/16 brd 172.20.255.255 scope global eth0
valid_lft forever preferred_lft forever
inet6 2601:600:8880:7c80::502/120 scope global nodad
valid_lft forever preferred_lft forever
inet6 fe80::42:acff:fe14:2/64 scope link
valid_lft forever preferred_lft forever

Also as expected.

I think i will be using macvlan for now as that allows a single container to talk full ipv6 with ease!

Completely non-sensical, but OK. Dont't learn, just use deprecated hacks and wonder why they stop working at some point in the near future.

@scyto
Copy link

scyto commented Sep 16, 2019

@hr1232 > Completely non-sensicle but OK. Dont't learn, just use deprecated hacks and wonder why they stop working at some point in the near future.

No thats pragmatically using something that works for what i wanted to do (ipv4 and ipv6 enabled pihole with non port conflicts on host) - it worked quickly and easily.

Moreover i am trying to learn so thank you for your patience - what may seem obvious to you isn't obvious to others - esp with vast amounts of conflcting information on the webs. For example in your docs is the first time i have manged to get the IPv6 routes displayed with ip -6 route show - other posts i followed were BS.

I am currently in a state where, as i said, i do have the following section enabled:

{
"ipv6": true,
"fixed-cidr-v6": "2601:600:8880:7c80::500/120"
}

I am fairly comfortable with IPv4 subnetting and i think i have done the IPv6 correctly,. I think i may be in the NDP scenario (as will almost all folks) as we will always be subnettings the scheme provided to us by an ISP - most of us don't have the luxry of the ISP giving us multiple independent /64 like you show in all the diagrams.

If my scenario is indeed the NDP Proxying may i suggest you change the title of that section to indicate the outcome (Subnetting you ISP provide subnet) as it wasn't obvious to me until the 15th reading it might be my scenario - not because i am stupid, but because there is a vast amount of highly complex information to parse.

though i am not sure i need to do the NDP as my host route table looks like this

`alexb@SYN02:/var/packages/Docker/etc$ ip -6 route show

  • 2601:600:8880:7c80::500/120 dev docker0 proto kernel metric 256 linkdown pref medium
  • 2601:600:8880:7c80::500/120 dev docker0 metric 1024 linkdown pref medium
  • 2601:600:8880:7c80::600/120 dev docker-282a4aac proto kernel metric 256 linkdown pref medium
  • 2601:600:8880:7c80::600/120 dev docker-282a4aac metric 1024 linkdown pref medium
  • 2601:600:8880:7c80::/64 dev ovs_eth1 proto kernel metric 256 pref medium
  • 2601:600:8880:7c80::/64 dev ovs_eth0 proto kernel metric 256 pref medium
  • fe80::/64 dev ovs_eth0 proto kernel metric 256 pref medium
  • fe80::/64 dev ovs_eth1 proto kernel metric 256 pref medium
  • fe80::/64 dev docker0 proto kernel metric 256 linkdown pref medium
  • fe80::/64 dev docker-282a4aac proto kernel metric 256 linkdown pref medium
  • default via fe80::1ae8:29ff:feb5:132a dev ovs_eth1 metric 1024 pref medium`

When i look at your 'routed network' example I note they have example route add comands tagged clearly against where to run them - doing the same in the NDP diagram might help - i don't expect you to tell me how to run mine or what IPs to use., i think i can figure that out (maybe not).

I just need to a)identify which scenario documented i should be following and b)if i have all tne right commands. BTW i tried the NDP commands you have documented and it didn't fix a thing for me.

You beat me for not trying to learn, well i am here to learn, please help and give some pointers.

PS for example:
Is this run on the host or the c008 container $ ip -6 neigh add proxy 2001:db8::c009 dev eth0 ?
also once this is run why does the next command show an output of route show that only includes the c008 route?

@scyto
Copy link

scyto commented Sep 16, 2019

I have tried running this on the hosts to no avail:

sudo sysctl net.ipv6.conf.ovs_eth0.accept_ra=2
sudo sysctl net.ipv6.conf.ovs_eth1.accept_ra=2

sudo sysctl net.ipv6.conf.ovs_eth0.proxy_ndp=1
sudo sysctl net.ipv6.conf.ovs_eth1.proxy_ndp=1

sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth1
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth1

@hr1232
Copy link

hr1232 commented Sep 16, 2019

I have tried running this on the hosts to no avail:

sudo sysctl net.ipv6.conf.ovs_eth0.proxy_ndp=1
sudo sysctl net.ipv6.conf.ovs_eth1.proxy_ndp=1

sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth1
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth1

Why would you do something like that? Please don't confuse the terms proxy and router.
sys.net.ipv6.conf.all.forwarding=1
sys.net.ipv6.conf.default.forwarding=1

@scyto
Copy link

scyto commented Sep 16, 2019

Thanks for replying
I would do something like that because in the main section it says to do something like that and in the NDP section it says to do something like that. I have no idea which section i am supposed to be following - aka which is my scenario.

Which of the commands should i have not run? Did my running them harm anything.

After all i did documented in my previous post i have the router pinging the 502 machine (on the default bridge) and vice versa but not the 602 machine on the custom bridge.

I ran the two extra commands above, no change. Also the doc implies they are not needed if one has modified the dockerd.json?

@scyto
Copy link

scyto commented Sep 16, 2019

ok i see why the 602 failed, i am a dumbass it should have been.

sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::502 dev ovs_eth1
sudo ip -6 neigh add proxy 2601:600:8880:7c80::602 dev ovs_eth0
sudo ip -6 neigh add proxy 2601:600:8880:7c80::602 dev ovs_eth1

serves me right for copying and pasting.

Ok now i ahve this I can write it all up for all those synology folks wondering why IPV6 doesn't work when it has UI items - bloody synology never finished supporting the scenario.

I will call it good, thanks for your help figuring out, hopefully the guide i will write will help mean less folks end up here.

Any chance the NDPPD you mentioned would be standard part of docker (if it makes it simpler?)

@scyto
Copy link

scyto commented Sep 16, 2019

Hmm this is exactly the article I needed about 10 hours ago. Seems about right level... but seriously this is perfect. I never found it because a google would never turn a German doc up....

This is way better the formal docs. Still not sure why only the NDP method worked for me - maybe because I didn’t create a route on my router? I think the docs could be improved by starting with ‘if you are in this scenario do X. For most people getting started the instruction below will be the default....

https://www.translatetheweb.com/?from=&to=en&ref=SERP&dl=en&rr=UC&a=https%3a%2f%2fwww.heikorichter.name%2fpost%2f123%2fdocker-und-ipv6%2f#

@hr1232
Copy link

hr1232 commented Sep 16, 2019

Forget everything you EVER heard about NDP. NDP is a very foolish way of pressing IPv6 into the broken, outdated NAT-concepts of IPv4. Please remove every NDP-thing you did from your Docker host. It is not possible to use routing and proxying simultaniously. Also remember to activate IPv6 routing in the host via the 2 commands stated above. Lastly, call your ISP and demand to be given a prefix of 48-52 bits of length as is the RFC-default for end users. If they ignore you, try tunnelbroker.net, they will give you a 48 Bit Network for free. Alternatively, you can also add 32 to every bitmask in my examples but remember that this will rob your containers of the ability to use SLAAC internally because it will leave you with a 96 bit subnet in your docker networks.

PS: you don't need the sysctl accept_ra command, either. This ist für SLAAC, but Docker uses DHCPv6 by Default. If you manipulate accept_ra that will only give you trouble with tbe Hosts external connection.

@hr1232
Copy link

hr1232 commented Sep 16, 2019

@hr1232 > Completely non-sensicle but OK. Dont't learn, just use deprecated hacks and wonder why they stop working at some point in the near future.

No thats pragmatically using something that works for what i wanted to do (ipv4 and ipv6 enabled pihole with non port conflicts on host) - it worked quickly and easily.

That method has been marked deprecated for years. Therefore your Docker may stop working after any apt upgrade without any prior notice.

Moreover i am trying to learn so thank you for your patience - what may seem obvious to you isn't obvious to others - esp with vast amounts of conflcting information on the webs. For example in your docs is the first time i have manged to get the IPv6 routes displayed with ip -6 route show - other posts i followed were BS.

Always a good idea to consult man ip instead of Google. Only ask Google for specific questions, not for man pages.

I am currently in a state where, as i said, i do have the following section enabled:
{
"ipv6": true,
"fixed-cidr-v6": "2601:600:8880:7c80::500/120"
}
I am fairly comfortable with IPv4 subnetting and i think i have done the IPv6 correctly,. I think i may be in the NDP scenario (as will almost all folks) as we will always be subnettings the scheme provided to us by an ISP - most of us don't have the luxry of the ISP giving us multiple independent /64 like you show in all the diagrams.

If you anable routing AND NDP that won't work for you. With routing you give Docker a subnet for every docker network. With NDP you assign ip addresses from your external network and do something very dirty and awful - some mixture between NAT and Proxy.

though i am not sure i need to do the NDP as my host route table looks like this
alexb@SYN02:/var/packages/Docker/etc$ ip -6 route show * 2601:600:8880:7c80::500/120 dev docker0 proto kernel metric 256 linkdown pref medium * 2601:600:8880:7c80::500/120 dev docker0 metric 1024 linkdown pref medium * 2601:600:8880:7c80::600/120 dev docker-282a4aac proto kernel metric 256 linkdown pref medium * 2601:600:8880:7c80::600/120 dev docker-282a4aac metric 1024 linkdown pref medium * 2601:600:8880:7c80::/64 dev ovs_eth1 proto kernel metric 256 pref medium * 2601:600:8880:7c80::/64 dev ovs_eth0 proto kernel metric 256 pref medium * fe80::/64 dev ovs_eth0 proto kernel metric 256 pref medium * fe80::/64 dev ovs_eth1 proto kernel metric 256 pref medium * fe80::/64 dev docker0 proto kernel metric 256 linkdown pref medium * fe80::/64 dev docker-282a4aac proto kernel metric 256 linkdown pref medium * default via fe80::1ae8:29ff:feb5:132a dev ovs_eth1 metric 1024 pref medium

Should Work but won't as long as you try to route and proxy an ip address at the same time. Hang on to my examples commands, just replace the prefix and the bitmask. Don't try to subnet within address blocks. If your ISP gives you /64 prifix, use the entire fifth Block (bits 65-80) for subnetting so you will end up with /80 subnets. Use a Block representing locally administered Mac addresses for routing to your Docker Host so you won't run into conflicts with your other devices doing SLAAC....

I just need to a)identify which scenario documented i should be following and b)if i have all tne right commands. BTW i tried the NDP commands you have documented and it didn't fix a thing for me.

A) The only valid scenario is routing. There ist no NAT in IPv6 so trying to use it is nonsense.
B) If you try to use proxy hacks AND routing it will never work. Having an IP in both sides of a router makes no sense.

PS for example:
Is this run on the host or the c008 container $ ip -6 neigh add proxy 2001:db8::c009 dev eth0 ?

In neither

also once this is run why does the next command show an output of route show that only includes the c008 route?

Proxied IP-addresses (pseudo-Nat) is not routing.

@scyto
Copy link

scyto commented Sep 16, 2019

Thanks for all your pointers, and if that german articles was yours, thanks for that too!
Either way that document is right about the docs and that they are not that helpful - that article is super helpful -maybe some supplemental doc info like that article is what is needed - to distil the detail into the real world. I have a poor analogy - but most people don't need to know the inner workings of a car to drive it, they need the shorter instructions on what to do, how to get from A to B etc. This is just a suggestion i am throwing into the conversation here. I don't think the doc pages are broken, just confusing for the uninitiated.

Oddly i have done both the NDP commands and routing commands (minus setting up a route on the router back to each bridges subnet as i stopped before doing that) it is all working.

My next task is tear it all down and try just the routing and eliminate the NDP.

My confusion was that because my ISP is Comcast (a US home cabled tv company) demanding anything from them will be next to impossible. It only leaves me the option of further dividing their provided /64 and the NDP is the only picture that matched my scenario.

Yes i know proxied IP addresses are not routing,. To your point if it is the only valid scenario - that is exactly the issue with the document - it throws all scenarios at one including ones you have stated should not be used.

I don't know why you have said i shouldn't try subntetting the ISP given /64 block?
I have subbnetted them down to /120 which gives me more network IDs than i will ever need each with 256 hosts, which for me is more than i will ever have running on my two synologies :-)
Also because i only use the second 64 bit subnets of 000:000:0000:nnnn I will never hit a SLACC issue as it never issues anything with all those zeros. I assume because the /64 is derived from MAC address.

And yeah i get the issue with using deprecated mechanisms. (though as a point of order, the doc pages give no indication of this https://docs.docker.com/network/macvlan/ and the original github just says it was moved upstream to experimental and provides only a dead link https://github.com/gopher-net/macvlan-docker-plugin)

And thanks for the tip on RA 2 - i didn't commit those syctls to file so a reboot will get me back to default :-) that may explain why the docker host interfaces suddenly became really slow at acquiring their IPv6 SLACC.

Does beg question is there any plans to implement SLACC for docker? - i can go look in the issues list and see what the state of that conversation is like.

Again thanks for your patience, i can tell this is frustrating for you, my only parting comments is don't assume folks are complaining about the docs to personally slight or attack you. Folks are finding the doc confusing from the angle of 'what should I do' and 'how does this relate to 'my' real world' AND yes one can argue the doc, like a man page shouldn't do that but hey that's us dumb masses for you.

Thanks!

@hr1232
Copy link

hr1232 commented Sep 16, 2019

Oddly i have done both the NDP commands and routing commands (minus setting up a route on the router back to each bridges subnet as i stopped before doing that) it is all working.

That is, because your entire /64 subnet is in the same broadcast domain ("on the same wire") as your Docker host. If you do NDP Proxy/NAT your host accepts packets for these ip addresses and the kernel's internal routing does the rest to shove them into Docker. This only works with a /64 prefix, though. If your ISP was aware of RFCs they gave you something between /52 and /48, your home network would only have a /64 subnet and you would be in charge of routing the rest (or parts of the rest) towards your Docker host via a static route.

My confusion was that because my ISP is Comcast (a US home cabled tv company) demanding anything from them will be next to impossible. It only leaves me the option of further dividing their provided /64 and the NDP is the only picture that matched my scenario.

Even if you only have 64 bits, the correct way is to route a smaller subnet towards your Docker Host. Don't do NDP, just create a static route on your router.

I don't know why you have said i shouldn't try subntetting the ISP given /64 block?
I have subbnetted them down to /120 which gives me more network IDs than i will ever need each with 256 hosts, which for me is more than i will ever have running on my two synologies :-)
Also because i only use the second 64 bit subnets of 000:000:0000:nnnn I will never hit a SLACC issue as it never issues anything with all those zeros. I assume because the /64 is derived from MAC address.

When subnetting a 64 bit network you have to be aware of the inner workings of DHCPv6 and SLAAC. As you probably don't have static ipv6 addresses in your home network, you mustn't use any address space that may be used by your other clients in SLAAC. You also have to stay clear of any addresses your router may give to clients via DHCPv6 (that usually happens if a device doesn't support SLAAC or has been configured not to use it).

DHCPv6 usually uses ::nnnn which you routed towards Docker. Don't use that.

With SLAAC enabled, subnetting is a little more complex, as the last 64 bits consist of 12 bits for the mac vendor id, 16 static bits and 12 bits for the mac device id. With privacy extensions enabled, those 64 bits become almost completely random (appart from a few exceptions).

Your best bet is to use a subnet that would normally be reserved for any Locally Administered MAC Address (LAA) as this area will never be used by SLAAC (with or without privacy extensions), nor by DHCPv6. For a given subnet 2001:db8:1:1::/64 an example subnet that is safe to use would be 2001:db8:1:1:600::/80. Just route that towards your Docker host and slice it into tiny subnets for each of your docker networks.

Just be arware, that your containers will not be able to use SLAAC anymore because the subnets are to small (SLAAC needs 64 bits). You should be safe because Docker hands out addresses via DHCPv6, but you have to make sure that all of your containers support that, because DHCPv6 is not the default. Not every container's kernel supports it and those that do, may have it deactivated.

And yeah i get the issue with using deprecated mechanisms. (though as a point of order, the doc pages give no indication of this https://docs.docker.com/network/macvlan/ and the original github just says it was moved upstream to experimental and provides only a dead link https://github.com/gopher-net/macvlan-docker-plugin)

I thought we had established that the docs are incorrect and/or outdated. Also, NDP is not really a Docker thing, but a Linux kernel thing.

Does beg question is there any plans to implement SLACC for docker? - i can go look in the issues list and see what the state of that conversation is like.

Internally, Docker sends out RAs towards the containers but they can only be used if the subnets in your docker nets are at least /64. Requesting entire subnets from your router and assigning them automatically to your Docker nets is quite complex. Also, most home routers don't support that, and most hosting companies have it disabled. So even if Docker could request subnets, chances are quite high that your router wouldn't be able to answer those requests.

Also, RFCs forbid to create subnets smaller than /64 (because SLAAC won't work there), so even if your home router did support prefix delegation and Docker did support prefix requests, you would still have to get a bigger prefix from your ISP.

The only home router I know of that really supports prefix delegation, is a high priced premium product, called Fritz!Box. Aside from the fact that your ISP probably won't spend 300$ on your router, that box is only sold in German speaking countries.

@scyto
Copy link

scyto commented Sep 16, 2019

Thanks. Again thanks for your help got it all working this morning with just routed on a clean synology system.

My /120 subnetting works ok for routed (i tested this morning) i don't mind it doesn't use SLACC as the /80 wont use SLACC either.

I guess i am lucky, my ubiquit router (not a home router that i have had for 3+ years) and the Asus home router before that have always worked perfectly with comcast IPv6 and their PD64 - not sure if this means SLACC worked or not. But again not relevant given the subnets won't have SLACC.

Understand your point about if SLACC did propogate out of some future docker version i would need a higher range from comcast.

As an aside part of the issue i had was i had not recognized that the Synology IPv6 implementation is very very flaky and often doesn't setup the right default routes on the host. I logged a support call on that this morning too - FYI for anyone else who gets here by googling.

@hr1232
Copy link

hr1232 commented Sep 16, 2019

My /120 subnetting works ok for routed (i tested this morning) i don't mind it doesn't use SLACC as the /80 wont use SLACC either.
I guess i am lucky, my ubiquit router (not a home router that i have had for 3+ years) and the Asus home router before that have always worked perfectly with comcast IPv6 and their PD64 - not sure if this means SLACC worked or not. But again not relevant given the subnets won't have SLACC.

Your /120 subnet is still in conflict with the default DHCPv6 range of your home router. Use the /80 range proposed earlier to avoid ip address conflicts between your Docker networks and your home network.

Ubiquity might make prefix delegations available if Docker ever supports prefix requests. AFAIK those functions are not planned for Docker, though.

@scyto
Copy link

scyto commented Sep 16, 2019

I had to think about that for a while and look at my IP addresses to understan.

For example if I use
2601:600:8880:7c80:8::/80 the first container gets an IP of 2601:600:8880:7c80:8::2/80
2601:0600:8880:7c80::200/120 the first container gets an IP of 2601:0600:8880:7c80::202

I see that my router issues a machine an address like this:

IPv6 Address. . . . . . . . . . . : 2601:600:8880:7c80::15f
IPv6 Address. . . . . . . . . . . : 2601:600:8880:7c80:193:9e3c:6229:3f29
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:3ca9:36d9:5a34:4540
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:984b:a748:924e:3676
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:c8cf:58a5:576a:4d3c

Why does it issue both?
I.e. the first which could easily conflict and the second which doesn't?
(and for now lets ignore the temp ones, i can go read up on those)

OK I will change to /80 as i now see issue - got it. Thanks.

@scyto
Copy link

scyto commented Sep 16, 2019

Yup already moved to /80.

Also what RFC forbids anything smaller than 64? I tried to find and came up blank. I would like to read it so i learn more.

The /80 doesn't really seem to be for this scenario (at least i have yet to find the explicit RFC) but rather it was a solution for IPv4 encapsulation - but i see why it works and why it avoids the conflicts you mention https://trac.tools.ietf.org/html/rfc7600 and why we can use it here - wow the RFCS for IPv6 have changed a lot since i read them last in 1997!

It took me writing out the bit mask to understand why i was being stupid and of course i guess we should document here that one shouldn't use the 2601:0600:8880:7c80:0000:0000:0000:0000/80 as that also conflicts with the 2601:0600:8880:7c80:0000:0000:0000:0000/64 space!

@hr1232
Copy link

hr1232 commented Sep 17, 2019

I had to think about that for a while and look at my IP addresses to understan.
For example if I use
2601:600:8880:7c80:8::/80 the first container gets an IP of 2601:600:8880:7c80:8::2/80
2601:0600:8880:7c80::200/120 the first container gets an IP of 2601:0600:8880:7c80::202

Don't use any /80 subnet, that's just as bad as using /120. You have to use a vendor id that represents a locally administered MAC range, e.g 2601:600:8880:7c80:600::/80

I see that my router issues a machine an address like this:
IPv6 Address. . . . . . . . . . . : 2601:600:8880:7c80::15f
Provider to you by your router via DHCPv6
IPv6 Address. . . . . . . . . . . : 2601:600:8880:7c80:193:9e3c:6229:3f29
Assigned to your system by itself by SLAAC based on your mac. You should also have a link-local address within fe80::/10 with the identical lower 64 bits.
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:3ca9:36d9:5a34:4540
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:984b:a748:924e:3676
Temporary IPv6 Address. . . . . . : 2601:600:8880:7c80:c8cf:58a5:576a:4d3c
Those are regularly generated by your system because you have activated the IPv6 privacy extensions.
Why does it issue both?

Because you configured your device that way. Not a problem of your router, but a problem of your device. You should configure your devices to use either DHCPv6 OR SLAAC. Default is to use DHCPv6 and fall back in SLAAC If No DHCPOFFER comes in.

@hr1232
Copy link

hr1232 commented Sep 17, 2019

Also what RFC forbids anything smaller than 64? I tried to find and came up blank. I would like to read it so i learn more.

The old 2460 and the current 8200 both tell you how the lower 64 bits are used to create your host id based on the mac. Therefor you don't have those bits for your routing issues. They also tell you that end users should be provided with 48-52 bit networks so they can actually do subnetting. That's the closest RFC's come to forbidding things. Because they are not laws, you have to decide yourself If you want to follow them or not.

@rpodgorny
Copy link

ok, nice conversation, but it's 2020 and i still landed here because of the outdated documentation. can someone please take a look at it? thanks...!

https://docs.docker.com/config/daemon/ipv6/

@johntdavis84
Copy link

As there are several tickets written by people claming that IPv6 is broken, here is the solution how to configure it corretly. Someone should change the documentation accordingly.

Docker IPv6 is working perfectly, it's just the documentation that doesn't tell people what to do in order to set it up correctly.

Docker will not do SLAAC (Stateless Address Autoconfiguration), nor will it use DHCPv6. As it also won't do NATv6 (thank god!), you will have to assign ipv6 prefixes to your networks manually. The default bridge network has to be assigned a network, you can do this by adding the following to daemon.json:

{
"ipv6": true,
"fixed-cidr-v6": "2001:db8::/64",
}

For any other network you might have, you can decide wether or not it will be ipv6 capable. To create an IPv6 capable network, do the following:
docker network create --ipv6 --subnet "2001:db8:1::/64" testnet

Also remember, that this is not NATv6 (thank god!). Therefor you will have to have a route from your router to your docker host!

Thanks for this. It's the most concise explanation of what's going on that I've managed to find in a couple of weeks of googling in my free time.

A few points I'd like to respectfully bring up, as someone who just installed docker for the first time a couple of weeks ago and only want it to run pihole--for which I need to handle v6 DNS requests.

  1. I'm writing this on Friday, July 31, 2020. The docs still have not been updated to give instructions to make this work. Just enabling ipv6 is not enough. Respectfully, I cannot understand why this has not been updated in literally years. It took me hours of spinning in circles, googling, thinking I was doing something wrong before I hit on just the right combination of Google queries to realize I didn't have all the instructions.

  2. With deepest respect, no one who is not a network technology enthusiast cares about whether using NAT or whatever other technology to get this working is philosophically/technologically wrong or not. So long as the published documentation is incomplete, v6 networking is functionally broken for anyone who is not already in the know on how both docker and network routing works. It will seem even more broken given how easy getting v4 networking going with docker is. One works out of the box, and one requires modifying the host's ipv6 routing table. I spent days educating myself on how v6 worked, how docker networking worked, and how they worked together before I figured it out. Most people who are casually interested in docker for running something like pihole in isolation are going to give up after an hour or two (or less) and just decide "docker is broken" and never come back.

  3. Even once the documentation is updated, this is still entirely too hard for most users, to the point of being not worth the effort. As an example, again, consider pihole (see: https://hub.docker.com/r/pihole/pihole/). Notice that their sample compose file uses v. 3 (which isn't even compatible with IPv6 without doing even more extra steps), and contains no instructions for how to make v6 routing work--which is kind of an important feature to have if you're running a network with v6 enabled and trying to set up a DNS server. Using the official Docker instructions to enable IPv6 and the official pihole compose file, v6 DNS resolution will never work and thus v6 adblocking will never work and both docker and or pihole will appear broken to the end user.

  4. This all leaves out the problem of making Docker less system-agnostic. Adjusting the host's v6 routing table is going to vary in procedure depending on what flavor of linux the host is running, and the best/safest way to set the routing table up at boot is going to vary as well. It makes it impossible to just prepare, say, a docker-compose.yaml and carry it from one docker host to another and have it just work with a docker-compose up or docker-compose start command.

My entire network uses dual stack v4 + v6 networking, with the v6 provided from my ISP via prefix designation. Everything is automatic and works perfectly, and any deficiencies are purely the fault of my router's software being old enough to make a few things tedious on initial setup. But it all works automatically once it's set up, and I don't have to worry about it.

Again, with deepest respect to the people who work so hard to maintain this software, I don't really understand why the Docker daemon can't just be configured to run something like prefix designation for all running containers that request ipv6 support.

I would hope a solution can be found to make v6 containerized networking just as transparent and easy to use for the end user as v4 networking. I know there must be a way. End users only care whether it works or not. Underlying technical implementation and philosophy doesn't matter to them.

And right now, out of the box, with the current documentation, ipv6 networking is effectively broken for the end users.

Users who are sophisticated enough to know they need ipv6 networking in docker shouldn't have to try to make it work with incomplete official instructions and the variability of host hardware/software in play.

@scyto
Copy link

scyto commented Aug 1, 2020

@johntdavis84 nice post, never realized it was that big an issue, had ipv6 working on my Synology outdated docker for nearly a year with macvlan and pihole containers. As you state the key is to statically subnet you ipv6 range appropriately and adding right next hop router to your router to point to host (anyone with a Synology I have a complete how to doc, pm me if u want)

To add to your post I never got it working on default bridge only user created ones (btw folks NEVER use the default bridge for anything, that’s why y’all have crappy name resolution issues between containers, only ever use a bridge you create).

@Martin-Luther
Copy link

Martin-Luther commented Oct 16, 2020

Linux myhost 4.19.0-5-amd64 #1 SMP Debian 4.19.37-5 (2019-06-19) x86_64 GNU/Linux

My Docker is behind a load balancer and a firewall (another host).
In short, I am only using a single IPv6 on some of my hosts, and I wanted to use it for Docker
I had to edit /etc/sysctl.conf by adding

net.ipv6.conf.eth0.accept_ra=2
net.ipv6.conf.default.forwarding=1
net.ipv6.conf.all.forwarding=1

I solved my problem thanks to the following link
https://dker.ru/docs/docker-engine/user-guide/network-configuration/default-bridge-network/ipv6-with-docker/

You can pick any IPv6, for the subnet calculation, I have used this site :
http://www.gestioip.net/cgi-bin/subnet_calculator.cgi

You just need to concat: network/Prefix length to get the fixed-cidr-v6

@johntdavis84
Copy link

johntdavis84 commented May 6, 2021 via email

@pcgeek86
Copy link

I'd like to be able to spin up a Docker container with the following attributes:

  • Macvlan to host network adapter
  • IPv6-only (no IPv4 address configured)
  • IPv6 address obtained automatically by SLAAC (Stateless Address Auto-Configuration)

Is this possible in 2022?

I understand that SLAAC hasn't worked in previous versions of the Docker engine. However, if the macvlan network driver works properly, I can't see why I should have to statically configure an IPv6 CIDR block on all of the Docker daemons running on my network.

Shouldn't macvlan enable a container to obtain a SLAAC address from my switch's IPv6 Prefix Delegation (PD), just like my Linux, Windows, MacOS, and Android devices all do?

FYI I'm using an Ubiquiti EdgeRouter X behind SpaceX Starlink satellite internet. IPv6 works great, but not (yet) in Docker containers.

@maffe
Copy link

maffe commented May 11, 2022

The instructions need to be updated to include the "fixed-cidr-v6" key with a valid ipv6 cidr value.

This was done in #11148.

@craig-osterhout
Copy link
Contributor

Closing the original issue as fixed. Feel free to continue the conversation in the comments.

@docker-robott
Copy link
Collaborator

Closed issues are locked after 30 days of inactivity.
This helps our team focus on active issues.

If you have found a problem that seems similar to this, please open a new issue.

/lifecycle locked

@docker docker locked and limited conversation to collaborators Mar 12, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area/compose Relates to docker-compose.yml spec or docker-compose binary lifecycle/locked
Projects
None yet
Development

No branches or pull requests