cloud: Automatically add locality flag when using Helm to deploy CDB in K8 #23940

tlvenn opened this issue Mar 16, 2018 · 13 comments

@tlvenn (Contributor) commented Mar 16, 2018

There are two well-known node labels that could be used to dynamically build the locality information passed to CockroachDB on startup:

  • failure-domain.beta.kubernetes.io/region
  • failure-domain.beta.kubernetes.io/zone

We could probably set the country as well, if the node exposes failure-domain.beta.kubernetes.io/country or if we detect the cloud provider and then derive the country from its region name.
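
To make it concrete, here is a minimal sketch of the kind of startup wrapper this could produce (a hypothetical script; it assumes REGION and ZONE have already been resolved from the node's failure-domain labels, and the --join addresses just mirror the stock StatefulSet config):

```bash
#!/usr/bin/env bash
# Sketch only: REGION and ZONE are assumed to have been resolved from the
# node's failure-domain labels before this script runs.
set -euo pipefail

exec /cockroach/cockroach start \
  --insecure \
  --locality="region=${REGION},zone=${ZONE}" \
  --join=cockroachdb-0.cockroachdb,cockroachdb-1.cockroachdb,cockroachdb-2.cockroachdb
```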

@a-robinson, what do you think?

Jira issue: CRDB-5797

@a-robinson (Contributor)

That's a reasonable idea, yeah. Is there any reason we wouldn't want to just use all available failure-domain key-value pairs rather than only picking out specific ones?

One concern I have is that we may not want to use failure domain tags that could change for any given node when it's rescheduled. For example, if node 1 runs in zone=a but then its machine goes down for maintenance and it gets rescheduled to zone=b, we may have to do a lot of rebalancing as a result of the change. It'd be best to only use failure domain tags that match the scope of the persistent volume being used by the node -- i.e. if a PV can't be moved from region to region then using a region tag is good, but if it can be moved from zone to zone then using a zone tag might not be worth it.

@tlvenn (Contributor, Author) commented Mar 29, 2018

> That's a reasonable idea, yeah. Is there any reason we wouldn't want to just use all available failure-domain key-value pairs rather than only picking out specific ones?

I don't think so, but can we order them properly in all cases?

> One concern I have is that we may not want to use failure domain tags that could change for any given node when it's rescheduled. For example, if node 1 runs in zone=a but then its machine goes down for maintenance and it gets rescheduled to zone=b, we may have to do a lot of rebalancing as a result of the change. It'd be best to only use failure domain tags that match the scope of the persistent volume being used by the node -- i.e. if a PV can't be moved from region to region then using a region tag is good, but if it can be moved from zone to zone then using a zone tag might not be worth it.

Yeah, totally agree on this.

@a-robinson (Contributor)

> I don't think so, but can we order them properly in all cases?

Great point.

I doubt this is going to have much value until it's less work to get a multi-region cockroach cluster running in kubernetes, so I'm not going to work on it immediately, but if you want to play around with it, contributions are welcome.

@a-robinson (Contributor)

@tlvenn Do you have any pointers to examples of kubernetes configs that do this? Unfortunately node labels do not appear to be accessible via the downward API, so we'd have to do some work inside the pod to talk to the Kubernetes API and retrieve the node labels from it, all before starting cockroach.

I'm starting to play around with multi-region clusters on kubernetes so getting this automatically would be awesome, but it'd be great if we could do so without having to insert a bunch of glue code.
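
For reference, the kind of glue I'd like to avoid looks roughly like the following init-step sketch. It assumes NODE_NAME is injected via the downward API (fieldRef: spec.nodeName), that kubectl is available in the init image, and that the pod's service account is allowed to get nodes:

```bash
#!/usr/bin/env bash
# Sketch only: resolve the node's failure-domain labels before starting cockroach.
# NODE_NAME is assumed to come from the downward API (fieldRef: spec.nodeName).
set -euo pipefail

REGION=$(kubectl get node "${NODE_NAME}" \
  -o go-template='{{index .metadata.labels "failure-domain.beta.kubernetes.io/region"}}')
ZONE=$(kubectl get node "${NODE_NAME}" \
  -o go-template='{{index .metadata.labels "failure-domain.beta.kubernetes.io/zone"}}')

# Hand the derived flag to the main container, e.g. through a shared emptyDir volume.
echo "--locality=region=${REGION},zone=${ZONE}" > /locality/flag
```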

@a-robinson (Contributor)

So based on upstream discussions we would indeed have to write some code to retrieve them from the API (and expand our RBAC scopes in order to allow such retrievals). It may be made easier in future Kubernetes releases, but that will be a ways off.
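
For completeness, the extra RBAC scope would amount to read access on node objects for the pod's service account, something like the following (the role names are hypothetical, and default:cockroachdb assumes the service account from our standard configs):

```bash
# Hypothetical names: grant the cockroachdb service account read-only access
# to nodes so an init step can look up the failure-domain labels.
kubectl create clusterrole crdb-node-reader --verb=get,list --resource=nodes

kubectl create clusterrolebinding crdb-node-reader \
  --clusterrole=crdb-node-reader \
  --serviceaccount=default:cockroachdb
```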

@a-robinson (Contributor)

One example: Yolean/kubernetes-kafka#41

@solsson commented Apr 8, 2018

Anyone here interested in collaborating on kubernetes/kubernetes#62078 (comment)?

@a-robinson (Contributor)

What do you have in mind, @solsson? I can't see us putting an init container for this into our default configuration - it adds extra mental overhead for people to understand how our deployment works, would require additional RBAC privileges (including adding RBAC privileges to our insecure deployment, which currently needs none), and would only really be useful right now for multi-zone Kubernetes clusters.

For multi-region cockroach deployments that span Kubernetes clusters, manually specifying the --locality flag in the config file for each Kubernetes cluster isn't a big deal.

@solsson commented Apr 13, 2018

Actually, I did have RBAC and extra mental overhead in mind :) It's up to you to balance priority, complexity, and resources for your use case, so I'll take it as a no for now.

@a-robinson added the A-orchestration (Relating to orchestration systems like Kubernetes), C-enhancement (Solution expected to add code/behavior + preserve backward-compat; pg compat issues are exception), and O-community (Originated from the community) labels on Apr 30, 2018
@github-actions (bot) commented Jun 7, 2021

We have marked this issue as stale because it has been inactive for
18 months. If this issue is still relevant, removing the stale label
or adding a comment will keep it active. Otherwise, we'll close it in
5 days to keep the issue queue tidy. Thank you for your contribution
to CockroachDB!

@Bessonov commented Jun 7, 2021

I think this is still an issue.

@github-actions (bot)

We have marked this issue as stale because it has been inactive for
18 months. If this issue is still relevant, removing the stale label
or adding a comment will keep it active. Otherwise, we'll close it in
10 days to keep the issue queue tidy. Thank you for your contribution
to CockroachDB!

@Bessonov

> a comment will keep it active
