-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rack awareness, replication across availability zones #5738
Comments
@solsson it is hard to understand the description of the issue :-) I think you have misunderstood on the configuration of minio. We do not use the term "availability zone" in our documentation, I think you mean "region" setting? Typically a minio cluster will be deployed such that the minio servers can span across racks, but never across data centers. If you want to replicate data in a minio cluster to another cluster in a different data center then you can use |
@krishnasrinivas Thanks for trying anyway :) I tried to follow the issue template. Maybe this should start as a Slack discussion, but the problem with that is that others will not find it later. Two use cases:
The term "rack" is from Kafka: https://kafka.apache.org/documentation/#basic_ops_racks |
minio uses erasure coding to store data safely across multiple machines. You could refer https://docs.minio.io/docs/distributed-minio-quickstart-guide What I am understanding from your question is, you would like to run DR site in another region(s). As @krishnasrinivas pointed out, you could do
You could run distributed minio in machines at "garage" and "basement". This way erasure coding takes care of data high availability and fault tolerance. |
Thanks. I'll read up on erasure coding. I falsely assumed that N/2 denoted a regular replica count, but in https://docs.minio.io/docs/minio-erasure-code-quickstart-guide I see that:
where the word any is key. |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Can minio instances be started with an arg/env that identifies the availability zone, so that replication spans as many zones as possible?
Expected Behavior
In a multi-zone cluster at for example Google or AWS, a minio cluster spanning multiple availability zones in the same region should replicate across zones as much as possible.
For example with 6 instances across 3 zones and standard class being EC:3, there should be one replica per zone.
Current Behavior
I don't know :)
Possible Solution
A startup script can look up the zone name from the machine, and provide it as arg or env to the container. I think config.json should stay the same across all minio instances in the cluster.
Context
We have this for Kafka, Yolean/kubernetes-kafka#41.
Your Environment
Kubernetes using the official Helm chart. At GKE.
The text was updated successfully, but these errors were encountered: