CaOps should be able to receive an API call on any node that triggers a cluster-wide backup. This requires:
The backup to be as synchronized as possible. To absorb network delays, the API will always schedule the backup at the nearest rounded time.
Under normal usage, all Cassandra nodes must be up and running, and no node may be joining or leaving.
Since we are not sure how reliable Serf is on large clusters, we might need to implement something else to keep the scheduling consistent. Two options that I can think of are:
Using Cassandra itself, with a table whose consistency requirement equals the number of nodes. This appears to be the easiest solution, but it has many trade-offs. The main one is that this table would behave like a queue, which is an anti-pattern in Cassandra.
Using the HashiCorp Raft library, so that each CaOps agent has its own persisted state, which can also be controlled more tightly for this specific use case.