-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deploy dedicated Geth instances for all Prater Nimbus nodes #125
Comments
Another possible option would be to replace existing hosts with bigger ones instead of adding separate hosts for Geth. We could use Hetzner AX61-NVMe which have 2 x 1.92 TB NVMe which would be enough to run both Geth and Nimbus nodes on the same host, which would simplify setup, management, and debugging. |
Based on conversation with @zah I'm purchasing six AX61-NVMe hosts: After the migration the leftover AX41-NVMe hosts will be reused for macos and windows Geth nodes, as well as CI. |
Part of deploying dedicated Geth nodes. #125 Signed-off-by: Jakub Sokołowski <[email protected]>
I've provisioned the hosts: 1bdcf1ca infra-nimbus/ansible/inventory/test Lines 5 to 10 in 1bdcf1c
And started deploying Geth nodes on them. |
So far they have not started syncing:
But that's probably because of low peer numbers. |
I don't get it. The nodes are not syncing at all:
Despite having 50 peers each:
But nothing has been synced:
What the fuck is going on... |
The startup logs show we are correctly using Goerli network:
I don't get why it's not syncing. |
There's a lot of
Not sure if that's relevant though. |
Oooh, ok, now I see it:
We NEED a consensus layer node to learn what is the current head of the blockchain so we can start syncing the exec node. |
And now we are finally syncing:
|
Looks like for some reason some Geth nodes have fucked up
This issue suggests removing |
But that didn't help, and I had to remove all of
|
And we are seeing nimbus hosts proposing: https://prater.beaconcha.in/blocks?q=Nimbus%2Fv |
#125 Signed-off-by: Jakub Sokołowski <[email protected]>
#125 Signed-off-by: Jakub Sokołowski <[email protected]>
Tomorrow I will reuse 3 of the 6 leftover old prater hosts to do Geth nodes for the AWS/MacOS/Windows hosts. The remaining 3 hosts will be used for CI or decommissioned. |
#125 Signed-off-by: Jakub Sokołowski <[email protected]>
I configured a dedicated set of Geth nodes for Windows:
infra-nimbus/ansible/inventory/test Line 34 in cb448d6
|
We finished syncing:
So Windows host is done. |
Deployed a host for MacOS Prater nodes:
infra-nimbus/ansible/inventory/test Line 12 in 2dd9350
|
Decided to rename the hosts while adding a third one so as to simplify setup:
infra-nimbus/ansible/inventory/test Lines 5 to 7 in 08a744d
|
#125 Signed-off-by: Jakub Sokołowski <[email protected]>
Configured existing AWS and MacOS nodes to use the new Goerli Geth nodes:
Currently syncing: |
#125 Signed-off-by: Jakub Sokołowski <[email protected]>
I think this is done:
All are using the new Geth nodes. We can decommission the old AWS one. |
No longer necessary after dedicated metal hosts were deployed: #125 Signed-off-by: Jakub Sokołowski <[email protected]>
Got rid of the old AWS Geth Goerli node:
|
I consider this done. |
This has been neglected due to other priorities, like Mainnet nodes, but it's time to do a proper setup of one Geth node for each Nimbus node, as God intended. This will require quite a lot of hardware as the prater fleet involves 11 hosts and 33 nodes in total.
Since the current size of a snap-synced Geth node is about ~160 GB:
We'll need at least 200 GB per node, and about 4 nodes on each host. So 1 TB NVMe should be sufficient for a while.
The most likely candidate is a Hetzner AX51-NVMe host with 2x1 TB NVMes:
The text was updated successfully, but these errors were encountered: