-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slow announcer for orchestrator #15
Comments
Here is the start of the announcer:
So it means that the following code is causing the above error message: me! until netstat -lnt | grep :$port >/dev/null; \
do sleep 1; \
done" |
So all I can find is that we have a discrepancy when trying to How it should look:
What it is currently:
My guess is something funky is going on with $port / a race condition in the bootstrapping of the container. https://github.com/yldio/paz/blob/d53997470d5263a2334a16b3835380a8d849dd22/unitfiles/1/paz-orchestrator-announce.service#L19-L21 port=$(docker inspect -f '{{ index .NetworkSettings.Ports \"9000/tcp\"}}' paz-orchestrator \
| sed 's/.*Port://' \
| sed 's/].+*//'); \ Maybe @sublimino might be able to offer some input into robustifying the shell code to bail out if this condition occurs. |
@tomgco thanks for all the details! i'll be taking a look at this on the weekend. |
I've seen this before when units fail to start up correctly, probably, as you identify @tomgco, when things don't start in the right order. This is an issue in and of itself, but one I haven't any insight into atm. In the meantime, if this bash code can be made more robust then perhaps we may not see it again. Got any ideas, @sublimino? |
I think adding dependencies to the units and leveraging systemd's native
|
👍 |
above @sublimino is referring to #30 btw |
This cropped up again when provisioning a machine on Digital Ocean on paz-orchestrator-announce.service, maybe something is missing from the unit file? |
A fleetctl stop and start on paz-orchestrator-announce.service fixed this. |
Yeah I've seen this I've seen this quite a bit this weekend :/ |
However this timed out and then:
However the service seemed to be up and running:
The text was updated successfully, but these errors were encountered: