Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make docker health check configurable #1522

Closed
wants to merge 1 commit into from

Conversation

richardpen
Copy link

@richardpen richardpen commented Aug 15, 2018

Summary

Fix #1235 Add an environment variable to disable the container health check.

Implementation details

Testing

  • Builds on Linux (make release)
  • Builds on Windows (go build -out amazon-ecs-agent.exe ./agent)
  • Unit tests on Linux (make test) pass
  • Unit tests on Windows (go test -timeout=25s ./agent/...) pass
  • Integration tests on Linux (make run-integ-tests) pass
  • Integration tests on Windows (.\scripts\run-integ-tests.ps1) pass
  • Functional tests on Linux (make run-functional-tests) pass
  • Functional tests on Windows (.\scripts\run-functional-tests.ps1) pass

New tests cover the changes:

Description for the changelog

Licensing

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@richardpen richardpen requested review from adnxn, sharanyad, yhlee-aws and a team August 15, 2018 22:08
@samuelkarp
Copy link
Contributor

samuelkarp commented Aug 15, 2018

Rather than disabling the health check feature, why not add heartbeat messages like we do for the ACS websocket?

@sharanyad sharanyad requested review from a team and removed request for a team August 15, 2018 23:15
@adnxn
Copy link
Contributor

adnxn commented Aug 16, 2018

Rather than disabling the health check feature, why not add heartbeat messages like we do for the ACS websocket?

Why would those two be equivalent changes? I don't follow.

@samuelkarp
Copy link
Contributor

Reading #1235 (comment), it looks like the underlying issue is the websocket being closed because there is no traffic over it:

Yes, those errors are caused by the fact that there is no activity in the connection. And this connection is used to publish resource usage metrics and container health metrics. So, if the ECS_DISABLE_METRICS=true is set and no containers are using the container health check feature, the connection will be closed periodically.

Rather than turning off a useful feature (health check support), it would make more sense to me to address the websocket inactivity issue.

@rhuddleston
Copy link

any update on this? Our logs get spammed with "Error getting message from ws backend" after setting ECS_DISABLE_MTERICS=true

@adnxn
Copy link
Contributor

adnxn commented Oct 16, 2018

closing in favor of #1624

@adnxn adnxn closed this Oct 16, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants