Permit configuring IMDS client behavior on timeout #1233

Mark-Simulacrum · 2025-01-06T18:07:39Z

Describe the feature

The IMDS client in aws-config, both for internal usage (e.g., credential fetching) and as a public-facing client (e.g., to resolve instance metadata in user programs), should support an "expected to exist" mode in which, for example, failed TCP connects are retried (unlike the current default).

Use Case

On EC2 instances, IMDS is not always 100% available; we periodically see short blips of unavailability in production, as with any other service. Retries keep these blips from surfacing as, e.g., service launch failures caused by an inability to provision credentials or discover local identity (instance ID).

Proposed Solution

At minimum, the client should expose a knob to enable retrying TCP failures (connect and read timeouts). Ideally, the solution would let us specify that we do in fact expect a response, so that the normal SDK retry behaviors apply, rather than having to chase those behaviors over time with more knobs.
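As a rough illustration of what such a knob could look like, here is a minimal, std-only sketch of a retry wrapper that treats timeouts as transient when a flag is set. `RetryPolicy`, `is_retryable`, and `with_retries` are hypothetical names invented for this example; they are not part of aws-config or the SDK's retry machinery, and the exponential backoff schedule is an arbitrary choice.

```rust
use std::io;
use std::thread::sleep;
use std::time::Duration;

/// Hypothetical policy knob (not an aws-config type).
#[derive(Clone, Copy, Debug)]
struct RetryPolicy {
    /// Total number of attempts, including the first one.
    max_attempts: u32,
    /// When true, connect/read timeouts are treated as transient.
    retry_timeouts: bool,
    /// Delay before the second attempt; doubled for each later attempt.
    base_backoff: Duration,
}

/// Decide whether an error is worth another attempt under the policy.
/// Read timeouts on a std `TcpStream` can surface as `WouldBlock` or
/// `TimedOut` depending on the platform, so both kinds are checked.
fn is_retryable(policy: &RetryPolicy, err: &io::Error) -> bool {
    policy.retry_timeouts
        && matches!(
            err.kind(),
            io::ErrorKind::TimedOut | io::ErrorKind::WouldBlock
        )
}

/// Run `op` until it succeeds, fails with a non-retryable error,
/// or the attempt budget is exhausted.
fn with_retries<T>(
    policy: &RetryPolicy,
    mut op: impl FnMut() -> io::Result<T>,
) -> io::Result<T> {
    let mut attempt = 1;
    loop {
        match op() {
            Ok(value) => return Ok(value),
            Err(e) if attempt < policy.max_attempts && is_retryable(policy, &e) => {
                // Exponential backoff: base, 2x base, 4x base, ...
                sleep(policy.base_backoff * 2u32.pow(attempt - 1));
                attempt += 1;
            }
            Err(e) => return Err(e),
        }
    }
}
```

In this sketch, `op` would wrap whatever performs the metadata request (for instance, a `TcpStream::connect_timeout` call followed by a read). With `retry_timeouts` set to `false`, the first timeout is returned to the caller unchanged, which mirrors the "IMDS may not exist" default described above.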

The SDK defaults may make less sense for cases where IMDS may not be available, but explicit usage of the IMDS client seems like a good indicator of "I expect it to work" (at least as a default).

We would want to configure this for the implicit IMDS client created within credentials provider chains, but are OK explicitly threading a client into that state if needed.

Other Information

No response

Acknowledgements

I may be able to implement this feature request
This feature might incur a breaking change

A note for the community

Community Note

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
If you are interested in working on this issue, please leave a comment

## Motivation and Context

Addressing awslabs/aws-sdk-rust#1233

## Description

Add ability to configure the retry classifier on the IMDS client.

## Testing

Added a new unit test to ensure that a user-defined retry classifier is being used.

## Checklist

- [x] For changes to the smithy-rs codegen or runtime crates, I have created a changelog entry Markdown file in the `.changelog` directory, specifying "client," "server," or both in the `applies_to` key.

----

_By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice._

landonxjames · 2025-01-23T21:01:36Z

This behavior has now been added in version 1.5.15 of the `aws-config` crate. See the new `retry_classifier` method on the IMDS builder. Additionally, so that you don't have to implement the whole retry classifier yourself, we added a `with_retry_connect_timeouts` method to the `ImdsResponseRetryClassifier` that you can configure and pass in.

ysaito1001 · 2025-01-28T16:15:47Z

The fix was released on 2025-01-23

github-actions · 2025-01-28T16:16:06Z

Comments on closed issues are hard for our team to see.
If you need more assistance, please either tag a team member or open a new issue that references this one.
If you wish to keep having a conversation with other community members under this issue feel free to do so.

Mark-Simulacrum added feature-request A feature should be added or improved. needs-triage This issue or PR still needs to be triaged. labels Jan 6, 2025

ysaito1001 removed the needs-triage This issue or PR still needs to be triaged. label Jan 15, 2025

landonxjames mentioned this issue Jan 20, 2025

IMDS retries smithy-lang/smithy-rs#3975

Merged

1 task

ysaito1001 added the pending-release This issue will be fixed by an approved PR that hasn't been released yet. label Jan 23, 2025

ysaito1001 closed this as completed Jan 28, 2025
