Recovery attempts limit is never reset #408

Closed

camelpunch opened this issue May 31, 2016 · 2 comments

@camelpunch (Contributor)
Something interesting seems to be happening with recovery: there's a line that decrements the attempts counter, but no accompanying reset. However, channel operations seem to recover on their own even when the number of attempts is set low.

Not a huge deal, but thought I'd record this thought before forgetting it.
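
A minimal sketch of the pattern described above, with hypothetical names rather than Bunny's actual internals: the attempt budget is only ever decremented, so every failure over the lifetime of the connection counts against the same limit.

```ruby
# Hypothetical illustration of the reported behavior, not Bunny's implementation.
class RecoveringConnection
  def initialize(max_recovery_attempts)
    @attempts_left = max_recovery_attempts
  end

  def handle_network_failure
    return close_forever if @attempts_left && @attempts_left.zero?

    @attempts_left -= 1 if @attempts_left
    reconnect_and_recover_channels
    # Nothing ever restores @attempts_left after a successful recovery,
    # which is what this issue reports.
  end

  private

  def reconnect_and_recover_channels
    true # placeholder: re-open the TCP connection and recover channels
  end

  def close_forever
    # placeholder: give up and close the connection for good
  end
end
```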

camelpunch changed the title from "Recovery attempts aren't reset" to "Recovery attempts limit is effectively ignored" on Jun 13, 2016
camelpunch added a commit to camelpunch/bunny that referenced this issue Jun 13, 2016
- Wait for state changes on HTTP API, with timeouts. This is about 3x
  faster than the previous constant sleeps on my machine.
- Silence log output
- Pend out an example that was incorrectly passing
  (attempt limits: ruby-amqp#408)

Issue ruby-amqp#410
michaelklishin pushed a commit that referenced this issue Jun 22, 2016
- Wait for state changes on HTTP API, with timeouts. This is about 3x
  faster than the previous constant sleeps on my machine.
- Silence log output
- Pend out an example that was incorrectly passing
  (attempt limits: #408)

Issue #410
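
The "wait for state changes on HTTP API, with timeouts" approach could look roughly like the helper below; `wait_until` and `http_client.list_connections` are illustrative assumptions, not the actual helpers from the Bunny test suite.

```ruby
require "timeout"

# Illustrative helper: poll a condition until it holds, failing after a deadline,
# instead of sleeping for a fixed amount of time.
def wait_until(timeout: 10, interval: 0.1)
  deadline = Time.now + timeout
  until yield
    raise Timeout::Error, "condition not met within #{timeout}s" if Time.now > deadline
    sleep interval
  end
end

# Hypothetical usage in a spec (http_client is assumed to exist in the spec context):
# wait_until(timeout: 15) { http_client.list_connections.empty? }
```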
@michaelklishin (Member)

I did some poking around and think that the author (seems to be @jafrog in fff7f44) intended it to work like this: the counter is global for the entire lifetime of a connection object.

I agree that this may be counterintuitive to most and decided to change it. One issue with a per-recovery attempt limit is that Bunny tests that use the HTTP client to force-close connections cannot easily simulate counted retries (TCP failures), since the server is up and any reconnection will therefore succeed and reset the counter. I don't have a solution that isn't environment-specific or fragile (e.g. stopping RabbitMQ and starting it back up), so I'm leaning towards deleting the test after a lot of manual testing ¯\_(ツ)_/¯.

michaelklishin added a commit that referenced this issue Feb 24, 2017
Previous behavior is not unreasonable but is not what many users and even RabbitMQ team members have come to expect. Therefore it can be treated as a bug.

Previously a reconnection counter was preserved between successful recoveries. This made the integration test that uses a server-sent connection.close possible.

With this change, the counter is reset after a successful reconnection, but there's an option to go back to the original behavior. We also do a hell of a lot more logging. Note that connection.close recovery with a running node will no longer work for testing this functionality. There aren't many ways to stop and start the node that aren't environment-specific; perhaps one day we will mess with nodes in Bunny tests and introduce a replacement test.

References fff7f44.
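
A sketch of the changed behavior described in this commit message, using assumed names (the keyword argument below is illustrative, not necessarily the option Bunny exposes): the counter is refilled after a successful reconnection unless the caller opts back into the original lifetime-wide accounting.

```ruby
# Hypothetical illustration of the post-change behavior, not Bunny's actual API.
class RecoveringConnection
  def initialize(max_recovery_attempts, reset_attempts_after_reconnection: true)
    @max_attempts  = max_recovery_attempts
    @attempts_left = max_recovery_attempts
    @reset_attempts_after_reconnection = reset_attempts_after_reconnection
  end

  def handle_network_failure
    return close_forever if @attempts_left && @attempts_left.zero?

    @attempts_left -= 1 if @attempts_left
    if reconnect_and_recover_channels
      # New default: a successful recovery restores the full attempt budget.
      # Passing reset_attempts_after_reconnection: false keeps the original
      # lifetime-wide counter.
      @attempts_left = @max_attempts if @reset_attempts_after_reconnection
    end
  end

  private

  def reconnect_and_recover_channels
    true # placeholder: re-open the TCP connection and recover channels
  end

  def close_forever
    # placeholder: give up and close the connection for good
  end
end
```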
michaelklishin self-assigned this on Feb 24, 2017
michaelklishin changed the title from "Recovery attempts limit is effectively ignored" to "Recovery attempts limit is never reset" on Feb 24, 2017