ChannelState#Received accounting can apparently fail #357

Open

rvagg opened this issue Jan 10, 2023 · 2 comments
rvagg commented Jan 10, 2023

Two related items:

Autoretrieve is seeing "successful" transfers that have zero bytes transferred; our event logs for them look like this: `{ "confirmed": true, "receivedCids": 1, "receivedSize": 0 }`.

These transfers are initiated because we don't have the block locally, and a new transfer shouldn't be able to start for the same CID. They also aren't marked as failed, because the block confirmer gives us a 👍 that the root block we wanted is now in our blockstore. So it would appear that the transfer happens, but DT doesn't update state properly and `channelState.Received()` ends up zero.
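For illustration, a minimal Go sketch (not autoretrieve's actual code) of the kind of post-transfer check that surfaces this inconsistency. `ChannelState.Received()` is the method named in the title; the `rootConfirmed` flag and the helper itself are assumptions standing in for the block-confirmer plumbing:

```go
package retrieval

import (
	"fmt"

	datatransfer "github.com/filecoin-project/go-data-transfer"
)

// checkCompletedChannel is a hypothetical helper: the block confirmer says the
// root block landed in our blockstore, yet DT accounted zero received bytes on
// the channel — exactly the { "confirmed": true, "receivedSize": 0 } case above.
func checkCompletedChannel(st datatransfer.ChannelState, rootConfirmed bool) error {
	if rootConfirmed && st.Received() == 0 {
		return fmt.Errorf("channel %v: root block confirmed but ChannelState.Received() == 0", st.ChannelID())
	}
	return nil
}
```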

h/t to @dirkmc: logs from sophia, apparently from autoretrieve, show what looks like a ~1m timeout cancellation even though the SP claims it's still sending data. We time out when no bytes are received, even if we're still chatting with the peer.

(Screenshots of the retrieval logs, taken 2023-01-10 at 11:07:20, 11:07:28 and 11:07:35 AM)

dirkmc commented Jan 10, 2023

> We time out when no bytes are received, even if we're still chatting with the peer.

In this case it looks from the logs like the last data was sent at 2023-01-10 10:18:41.744, and after a minute autoretrieve gives up waiting for more data and cancels the retrieval.
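As a rough sketch of that behaviour (assumed, not autoretrieve's actual implementation): a no-progress watchdog that resets whenever bytes arrive and cancels the retrieval after a minute of silence, regardless of other chatter from the peer. The names and the one-minute constant are illustrative:

```go
package retrieval

import (
	"context"
	"time"
)

// progressTimeout matches the ~1m cancellations seen in the logs (assumed value).
const progressTimeout = time.Minute

// watchProgress cancels the retrieval if no new bytes arrive within
// progressTimeout, even if the peer keeps sending non-data messages.
// dataReceived would be signalled from the data-transfer event handler.
func watchProgress(ctx context.Context, cancelRetrieval func(), dataReceived <-chan uint64) {
	timer := time.NewTimer(progressTimeout)
	defer timer.Stop()
	for {
		select {
		case <-ctx.Done():
			return
		case <-dataReceived:
			// Bytes arrived: reset the no-progress timer.
			if !timer.Stop() {
				<-timer.C
			}
			timer.Reset(progressTimeout)
		case <-timer.C:
			// A full minute with no received bytes: give up and cancel,
			// even though the peer may still be "chatting".
			cancelRetrieval()
			return
		}
	}
}
```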

davidd8 assigned dirkmc and jacobheun, and unassigned dirkmc, on Feb 1, 2023
davidd8 commented Feb 1, 2023

Triaging this to the Boost team (@jacobheun) to prioritize in their backlog, since it may be causing the "1m timeout" errors seen in the autoretrieve dashboard: https://protocollabs.grafana.net/d/lDh_Fko4k/autoretrieve-estuary?orgId=1&refresh=5s&from=1675121579495&to=1675294379495&viewPanel=39. Given it's the top retrieval error at the moment, I'm elevating this issue's severity to P1.
