Secondary mechanism to trigger watches for transactions from past blocks #3002

t-bast · 2025-02-06T14:58:41Z

When a new block is found, we want to check its confirmed transactions to potentially trigger watches. This is especially important when a channel is spent and we haven't seen the spending transaction in our mempool before receiving it in a block.

This is already handled through the ZMQ rawtx topic, where bitcoind sends us every transaction it receives (either in the mempool or in a block). But when using remote bitcoind instances, ZMQ seems to sometimes be unreliable and silently drop some events (mostly when the connection is unstable with the bitcoind instance). That's why we add another mechanism for extra safety, where whenever a new block is found, we fetch the last N blocks and re-process their transactions. We keep a cache of the processed blocks to ensure that we don't needlessly re-process them multiple times.
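For illustration, the re-scan described above could look roughly like the sketch below. This is not the code from this PR: `Tx`, `Block`, `BlockSource`, `RecentBlockScanner`, `scanDepth` and `cacheSize` are hypothetical names, the block source stands in for bitcoind's RPC interface, and the real implementation is asynchronous.

```scala
import scala.collection.mutable

// Hypothetical minimal types; eclair's real types differ.
final case class Tx(txid: String)
final case class Block(id: String, parentId: String, txs: Seq[Tx])

// Hypothetical stand-in for the bitcoind RPC client.
trait BlockSource {
  def getBlock(id: String): Option[Block]
}

// On every new tip, looks at the last `scanDepth` blocks and hands the
// transactions of not-yet-processed blocks to `onTx`.
final class RecentBlockScanner(source: BlockSource, scanDepth: Int, cacheSize: Int, onTx: Tx => Unit) {
  require(cacheSize >= scanDepth, "the cache must cover the whole scan window")

  // Insertion-ordered set used as a bounded cache of processed block ids.
  private val processed = mutable.LinkedHashSet.empty[String]

  def onNewTip(tipId: String): Unit = {
    // Walk back from the tip through parent ids, newest block first.
    val recent = Iterator
      .iterate(source.getBlock(tipId))(_.flatMap(b => source.getBlock(b.parentId)))
      .takeWhile(_.isDefined)
      .take(scanDepth)
      .collect { case Some(block) => block }
      .toList
    // Process oldest first, skipping blocks that were already handled.
    recent.reverse.filterNot(b => processed.contains(b.id)).foreach { block =>
      block.txs.foreach(onTx)
      processed += block.id
      // Evict the oldest ids so the cache stays bounded.
      while (processed.size > cacheSize) processed -= processed.head
    }
  }
}
```

Calling `onNewTip` twice with the same tip processes each transaction only once, which is the role the cache of processed blocks plays in the description.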

pm47 · 2025-02-10T14:06:05Z

> But when using remote bitcoind instances, ZMQ seems to sometimes be unreliable and silently drop some events (mostly when the connection is unstable with the bitcoind instance)

Did you notice that all ZMQ messages contain a sequence number specifically for the purpose of detecting lost messages? For example, `hashblock`.
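As an illustration of what such detection could look like, here is a minimal sketch. It assumes, per bitcoind's ZMQ documentation, that the last frame of each notification is a 4-byte little-endian unsigned counter kept per topic; `ZmqSequence` and `GapDetector` are hypothetical names, not eclair code.

```scala
import java.nio.{ByteBuffer, ByteOrder}

object ZmqSequence {
  // Decodes the 4-byte little-endian frame into an unsigned value held in a Long.
  def decode(frame: Array[Byte]): Long = {
    require(frame.length == 4, "sequence frame must be 4 bytes")
    ByteBuffer.wrap(frame).order(ByteOrder.LITTLE_ENDIAN).getInt() & 0xffffffffL
  }

  // Number of messages missed between two consecutive sequence numbers of the
  // same topic (0 if none), taking uint32 wrap-around into account.
  def missed(previous: Long, current: Long): Long =
    (current - previous - 1) & 0xffffffffL
}

// Tracks the last sequence number seen for each topic.
final class GapDetector {
  private var last = Map.empty[String, Long]

  // Returns how many messages were missed on `topic` since the previous one.
  // The first message of a topic reports 0. A publisher restart resets the
  // counter and would show up as a large gap.
  def onMessage(topic: String, sequenceFrame: Array[Byte]): Long = {
    val seq = ZmqSequence.decode(sequenceFrame)
    val gap = last.get(topic).map(prev => ZmqSequence.missed(prev, seq)).getOrElse(0L)
    last = last.updated(topic, seq)
    gap
  }
}
```

A non-zero result only says that something was lost on that topic; it does not say which events, so the subscriber still needs another way to recover them.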

Also, I wonder if the cause of unreliability could be that we are using the same ZMQ address for both transactions and blocks. Could we run into high watermarks due to a burst of transactions (maybe unrelayed txs directly found in a new block), and drop blocks?

Finally, the implementation looks fine to me, but I wonder if we should instead rely on sequence messages and make the process a little more blockchainy: something like storing the last analyzed blockId and making sure that the new one is its direct child. It's not explicitly recommended by the doc, but they do mention it in a section about reorgs.
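To make the "direct child" idea concrete, here is a minimal sketch of the check alone. `BlockHeader`, `TipCheck` and `ChainContinuity` are hypothetical names, and the sketch deliberately leaves out what to do once a discontinuity is found (walking back to a common ancestor, handling reorgs).

```scala
// Hypothetical minimal header type: only the ids needed for the check.
final case class BlockHeader(id: String, parentId: String)

sealed trait TipCheck
// No block has been analyzed yet, so there is nothing to compare against.
case object FirstBlock extends TipCheck
// The new block builds directly on the last analyzed block.
case object DirectChild extends TipCheck
// The new block does not build on the last analyzed block: either some
// blocks were missed or a reorg happened.
final case class Discontinuity(expectedParent: String, actualParent: String) extends TipCheck

object ChainContinuity {
  def check(lastAnalyzed: Option[String], next: BlockHeader): TipCheck = lastAnalyzed match {
    case None => FirstBlock
    case Some(last) if next.parentId == last => DirectChild
    case Some(last) => Discontinuity(last, next.parentId)
  }
}
```

The check itself is small; the recovery path behind `Discontinuity` is where most of the work would be.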

t-bast · 2025-02-10T14:51:10Z

> Did you notice that all ZMQ messages contain a sequence number specifically for the purpose of detecting lost messages? For example, `hashblock`.

Yes, but what would you do when you detect that you missed events? There is no mechanism to ask for a retransmission of a past event.

> Also, I wonder if the cause of unreliability could be that we are using the same ZMQ address for both transactions and blocks.

According to bitcoind, this shouldn't matter.

> Could we run into high watermarks due to a burst of transactions (maybe unrelayed txs directly found in a new block), and drop blocks?

We've already disabled the high watermark (see ZmqActor.scala), so this shouldn't be related.

The main case where this issue can happen is when we get disconnected or if eclair or bitcoind restarts: bitcoind won't store events that happen while disconnected/offline to retransmit them on reconnection (because in most cases, this isn't the expected behavior of the connecting client and is impossible to fully guarantee anyway).

> Finally, the implementation looks fine to me, but I wonder if we should instead rely on sequence messages and make the process a little more blockchainy: something like storing the last analyzed blockId and making sure that the new one is its direct child. It's not explicitly recommended by the doc, but they do mention it in a section about reorgs.

I think this would be much more complex to implement correctly than the simple behavior of this PR?

pm47

Fair enough! Just a nit

eclair-core/src/main/scala/fr/acinq/eclair/blockchain/bitcoind/ZmqWatcher.scala

t-bast requested review from sstone and pm47 February 6, 2025 14:58

t-bast force-pushed the zmq-check-latest-blocks branch from d7d00fb to d32a21a February 6, 2025 15:18

pm47 reviewed Feb 10, 2025

fixup! Trigger watches for transactions from past blocks

1826f70

pm47 approved these changes Feb 10, 2025

t-bast merged commit bc44808 into master Feb 11, 2025
1 check passed

t-bast deleted the zmq-check-latest-blocks branch February 11, 2025 08:38
