kvdb-rocksdb with no overlay cache #310

Closed
NikVolf opened this issue Jan 6, 2020 · 13 comments · Fixed by #313

Comments

@NikVolf (Contributor) commented Jan 6, 2020

Currently, substrate is not using it, and probably for a good reason: caching something that RocksDB already caches (or is able to) on its own is generally not a good idea.

@dvdplm (Contributor) commented Jan 6, 2020

I have asked myself the same thing but never did anything about it due to the lack of hard data on the performance impact. I'm not convinced that "substrate isn't using it" is enough.

@ordian (Member) commented Jan 6, 2020

I can do some benchmarks on block import impact for parity-ethereum.

@NikVolf (Contributor, Author) commented Jan 6, 2020

@dvdplm Yes, of course, I am not convinced either.
We could also make substrate use it and see how it goes.

@NikVolf (Contributor, Author) commented Jan 6, 2020

I can do some benchmarks on block import impact for parity-ethereum.

I'll look into what is required for my branch to work with parity-ethereum.

But we could also keep two versions even if parity-ethereum strictly needs the overlay cache.

@ordian (Member) commented Jan 9, 2020

I've run a quick-and-dirty import bench (importing 1k recent blocks, at ~9230k height, on master vs the ao-no-overlay branch).
There seems to be a small (~4%) regression at most. Warp sync doesn't seem to regress substantially either.

The thing that worries me is the semantic change. In parity-ethereum we use write_buffered, and if we just replace write_buffered with write (which is what that branch does), we can commit some data in an intermediate state, and if the node crashes or exits it could cause some problems? But if that's not the case, I think we should proceed with no overlay and probably remove write_buffered and flush from the kvdb API altogether (another breaking change).
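
To make the semantic difference concrete, here is a minimal self-contained sketch (illustrative names only, not the real kvdb-rocksdb types): write_buffered parks the transaction in an in-memory overlay until flush, while write commits straight to the backing store, so intermediate state becomes durable immediately.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

// Illustrative stand-ins only; the real kvdb types and signatures differ.
type Key = Vec<u8>;
type Value = Vec<u8>;

#[derive(Default)]
struct Transaction {
    ops: Vec<(Key, Value)>,
}

#[derive(Default)]
struct Db {
    backing: Mutex<HashMap<Key, Value>>, // stand-in for the RocksDB-backed store
    overlay: Mutex<HashMap<Key, Value>>, // buffered, not-yet-committed writes
}

impl Db {
    // `write_buffered`: only the overlay is touched; nothing reaches the
    // backing store until `flush` is called.
    fn write_buffered(&self, tx: Transaction) {
        self.overlay.lock().unwrap().extend(tx.ops);
    }

    // `flush`: drain the overlay into the backing store in one go.
    fn flush(&self) {
        let mut overlay = self.overlay.lock().unwrap();
        self.backing.lock().unwrap().extend(overlay.drain());
    }

    // `write`: commit straight to the backing store. Replacing
    // `write_buffered` with this makes intermediate state durable at once.
    fn write(&self, tx: Transaction) {
        self.backing.lock().unwrap().extend(tx.ops);
    }
}

fn main() {
    let db = Db::default();
    let mut tx = Transaction::default();
    tx.ops.push((b"key".to_vec(), b"intermediate value".to_vec()));

    db.write_buffered(tx);
    // With the overlay, the backing store has not seen the write yet.
    assert!(db.backing.lock().unwrap().is_empty());

    db.flush();
    assert_eq!(db.backing.lock().unwrap().len(), 1);
}
```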

@dvdplm (Contributor) commented Jan 9, 2020

if the node crashes or exits it could cause some problems?

We do not provide any guarantees to ensure data consistency in the case of a crash afaik, not beyond what rocksdb already does with the WAL anyway.

Here's what I like about the idea of getting rid of the overlay:

  • less code is good
  • the overlay has no proven performance advantage
  • memory spent in the overlay can be given to rocksdb
  • the overlay makes it harder to reason about and test performance and bottlenecks

The downside is, like you say, that the consuming code has to be very carefully audited.

@NikVolf (Contributor, Author) commented Jan 9, 2020

@ordian

Actually, I have an optimised version of the no-overlay variant; do you also want to check the lil-copy branch?

It squashes all key-values into one long Vec inside the transaction.

That avoids allocations when you, for example, do transaction.write(&h256, &h256), which is widely used afair.

The overlay cache prevented this optimisation before.
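
Purely to illustrate the idea (my own sketch, not code from the lil-copy branch): all key and value bytes go into one contiguous buffer and each put only records offsets, so a pair of 32-byte key/value writes doesn't cost two heap allocations per call.

```rust
// Illustrative sketch of a "flat" transaction: one shared byte buffer plus
// offsets, instead of an owned Vec per key and per value. Columns, deletes,
// etc. are omitted for brevity.
#[derive(Default)]
struct FlatTransaction {
    data: Vec<u8>,                   // all key and value bytes, back to back
    ops: Vec<(usize, usize, usize)>, // (key_start, key_end, value_end) into `data`
}

impl FlatTransaction {
    fn put(&mut self, key: &[u8], value: &[u8]) {
        let key_start = self.data.len();
        self.data.extend_from_slice(key);
        let key_end = self.data.len();
        self.data.extend_from_slice(value);
        self.ops.push((key_start, key_end, self.data.len()));
    }

    // Iterate over the recorded (key, value) pairs without copying.
    fn iter<'a>(&'a self) -> impl Iterator<Item = (&'a [u8], &'a [u8])> + 'a {
        self.ops
            .iter()
            .map(move |&(ks, ke, ve)| (&self.data[ks..ke], &self.data[ke..ve]))
    }
}

fn main() {
    let mut tx = FlatTransaction::default();
    // e.g. the H256 -> H256 writes mentioned above: only the two Vecs inside
    // `FlatTransaction` ever grow; there is no per-put allocation.
    tx.put(&[0u8; 32], &[1u8; 32]);
    tx.put(&[2u8; 32], &[3u8; 32]);
    assert_eq!(tx.iter().count(), 2);
}
```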

@ordian (Member) commented Jan 9, 2020

@NikVolf I don't expect it to make a difference, but will try tomorrow.

We do not provide any guarantees to ensure data consistency in the case of a crash afaik, not beyond what rocksdb already does with the WAL anyway.

But write_buffered is guaranteed not to commit into rocksdb; if we change the semantics, we're now committing into the db what we previously didn't (intermediate state).

@ordian (Member) commented Jan 10, 2020

@NikVolf I've tried the ao-lil-copy branch, since your branch didn't compile, and it didn't make a difference for the import bench.

My concern about semantics still holds.

@NikVolf (Contributor, Author) commented Jan 11, 2020

@ordian Anyone who relies on these semantics shouldn't have done so in the first place; they are not documented anywhere (are they?)

write_buffered could have written the transaction immediately, for example if no cache is present, or whatever other logic might be behind it.

Atomicity is, of course, at the Transaction level, and this should be unambiguously stated if it is not already.
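
For clarity, here is a sketch of what "atomicity at the Transaction level" means for callers (a pseudo-API of my own, not the actual kvdb trait): everything that must land together goes into one transaction and is committed with a single write call.

```rust
use std::io;

// Pseudo-API for illustration; the real kvdb types carry columns and more.
struct Transaction {
    ops: Vec<(Vec<u8>, Option<Vec<u8>>)>, // Some(value) = put, None = delete
}

impl Transaction {
    fn new() -> Self {
        Transaction { ops: Vec::new() }
    }
    fn put(&mut self, key: &[u8], value: &[u8]) {
        self.ops.push((key.to_vec(), Some(value.to_vec())));
    }
    fn delete(&mut self, key: &[u8]) {
        self.ops.push((key.to_vec(), None));
    }
}

trait KeyValueDb {
    // The whole transaction is applied atomically or not at all; nothing is
    // guaranteed about state spanning *several* write calls.
    fn write(&self, tx: Transaction) -> io::Result<()>;
}

// All state changes for one block go into one transaction, so a crash can
// never leave half a block in the database.
fn import_block(db: &dyn KeyValueDb) -> io::Result<()> {
    let mut tx = Transaction::new();
    tx.put(b"block:123", b"header and body bytes");
    tx.put(b"state-root:123", b"new state root");
    tx.delete(b"pending:123");
    db.write(tx)
}
```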

@ordian (Member) commented Jan 13, 2020

Here is my audit of write_buffered usage in parity-ethereum: https://hackmd.io/IOQQynyJSoedXXq5SKqUVQ. I would appreciate some comments regarding the concerns about warp sync's usage of it.

I think we should proceed with removing the overlay regardless and fix the usage in parity-ethereum later.

@arkpar (Member) commented Jan 15, 2020

This overlay was originally introduced to optimize the block import pipeline, not for caching.

The point is that we can start importing block N+1 while block N is still being submitted (flushed) to RocksDB. Copying data in memory is still much faster than creating and writing a RocksDB batch. In the early days, with lighter blocks, that resulted in about 5-10% faster full sync.
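
As a rough illustration of that pipelining (my own sketch, not the actual import pipeline): the buffered write lets the import thread hand a finished batch to a background flusher and immediately move on to the next block, instead of waiting on the RocksDB commit.

```rust
use std::sync::mpsc;
use std::thread;

// Stand-in for a prepared write batch covering one imported block.
struct BlockBatch {
    number: u64,
}

fn main() {
    let (sender, receiver) = mpsc::channel::<BlockBatch>();

    // Background "flusher": the slow RocksDB commit happens off the import
    // thread, which is roughly the role the overlay played.
    let flusher = thread::spawn(move || {
        for batch in receiver {
            // In the real pipeline this would be the RocksDB batch write.
            println!("flushing block {}", batch.number);
        }
    });

    // Import thread: hand block N's batch over and immediately start
    // verifying/executing block N+1 instead of blocking on the disk.
    for number in 0..5 {
        let batch = BlockBatch { number }; // "import" block `number`
        sender.send(batch).expect("flusher is alive");
    }

    drop(sender); // close the channel so the flusher loop ends
    flusher.join().unwrap();
}
```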

Although this is not currently used in substrate, we were going to use it eventually.
However, if benchmarks show that the performance gains are now negligible, I'm fine with removing it.

@ordian (Member) commented Jan 15, 2020

@arkpar thanks for the input. I've tested with much heavier blocks than in the early days indeed, so I don't know how it will impact substrate import times (if it is to be used), but even then 5% is not worth the complexity and limitations like this.

We're essentially trading off how often data is flushed against latency, and that is something that can be tuned at the RocksDB settings level, I guess?
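
If I remember correctly, the knobs in question are RocksDB's memtable settings; with the rust-rocksdb crate, something along these lines shifts the buffering into RocksDB itself (method names are from my recollection of that crate and may differ between versions):

```rust
use rocksdb::{Options, DB};

fn main() -> Result<(), rocksdb::Error> {
    let mut opts = Options::default();
    opts.create_if_missing(true);

    // Larger/more memtables let RocksDB itself buffer more writes in memory
    // before flushing to disk, which is roughly what the overlay provided.
    opts.set_write_buffer_size(128 * 1024 * 1024); // 128 MiB per memtable
    opts.set_max_write_buffer_number(4);           // keep up to 4 memtables

    let _db = DB::open(&opts, "/tmp/kvdb-tuning-example")?;
    Ok(())
}
```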
