Add support for building merkle multiproofs #16

etan-status · 2021-12-06T17:55:53Z

This adds functionality for building merkle multiproofs in the same form
that the Ethereum consensus specs suggest, i.e., in descending order of
helper indices. Merkle multiproofs are useful for the light client sync.
https://github.com/ethereum/consensus-specs/blob/v1.1.6/ssz/merkle-proofs.md#merkle-multiproofs

zah

How come there are no tests here?

ssz_serialization/merkleization.nim

etan-status · 2021-12-08T19:43:46Z

Added tests.

etan-status · 2021-12-08T19:54:37Z

New test coverage:

rm -rf coverage && mkdir -p coverage && \
    nim c -d:PREFER_BLST_SHA256=false --nimcache:coverage/nimcache \
        --passC:-fprofile-arcs --passC:-ftest-coverage --passL:-fprofile-arcs --passL:-ftest-coverage \
        -r vendor/nim-ssz-serialization/tests/test_merkleization_types.nim && \
    lcov --capture --directory coverage/nimcache --output-file coverage/coverage.info && \
    lcov --extract coverage/coverage.info "*/vendor/nim-ssz-serialization/*" \
        --output-file coverage/coverage.f.info && \
    genhtml coverage/coverage.f.info --output-directory coverage/output

Overall coverage rate:
  lines......: 93.9% (939 of 1000 lines)
  functions..: 98.2% (161 of 164 functions)

ssz_serialization/merkleization.nim

etan-status · 2021-12-16T10:30:10Z

Wait for #19 before merging this.

ssz_serialization/merkleization.nim

ssz_serialization/proofs.nim

etan-status · 2022-01-31T10:01:03Z

Thanks @zah for the review comments. I have addressed them in the latest push.

etan-status · 2022-01-31T10:56:14Z

Nim devel checks are failing because of upstream bugs in the Nim compiler repo.

tersec · 2022-01-31T12:41:35Z

Nim devel checks are failing because of upstream bugs in the Nim compiler repo.

Is this an already-identified regression? One reason for having this Nim devel checks is so that they can be fixed upstream before Nimbus has to wait until Nim 1.x.2 or 1.x.3 before it even builds due to not having noticed these to be fixed in time for the Nim 1.x.0 or Nim 1.x.1 releases.

stefantalpalaru · 2022-01-31T13:13:49Z

Is this an already-identified regression?

Yes, they fixed it here: nim-lang/Nim#19472

etan-status · 2022-01-31T14:04:23Z

Re-ran CI.

etan-status · 2022-03-01T10:43:16Z

Anything still holding this PR back?

ssz_serialization/merkleization.nim

This adds functionality for building merkle multiproofs in the same form that the Ethereum consensus specs suggest, i.e., in descending order of helper indices. Merkle multiproofs are useful for the light client sync. https://github.com/ethereum/consensus-specs/blob/v1.1.6/ssz/merkle-proofs.md#merkle-multiproofs Tested from `nimbus-eth2` root using: ``` nim c -r vendor/nim-ssz-serialization/tests/test_all.nim ```

etan-status · 2022-04-07T13:12:28Z

Add std/ prefix to imports.

kdeme

Looks fine to me to be merged imo.

I do have a similar remark as @zah regarding the naming of hash_tree_root procs which are there to return the proofs rather than the final hash tree root.
I get it that each piece of the proof(s) is in its own a hash tree root, but based on the name, one could think that the final hash_tree_root call would then return the hash tree root, instead of the proof(s).

Initially I thought that the naming was fine, but then I noticed that the proof versions of these procs are actually all separate due to the indexing logic that needs to be done, which causes of no real code re-use with the original versions in there.
Which is great from the point of view of leaving that code untouched and making merging this less impactful (Existing code touched is actually rather limited).
But then I do think that for a lot of the code related to this indexing it would be more fitting to live in proofs.nim.

But perhaps I am missing some crucial details (I did rather skim fast over the indexing part and the new hashTreeRootAux proc). Anyhow, not necessarily something blocking.

kdeme · 2022-06-20T14:06:42Z

ssz_serialization/proofs.nim

+func build_proof*(
+    anchor: auto,
+    indices: static openArray[GeneralizedIndex],
+    proof: var openArray[Digest]): Result[void, string] =


Is there still a reason for having the function signatures with proof: var openArray[Digest] and a returned void Result (and the same counts for the root_tree_hash counterparts)?

When there is already the need for Result[void, string] (or boolean for that matter), perhaps that type of return should be only provided, considering it is a safer practice (caller must verify the result to access the proofs).

I guess it could save some copying because the result can be directly put into the destination, but semantically these two are the same.

var foo: SomeObject ? state.build_proof(5.GeneralizedIndex, foo.proof) foo.proof = ? state.build_proof(5.GeneralizedIndex)

etan-status · 2022-06-21T10:46:02Z

I do have a similar remark as @zah regarding the naming of hash_tree_root procs which are there to return the proofs rather than the final hash tree root. I get it that each piece of the proof(s) is in its own a hash tree root, but based on the name, one could think that the final hash_tree_root call would then return the hash tree root, instead of the proof(s).

hash_tree_root does not compute a proof. It returns the root at index 1 by default, or the roots at custom given indices (like, if you pass 13, 42, the result will contain the root at index 13 and the root at index 42).
Note that the terminology of root for intermediate hashes, while it seems a bit backward, seems to be also used by the EF, e.g., see ethereum/consensus-specs#2629 (comment) -- I used to have this same concern as well.

Alternative names could be hash_tree_root_at_indices, hash_tree_subroot (but it can also do index 1), hash_tree_leaves (but it can also do intermediate hashes above the leaves). Or maybe there are better ideas?

Initially I thought that the naming was fine, but then I noticed that the proof versions of these procs are actually all separate due to the indexing logic that needs to be done, which causes of no real code re-use with the original versions in there. Which is great from the point of view of leaving that code untouched and making merging this less impactful (Existing code touched is actually rather limited). But then I do think that for a lot of the code related to this indexing it would be more fitting to live in proofs.nim.

Yes, this is just an implementation detail though. The regular hash_tree_root is equivalent to hash_tree_root(1.GeneralizedIndex), so it could also use the new implementation. However, due to the top root being requested much more frequently, I think having the 1 case implemented separately allows for more optimization. The cost is that for types where non-1 indices are requested that two copies of the hashing logic are emitted (one for top root, and one for any other index), but it's not many types that we use like that.

kdeme · 2022-06-21T12:27:12Z

hash_tree_root does not compute a proof. It returns the root at index 1 by default, or the roots at custom given indices (like, if you pass 13, 42, the result will contain the root at index 13 and the root at index 42).

OK, fair enough. I guess what I was trying to say is that you can make it provided a proof by giving it the right indexes as parameter.

Note that the terminology of root for intermediate hashes, while it seems a bit backward, seems to be also used by the EF, e.g., see ethereum/consensus-specs#2629 (comment) -- I used to have this same concern as well.

I was not aware of that terminology. I saw hash_tree_root solely as "compute the hash of the root of the tree". But as it appears to be an already established terminology in the eth2 specs for any intermediate being called a root, it seems (even more) fine to leave the naming as is.

Yes, this is just an implementation detail though. The regular hash_tree_root is equivalent to hash_tree_root(1.GeneralizedIndex), so it could also use the new implementation.

Right, that's a good point and argument for keeping the code where it is.

etan-status requested review from zah and kdeme December 6, 2021 17:55

This was referenced Dec 6, 2021

Implement light client syncing status-im/nimbus-eth2#2337

Closed

bump nim-ssz-serialization to 3db6cc0f282708aca6c290914488edd832971d61 status-im/nimbus-eth2#3119

Merged

zah reviewed Dec 7, 2021

View reviewed changes

ssz_serialization/merkleization.nim Outdated Show resolved Hide resolved

ssz_serialization/merkleization.nim Outdated Show resolved Hide resolved

ssz_serialization/merkleization.nim Outdated Show resolved Hide resolved

etan-status commented Dec 8, 2021

View reviewed changes

ssz_serialization/merkleization.nim Show resolved Hide resolved

etan-status requested a review from zah December 9, 2021 16:59

This was referenced Dec 10, 2021

remove outdated and incorrect SSZ code status-im/nim-eth#447

Merged

import is_valid_merkle_branch test cases from nim-eth status-im/nimbus-eth2#3182

Merged

etan-status marked this pull request as draft December 16, 2021 10:44

etan-status marked this pull request as ready for review December 17, 2021 16:19

etan-status mentioned this pull request Dec 17, 2021

Use uint64 for GeneralizedIndex #13

Merged

zah reviewed Jan 21, 2022

View reviewed changes

arnetheduck reviewed Apr 7, 2022

View reviewed changes

ssz_serialization/merkleization.nim Outdated Show resolved Hide resolved

kdeme reviewed Jun 20, 2022

View reviewed changes

kdeme approved these changes Jun 22, 2022

View reviewed changes

kdeme merged commit da3c08c into status-im:master Jun 23, 2022

etan-status deleted the merkle-multiproof branch June 23, 2022 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for building merkle multiproofs #16

Add support for building merkle multiproofs #16

etan-status commented Dec 6, 2021 •

edited

Loading

zah left a comment

etan-status commented Dec 8, 2021

etan-status commented Dec 8, 2021 •

edited

Loading

etan-status commented Dec 16, 2021

etan-status commented Jan 31, 2022

etan-status commented Jan 31, 2022

tersec commented Jan 31, 2022

stefantalpalaru commented Jan 31, 2022

etan-status commented Jan 31, 2022

etan-status commented Mar 1, 2022

etan-status commented Apr 7, 2022

kdeme left a comment

kdeme Jun 20, 2022

etan-status Jun 21, 2022 •

edited

Loading

etan-status commented Jun 21, 2022

kdeme commented Jun 21, 2022

Add support for building merkle multiproofs #16

Add support for building merkle multiproofs #16

Conversation

etan-status commented Dec 6, 2021 • edited Loading

zah left a comment

Choose a reason for hiding this comment

etan-status commented Dec 8, 2021

etan-status commented Dec 8, 2021 • edited Loading

etan-status commented Dec 16, 2021

etan-status commented Jan 31, 2022

etan-status commented Jan 31, 2022

tersec commented Jan 31, 2022

stefantalpalaru commented Jan 31, 2022

etan-status commented Jan 31, 2022

etan-status commented Mar 1, 2022

etan-status commented Apr 7, 2022

kdeme left a comment

Choose a reason for hiding this comment

kdeme Jun 20, 2022

Choose a reason for hiding this comment

etan-status Jun 21, 2022 • edited Loading

Choose a reason for hiding this comment

etan-status commented Jun 21, 2022

kdeme commented Jun 21, 2022

etan-status commented Dec 6, 2021 •

edited

Loading

etan-status commented Dec 8, 2021 •

edited

Loading

etan-status Jun 21, 2022 •

edited

Loading