Optimize the Merkle tree computation #21379
Labels: P2, team-Remote-Exec, type: bug
The input Merkle tree computation often shows up in profiles holding up the critical path. The `--experimental_remote_merkle_tree_cache` optimization, which was introduced to alleviate this problem, doesn't work reliably (for reasons explained in #21378).

The current implementation creates an intermediate Java object representation for each node, which must then be converted into a Java protobuf, and then into wire format for hashing and upload. There might be gains to be had in avoiding the copying and thrashing involved in these conversions.
To do better, we'd want to iterate the input list in post-order (children before parents) so that each node can be constructed directly as its final wire representation. Then we never need to copy things around, and the only state to keep is the node currently under construction and a (path -> node) map over previously built nodes.
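As a rough illustration of the bottom-up idea, here is a hypothetical, self-contained sketch (none of these names are Bazel's actual classes). It visits directories children-before-parents, so each node is serialized and hashed exactly once, and the only state is the map of already-built nodes. For simplicity a node's "wire form" is a sorted `name=digest` listing rather than a remote-execution-API `Directory` proto:

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.util.Comparator;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

public class PostOrderMerkle {

  static String hash(byte[] bytes) throws Exception {
    MessageDigest md = MessageDigest.getInstance("SHA-256");
    StringBuilder sb = new StringBuilder();
    for (byte b : md.digest(bytes)) sb.append(String.format("%02x", b));
    return sb.toString();
  }

  static String parent(String p) {
    int i = p.lastIndexOf('/');
    return i < 0 ? "" : p.substring(0, i);
  }

  static String name(String p) {
    return p.substring(p.lastIndexOf('/') + 1);
  }

  /** inputs: path -> file content digest. Returns the root directory digest. */
  static String buildRoot(SortedMap<String, String> inputs) throws Exception {
    // Directory path -> sorted child entries (name -> digest string).
    // Reverse-lexicographic key order visits every child path before its
    // parent, since a child "parent/name" sorts after its prefix "parent".
    TreeMap<String, TreeMap<String, String>> dirs =
        new TreeMap<>(Comparator.reverseOrder());
    dirs.put("", new TreeMap<>());
    for (Map.Entry<String, String> e : inputs.entrySet()) {
      String path = e.getKey();
      // Make sure every ancestor directory has an entry map.
      for (String d = parent(path); !d.isEmpty(); d = parent(d)) {
        dirs.computeIfAbsent(d, k -> new TreeMap<>());
      }
      dirs.get(parent(path)).put(name(path), "file:" + e.getValue());
    }
    // Post-order walk: by the time a directory is serialized, all of its
    // children's digests are final, so no node is ever copied or re-built.
    String rootDigest = null;
    for (Map.Entry<String, TreeMap<String, String>> d : dirs.entrySet()) {
      StringBuilder wire = new StringBuilder();
      for (Map.Entry<String, String> c : d.getValue().entrySet()) {
        wire.append(c.getKey()).append('=').append(c.getValue()).append('\n');
      }
      String digest = hash(wire.toString().getBytes(StandardCharsets.UTF_8));
      if (d.getKey().isEmpty()) {
        rootDigest = digest;
      } else {
        dirs.get(parent(d.getKey())).put(name(d.getKey()), "dir:" + digest);
      }
    }
    return rootDigest;
  }
}
```

Since an alphabetically sorted input map is already a pre-order of the tree, reversing it (as the sketch does with a reverse-ordered `TreeMap`) yields a valid post-order without a separate topological sort.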
Ideally, we'd get the input list in the right order from `SpawnInputExpander#getInputMapping`. But this has other consequences: the list is cached in memory for reuse, and other callers likely expect it to be sorted alphabetically rather than topologically. So we might end up sorting it twice, undoing some of the gains.

An additional angle to explore is making the computation parallel, perhaps by expressing it as a graph of `ForkJoinTask`s, one per node. This requires some pre-work to set up the graph, which is time not spent actually computing things; and since the computation of an individual node isn't particularly expensive, we might need to split the work at a coarser level.