Change several HashMaps to IndexMap to improve incremental hashing performance #90253

Kobzol · 2021-10-25T07:23:34Z

Stable hashing hash maps in incremental mode takes a lot of time, especially for some benchmarks like clap. As noted by @Mark-Simulacrum here, this cost could be reduced by replacing some hash maps by indexmaps.

I gathered some statistics and found several hash maps that took a lot of time to hash and replaced them by indexmaps. However, in order for this to work, we need to make sure that these indexmaps have deterministic insertion order. These three are used only in visitors as far as I can see, which seems deterministic. Can we enforce this somehow? Or should some explaining comment be included for these maps?

rust-highfive · 2021-10-25T07:23:37Z

r? @davidtwco

(rust-highfive has picked a reviewer for you, use r? to override)

davidtwco · 2021-10-25T09:30:46Z

@bors try @rust-timer queue

rust-timer · 2021-10-25T09:30:47Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-10-25T09:30:53Z

⌛ Trying commit 8587167dbd6bd39a9e6d8a5ca42d1192b5244c0e with merge 4438132761da5ec0c1b2b3e74658dd8454849393...

bors · 2021-10-25T12:17:27Z

☀️ Try build successful - checks-actions
Build commit: 4438132761da5ec0c1b2b3e74658dd8454849393 (4438132761da5ec0c1b2b3e74658dd8454849393)

rust-timer · 2021-10-25T12:17:28Z

Queued 4438132761da5ec0c1b2b3e74658dd8454849393 with parent 56694b0, future comparison URL.

Mark-Simulacrum · 2021-10-25T17:57:40Z

@bors try @rust-timer queue

rust-timer · 2021-10-25T17:57:41Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-10-25T17:57:47Z

⌛ Trying commit 8587167dbd6bd39a9e6d8a5ca42d1192b5244c0e with merge 732539524eee94bddaed2893c2b83bc6bfa49d57...

bors · 2021-10-25T19:30:48Z

☀️ Try build successful - checks-actions
Build commit: 732539524eee94bddaed2893c2b83bc6bfa49d57 (732539524eee94bddaed2893c2b83bc6bfa49d57)

rust-timer · 2021-10-25T19:30:50Z

Queued 732539524eee94bddaed2893c2b83bc6bfa49d57 with parent 84c2a85, future comparison URL.

rust-timer · 2021-10-25T22:31:57Z

Finished benchmarking commit (732539524eee94bddaed2893c2b83bc6bfa49d57): comparison url.

Summary: This change led to very large relevant mixed results 🤷 in compiler performance.

Very large improvement in instruction counts (up to -11.8% on incr-full builds of clap-rs)
Small regression in instruction counts (up to 0.3% on full builds of derive)

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR led to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

davidtwco

LGTM, don't think we need to be too concerned about deterministic insertion order for these maps, so unless you want to add a comment, r=me.

Kobzol · 2021-10-26T08:52:08Z

@Mark-Simulacrum told me that we should tread lightly here. @Mark-Simulacrum do you want to add something? :)

Mark-Simulacrum · 2021-10-26T11:52:08Z

I was under the impression that StableHash needed to be deterministically produced in different builds with the "same" inputs to avoid bugs (or ICEs). This PR is changing a bunch of maps from Fx to the indexset variants, which removes the sort by stable hash key when stablehashing the maps, which I think means we need to otherwise ensure stability through a stable insertion order, right?

Cc @rust-lang/wg-incr-comp

cjgillot · 2021-10-26T16:51:32Z

I was asking myself something similar earlier. I have similar reservations.

This PR will start tracking the order in which objects are processed. This order is saves in the dep-graph, but does not enter the computation of the query fingerprint for non-anonymous queries.

For instance, in certain cases, like iterations on hir_crate, items are visited in LocalDefId order. This LocalDefId order may leak into the indexmap, and thus into the stable hash.
However, we avoid on purpose any tracking of this LocalDefId order, because it is unstable for incr. comp.

As a consequence, we may introduce spurious query invalidations (best case) or hash verification ICEs (worst case).

(I take the example of LocalDefId ordering because it is known to have caused a few bugs, and because it is easily available incr. comp. unstable information.)

I will think a bit more on it to try and find a solution...

davidtwco · 2021-10-26T17:02:29Z

r? @cjgillot

Kobzol · 2021-10-26T17:08:14Z

@cjgillot There is one more thing that I'm not sure how it works. I benchmarked the stable hashing of hash maps, and about half of the cost was invoking ToStableHashKey and creating a vector of keys, and the second half was the sorting itself. For different hash maps and benchmarks it is either bottlenecked by creating the keys or by the sort.

Now, by transforming HashMap to IndexMap, we avoid both the cost of calling ToStableHashKey (because the bound K: HashStable<CTX> is used) and of sorting (because stable hash of index maps doesn't sort).

I wonder, if the key implements HashStable, is it OK to just hash it directly, without invoking ToStableHashKey? I know that there are some outliers like HIR bodies that panic in hash_stable, but I suppose that some key types could "just work"?

If yes, maybe we could specialize stable hashing of hash maps for keys that can be hashed directly, and not go through ToStableHashKey for these? Then we could avoid some of the cost.
If not, then using IndexMap here is probably wrong, because ToStableHashKey is not used, while it was used previously?

cjgillot · 2021-10-26T18:31:36Z

@Kobzol: the issue is about information flow into the stable hash. When iterating over a collection, be it a Vec, a HashMap or an IndexMap, the order of items influences the value of the resulting hash: [a, b] and [b, a] have different hashes.

Meanwhile, there is some information we do not want to track. This is the case of the value of LocalDefId. Inserting a function in a file will change other functions LocalDefId, but it will not change their DefPathHash.

My concern is about controlling this information flow. In order to do that ToStableHashKey trait replaces the iteration order of the collection (which for hash maps is based on the key and the allocated memory capacity and should be irrelevant to the compilation), by the order of a stable hash key (the DefPathHash when the key is LocalDefId). By sorting the vectors by stable key, we manage the information flow.

Using IndexMap, the iteration order is the insertion order. Normally, this insertion order should only depend on tracked information obtained by depending on another query. For instance, a HIR visitor will create a query dependency on hir_owner_nodes, which hashes the in-code declaration order of functions. However, and this is my concern, the order of LocalDefId is freely available without using a query and is purposely untracked.

There are two ways to resolve this concern:

avoid depending on the insertion order (= not merging this PR) ;
making untracked information inaccessible (= stop implementing Ord for LocalDefId, etc).

In the long run, I think the second solution is the right way forward, but more tricky to reach. For instance, we do not really have a complete list of all untracked information.

In conclusion, I think we should eventually merge this PR, but after for some progress on point 2 above, otherwise we may get incremental hash validation ICEs. However, I may be overly conservative here, since ICEs are still very unlikely.

@Aaron1011 do you agree with this analysis?

bors · 2022-03-10T17:37:46Z

⌛ Testing commit e475a49 with merge c3d8803baf113e149eea23c0774e7744548fc1b9...

bors · 2022-03-10T18:28:39Z

💔 Test failed - checks-actions

rust-log-analyzer · 2022-03-10T18:29:27Z

A job failed! Check out the build log: (web) (plain)

Click to see the possible cause of the failure (guessed by this bot)

Kobzol · 2022-03-10T18:58:39Z

Hmm, not really sure what has happened.

michaelwoerister · 2022-03-11T09:08:58Z

@bors retry

bors · 2022-03-11T09:28:42Z

⌛ Testing commit e475a49 with merge 1e8c1ed7b34eb2efa04bab53619c6298710b0366...

bors · 2022-03-11T13:43:42Z

💥 Test timed out

lqd · 2022-03-11T13:56:26Z

@bors retry

rust-log-analyzer · 2022-03-11T14:03:42Z

A job failed! Check out the build log: (web) (plain)

Click to see the possible cause of the failure (guessed by this bot)

bors · 2022-03-11T16:37:59Z

⌛ Testing commit e475a49 with merge c9b45e6...

bors · 2022-03-11T19:20:27Z

☀️ Test successful - checks-actions
Approved by: cjgillot
Pushing c9b45e6 to master...

rust-timer · 2022-03-11T21:08:07Z

Finished benchmarking commit (c9b45e6): comparison url.

Summary: This benchmark run shows 29 relevant improvements 🎉 but 31 relevant regressions 😿 to instruction counts.

Arithmetic mean of relevant regressions: 0.3%
Arithmetic mean of relevant improvements: -0.8%
Arithmetic mean of all relevant changes: -0.2%
Largest improvement in instruction counts: -7.5% on incr-full builds of clap-rs check
Largest regression in instruction counts: 0.6% on full builds of deep-vector check

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Next Steps: If you can justify the regressions found in this perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please open an issue or create a new PR that fixes the regressions, add a comment linking to the newly created issue or PR, and then add the perf-regression-triaged label to this PR.

@rustbot label: +perf-regression

rylev · 2022-03-15T13:14:55Z

Given that the performance seen here largely mirrors what was seen when running perf previously (large improvements to clap-rs and otherwise an overall performance wash), I'll mark this as triaged. Let me know if you disagree with this assessment.

@rustbot triage label: perf-regression-triaged

rust-highfive assigned davidtwco Oct 25, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 25, 2021

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Oct 25, 2021

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Oct 25, 2021

davidtwco approved these changes Oct 26, 2021

View reviewed changes

rust-highfive assigned cjgillot and unassigned davidtwco Oct 26, 2021

cjgillot mentioned this pull request Oct 26, 2021

Make untracked incr. comp. information inaccessible #90317

Open

5 tasks

apiraino added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Oct 28, 2021

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Mar 10, 2022

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Mar 11, 2022

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Mar 11, 2022

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Mar 11, 2022

bors added the merged-by-bors This PR was explicitly merged by bors. label Mar 11, 2022

bors merged commit c9b45e6 into rust-lang:master Mar 11, 2022

rustbot added this to the 1.61.0 milestone Mar 11, 2022

Kobzol deleted the hash-stable-sort-index-map branch March 11, 2022 19:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change several HashMaps to IndexMap to improve incremental hashing performance #90253

Change several HashMaps to IndexMap to improve incremental hashing performance #90253

Kobzol commented Oct 25, 2021

rust-highfive commented Oct 25, 2021

davidtwco commented Oct 25, 2021

rust-timer commented Oct 25, 2021

bors commented Oct 25, 2021

bors commented Oct 25, 2021

rust-timer commented Oct 25, 2021

Mark-Simulacrum commented Oct 25, 2021

rust-timer commented Oct 25, 2021

bors commented Oct 25, 2021

bors commented Oct 25, 2021

rust-timer commented Oct 25, 2021

rust-timer commented Oct 25, 2021

davidtwco left a comment

Kobzol commented Oct 26, 2021

Mark-Simulacrum commented Oct 26, 2021

cjgillot commented Oct 26, 2021

davidtwco commented Oct 26, 2021

Kobzol commented Oct 26, 2021

cjgillot commented Oct 26, 2021

bors commented Mar 10, 2022

bors commented Mar 10, 2022

rust-log-analyzer commented Mar 10, 2022

Kobzol commented Mar 10, 2022

michaelwoerister commented Mar 11, 2022

bors commented Mar 11, 2022

bors commented Mar 11, 2022

lqd commented Mar 11, 2022

rust-log-analyzer commented Mar 11, 2022

bors commented Mar 11, 2022

bors commented Mar 11, 2022

rust-timer commented Mar 11, 2022

rylev commented Mar 15, 2022

Change several HashMaps to IndexMap to improve incremental hashing performance #90253

Change several HashMaps to IndexMap to improve incremental hashing performance #90253

Conversation

Kobzol commented Oct 25, 2021

rust-highfive commented Oct 25, 2021

davidtwco commented Oct 25, 2021

rust-timer commented Oct 25, 2021

bors commented Oct 25, 2021

bors commented Oct 25, 2021

rust-timer commented Oct 25, 2021

Mark-Simulacrum commented Oct 25, 2021

rust-timer commented Oct 25, 2021

bors commented Oct 25, 2021

bors commented Oct 25, 2021

rust-timer commented Oct 25, 2021

rust-timer commented Oct 25, 2021

davidtwco left a comment

Choose a reason for hiding this comment

Kobzol commented Oct 26, 2021

Mark-Simulacrum commented Oct 26, 2021

cjgillot commented Oct 26, 2021

davidtwco commented Oct 26, 2021

Kobzol commented Oct 26, 2021

cjgillot commented Oct 26, 2021

bors commented Mar 10, 2022

bors commented Mar 10, 2022

rust-log-analyzer commented Mar 10, 2022

Kobzol commented Mar 10, 2022

michaelwoerister commented Mar 11, 2022

bors commented Mar 11, 2022

bors commented Mar 11, 2022

lqd commented Mar 11, 2022

rust-log-analyzer commented Mar 11, 2022

bors commented Mar 11, 2022

bors commented Mar 11, 2022

rust-timer commented Mar 11, 2022

rylev commented Mar 15, 2022