Add fast path for match checking #76918

ishitatsuyuki · 2020-09-19T13:05:36Z

This adds a fast path that reduces the complexity to linear for matches consisting only of variant patterns (i.e. enum matches). (Also see: #7462.) Unfortunately, I was too lazy to add a similar fast path for constants (mostly for integer matches); ideally that could be added another day.
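For readers skimming the thread, here is a minimal sketch of the idea, not the code from this PR. When every arm is a plain enum variant, comparing each arm against all earlier arms is quadratic, while remembering the variants seen so far in a hash set makes the whole check linear. The function names and the representation of arms as strings are assumptions made purely for illustration.

```python
def unreachable_arms_quadratic(arms):
    """Return indices of arms already covered by an earlier arm (O(n^2))."""
    unreachable = []
    for i, variant in enumerate(arms):
        # Compare against every earlier arm, as a generic algorithm would.
        if any(earlier == variant for earlier in arms[:i]):
            unreachable.append(i)
    return unreachable


def unreachable_arms_linear(arms):
    """Same result using a hash set of variants seen so far (O(n) expected)."""
    seen = set()
    unreachable = []
    for i, variant in enumerate(arms):
        if variant in seen:
            unreachable.append(i)
        else:
            seen.add(variant)
    return unreachable


def missing_variants(all_variants, arms):
    """Variants not matched by any arm; an empty list means exhaustive."""
    seen = set(arms)
    return [v for v in all_variants if v not in seen]
```

Both functions report the same arms; only the amount of work differs, which is why the difference shows up mainly on very large matches.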

TBH, I'm not confident in the performance claims, since enums tend to be small and FxHashMap could add a lot of overhead.

r? @Mark-Simulacrum

needs perf

jonas-schievink · 2020-09-19T13:22:57Z

@bors try @rust-timer queue

rust-timer · 2020-09-19T13:22:58Z

Awaiting bors try build completion

bors · 2020-09-19T13:23:09Z

⌛ Trying commit 7c98f6f with merge 52836408ae482f87159a2473c3e5475348b1f255...

bors · 2020-09-19T14:04:33Z

☀️ Try build successful - checks-actions, checks-azure
Build commit: 52836408ae482f87159a2473c3e5475348b1f255 (52836408ae482f87159a2473c3e5475348b1f255)

rust-timer · 2020-09-19T14:04:35Z

Queued 52836408ae482f87159a2473c3e5475348b1f255 with parent fd702d2, future comparison URL.

rust-timer · 2020-09-19T16:38:51Z

Finished benchmarking try commit (52836408ae482f87159a2473c3e5475348b1f255): comparison url.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. Please note that if the perf results are neutral, you should likely undo the rollup=never given below by specifying rollup- to bors.

Importantly, though, if the results of this run are non-neutral do not roll this PR up -- it will mask other regressions or improvements in the roll up.

@bors rollup=never

ishitatsuyuki · 2020-09-19T23:28:10Z

Seems that all perf results are nearly identical. I guess that match checking wasn't dominating the compile time then.

It seems that this PR only helps in pathological cases like the one generated by the script below. Some crate in the ecosystem might be doing the same kind of code generation, though, and this can become useful in such cases.

Python script:

print("""#![allow(warnings)]
enum A {""")
for i in range(8192):
    print(f"    Var{i},")
print("""}
fn main() {
    match A::Var0 {""")
for i in range(8192):
    print(f"        A::Var{i} => {{}}")
print("""    }
}""")

Old rustc takes around 2.7s; this build takes around 1.7s.
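To try other sizes, the script above can be wrapped in a function. This is a sketch that produces the same text as the original script for a given variant count; the function name `match_stress_source` is an assumption, and no timings beyond the ones quoted above are implied.

```python
def match_stress_source(n):
    """Build Rust source with an n-variant enum and an n-arm match over it."""
    lines = ["#![allow(warnings)]", "enum A {"]
    lines += [f"    Var{i}," for i in range(n)]
    lines += ["}", "fn main() {", "    match A::Var0 {"]
    lines += [f"        A::Var{i} => {{}}" for i in range(n)]
    lines += ["    }", "}"]
    return "\n".join(lines) + "\n"
```

Calling `print(match_stress_source(8192), end="")` should reproduce the output of the original script, which can then be redirected to a `.rs` file and compiled.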

panstromek · 2020-09-20T09:24:33Z

Btw, addressing your concerns about FxHashMap overhead: #72412 added a MiniMap for similar reasons (it's an SSO map backed by ArrayVec). You can try using that here; it seems the map will always hold a small number of elements too, if I understand it correctly.
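As a rough illustration of what a small-size-optimized (SSO) map does, here is a toy version. It is not the MiniMap from #72412: the class name, the threshold of 8, and the method names are assumptions, and in Python it shows only the structure, not any performance benefit.

```python
class SmallMap:
    """Toy small-size-optimized map: a short list first, a dict once it grows."""

    INLINE_CAPACITY = 8  # assumed threshold, chosen for illustration

    def __init__(self):
        self._items = []   # list of (key, value) pairs while small
        self._dict = None  # becomes a dict after spilling

    def get(self, key, default=None):
        if self._dict is not None:
            return self._dict.get(key, default)
        for k, v in self._items:  # linear scan is cheap for a few entries
            if k == key:
                return v
        return default

    def insert(self, key, value):
        if self._dict is not None:
            self._dict[key] = value
            return
        for i, (k, _) in enumerate(self._items):
            if k == key:
                self._items[i] = (key, value)  # overwrite existing key
                return
        if len(self._items) < self.INLINE_CAPACITY:
            self._items.append((key, value))
        else:
            # Spill: move everything into a hash map and continue there.
            self._dict = dict(self._items)
            self._dict[key] = value
            self._items = []
```

The point of the design is that lookups among a handful of entries avoid hashing altogether, and the hash map is only paid for once the collection outgrows the inline storage.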

ishitatsuyuki · 2020-09-20T10:36:49Z

It seems that MiniMap is pretty limited: it supports neither iteration nor the entry API. So for now I'm not going to make the switch.

compiler/rustc_mir_build/src/thir/pattern/_match.rs

ishitatsuyuki · 2020-09-21T11:30:14Z

I added some more comments explaining the code per request.

oli-obk · 2020-09-21T11:40:04Z

Thanks, that helps a lot!

One thing that we could do is to add your pathological case to the perf test suite, then, once that's in, this PR will show an improvement on the perf bot and we thus guard against future regressions.

Another thing is to debug_assert! that the result of the fast path is the same as the slow path. This will require running both paths if debug assertions are activated, but I think it could be reasonably added. What do you think?
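The cross-check suggested here could look roughly like the following sketch. It is a Python analogy, not the Rust code in this PR: `__debug__` plays the role of debug assertions being enabled, and all function names are hypothetical.

```python
def is_useful_slow(earlier_arms, variant):
    """Reference check: scan all earlier arms."""
    return all(arm != variant for arm in earlier_arms)


def is_useful_fast(seen, variant):
    """Fast path: one hash-set lookup."""
    return variant not in seen


def check_arms(arms):
    """Return a usefulness flag per arm, cross-checking both paths in debug mode."""
    seen = set()
    results = []
    for i, variant in enumerate(arms):
        useful = is_useful_fast(seen, variant)
        if __debug__:
            # Only evaluated when Python runs without -O, similar in spirit
            # to Rust's debug_assert!.
            assert useful == is_useful_slow(arms[:i], variant)
        results.append(useful)
        seen.add(variant)
    return results
```

In an optimized run only the fast path executes, so the extra work of the reference check is paid only in debug builds.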

ishitatsuyuki · 2020-09-22T05:41:13Z

I added the debug assertions. They involve a clone that is not really necessary, but I thought the code would be easier to maintain this way.

As for perf, I have mixed feelings since the case I mentioned above is only one kind of match that is slow; I think we could add it once we have support for integer cases like #7462 (comment).

oli-obk · 2020-09-22T07:47:03Z

You can add perf tests for both the integer case and your case within one test; then we are at least tracking them, and any changes will show up. If the tests just take a few seconds, that's perfect for perf, as it won't really slow down the perf test suite.

oli-obk · 2020-09-22T07:49:09Z

Implementation-wise, this PR lgtm now, so we could merge it and add the perf test later if you prefer. We'd lose the visible improvement in the perf tests, but I'm not sure how much value that has.

ishitatsuyuki · 2020-09-23T09:04:51Z

Perf PR is up at rust-lang/rustc-perf#769.

ishitatsuyuki · 2020-09-24T01:38:28Z

The match-stress-enum perf benchmark is now fully deployed.

Mark-Simulacrum · 2020-09-24T13:56:09Z

r? @oli-obk -- it sounds like you're r+ on this, but not quite sure.

oli-obk · 2020-09-24T15:13:52Z

@bors r+ rollup=never

bors · 2020-09-24T15:13:53Z

📌 Commit 01a771a has been approved by oli-obk

bors · 2020-09-24T17:23:01Z

⌛ Testing commit 01a771a with merge e599b53...

bors · 2020-09-24T19:35:14Z

☀️ Test successful - checks-actions, checks-azure
Approved by: oli-obk
Pushing e599b53 to master...

Also removes the ugly caching that was introduced in rust-lang#76918. It was bolted on without deeper knowledge of the workings of the algorithm. This commit manages to be more performant without any of the complexity. It should be better on representative workloads too.

…ly2, r=varkor

Clarify main code paths in exhaustiveness checking

This PR massively clarifies the main code paths of exhaustiveness checking, by using the `Constructor` enum to a fuller extent. I've been itching to write it for more than a year, but the complexity of matching consts had prevented me. Behold a massive simplification :D. This in particular removes a fair amount of duplication between various parts, localizes code into methods of relevant types when applicable, makes some implicit assumptions explicit, and overall improves legibility a lot (or so I hope). Additionally, after my changes undoing rust-lang#76918 turned out to be a noticeable perf gain.

As usual I tried my best to make the commits self-contained and easy to follow. I've also tried to keep the code well-commented, but I tend to forget how complex this file is; I'm happy to clarify things as needed.

My measurements show good perf improvements on the two match-heavy benchmarks (-18.0% on `unicode_normalization-check`! :D); I'd like a perf run to check the overall impact.

r? `@varkor`

`@rustbot` modify labels: +A-exhaustiveness-checking

exhaustiveness: Rework constructor splitting

`SplitWildcard` was pretty opaque. I replaced it with a more legible abstraction: `ConstructorSet` represents the set of constructors for patterns of a given type. This clarifies responsibilities: `ConstructorSet` handles one clear task, and diagnostic-related shenanigans can be done separately. I'm quite excited; I've had this in mind for years but could never quite introduce it. This opens up possibilities, including type-specific optimisations (like using a `FxHashSet` to collect enum variants, which had been [hackily attempted some years ago](rust-lang#76918)), my one-pass rewrite (rust-lang#116042), and future librarification.
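To make the description above concrete, here is a loose sketch of what splitting a set of enum constructors might look like when the seen variants are collected in a hash set. It is not the rustc `ConstructorSet`; the class name `EnumConstructorSet`, its `split` method, and the string representation of variants are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class EnumConstructorSet:
    """Hypothetical stand-in: all variants of one enum type, in declaration order."""
    variants: list

    def split(self, seen_in_patterns):
        """Partition the variants into those present in the patterns and those missing."""
        seen = set(seen_in_patterns)  # hash set makes each lookup O(1) expected
        present = [v for v in self.variants if v in seen]
        missing = [v for v in self.variants if v not in seen]
        return present, missing
```

With this shape, the set of constructors for a type answers one question (which constructors are present and which are missing), and anything diagnostic-related can be layered on top of the result.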

Add fast path for match checking

7c98f6f

rust-highfive assigned Mark-Simulacrum Sep 19, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Sep 19, 2020

oli-obk reviewed Sep 20, 2020

compiler/rustc_mir_build/src/thir/pattern/_match.rs

Improve code and documentation clarity

f95e4f3

Add debug assertions against slow path reference results

01a771a

ishitatsuyuki force-pushed the match-fastpath branch from bbafd0d to 01a771a Compare September 22, 2020 05:41

ishitatsuyuki mentioned this pull request Sep 23, 2020

Add match-stress-enum benchmark rust-lang/rustc-perf#769

Merged

rust-highfive assigned oli-obk and unassigned Mark-Simulacrum Sep 24, 2020

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Sep 24, 2020

bors added the merged-by-bors This PR was explicitly merged by bors. label Sep 24, 2020

bors merged commit e599b53 into rust-lang:master Sep 24, 2020

rustbot added this to the 1.48.0 milestone Sep 24, 2020

Nadrieril mentioned this pull request Oct 27, 2020

Clarify main code paths in exhaustiveness checking #78430

Merged

Nadrieril mentioned this pull request Nov 6, 2020

Match checking has quadratic average complexity #7462

Closed

Nadrieril mentioned this pull request Oct 3, 2023

exhaustiveness: Rework constructor splitting #116391

Merged
