ASTs for identical versions of reformatted file are parsed repeatedly during bisection #213

akaihola · 2021-09-30T18:56:08Z

If AST verification fails, Darker starts to bisect in order to find the smallest number of extra diff context lines that preserves the AST. As @rogalski noted in #205 about the loop iterations in which AST verification fails:

"During bisect we will be hitting cases where changing context_lines will not include new chunks in reformatted file. I believe caching state of intermediate comparisons (e.g. reformatted file -> comparison result dict) also makes a lot of sense."

Parsing identical versions of the reformatted file repeatedly is of course redundant. We could eliminate the redundant calls and improve performance, e.g. by wrapping the invocation of Black's code in a function decorated with @lru_cache.
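To illustrate the @lru_cache idea, here is a minimal sketch. It assumes the standard library `ast` module as a stand-in for Black's parsing code, and `parse_and_dump`, `is_equivalent` and the `parse_calls` counter are hypothetical names used only for this demonstration, not Darker or Black APIs.

```python
import ast
from functools import lru_cache

parse_calls = 0  # counts real (uncached) parses, for demonstration only


@lru_cache(maxsize=None)
def parse_and_dump(source: str) -> str:
    """Parse source code and return a dump of its AST.

    Hypothetical helper: the stdlib parser stands in for Black's code here.
    """
    global parse_calls
    parse_calls += 1
    # ast.dump() omits line/column attributes by default, so pure
    # formatting differences do not change the result.
    return ast.dump(ast.parse(source))


def is_equivalent(src: str, dst: str) -> bool:
    """Return True if both sources parse to the same AST."""
    return parse_and_dump(src) == parse_and_dump(dst)


baseline = "x = [1,\n     2]\n"
reformatted = "x = [1, 2]\n"

# Simulate several bisection steps which all yield the same reformatted text.
results = [is_equivalent(baseline, reformatted) for _ in range(5)]
```

In this sketch the five comparisons trigger only two real parses, one per distinct source string. Since strings are hashable, they can be used directly as cache keys.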

Note that if we decide to partially reimplement AST verification as suggested in #211 and #212 (both closed by merged #214), we'd probably be caching results of black.parsing.parse_ast(dst_ast). Thus, the caching should only be implemented after those changes are either made or rejected.

The downside of re-implementing AST verification in Darker is that we wouldn't automatically benefit from any possible future refinements in Black to that code.


akaihola · 2021-10-31T19:54:01Z

#214 implemented the suggested caching mechanism as part of the ASTVerifier.is_equivalent_to_baseline() method.
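For readers who want a rough picture of method-level caching during bisection, here is a sketch. It is not the code from #214: `CachingVerifier`, `smallest_context` and `fake_reformat` are hypothetical, the stdlib `ast` module stands in for Black's parser, and the search assumes that once a context size preserves the AST, every larger size does too.

```python
import ast
from typing import Callable, Dict


class CachingVerifier:
    """Hypothetical stand-in for a verifier; not Darker's ASTVerifier."""

    def __init__(self, baseline: str) -> None:
        self._baseline_dump = ast.dump(ast.parse(baseline))
        self._results: Dict[str, bool] = {}  # content -> comparison result
        self.parse_count = 0  # real parses performed, for demonstration

    def is_equivalent_to_baseline(self, content: str) -> bool:
        """Compare content to the baseline, parsing each distinct text once."""
        if content not in self._results:
            self.parse_count += 1
            try:
                dump = ast.dump(ast.parse(content))
            except SyntaxError:
                self._results[content] = False
            else:
                self._results[content] = dump == self._baseline_dump
        return self._results[content]


def smallest_context(
    verifier: CachingVerifier,
    reformat: Callable[[int], str],
    max_context: int,
) -> int:
    """Bisect for the smallest context size whose output passes verification.

    Assumes max_context itself passes and that passing is monotonic.
    """
    low, high = 0, max_context
    while low < high:
        mid = (low + high) // 2
        if verifier.is_equivalent_to_baseline(reformat(mid)):
            high = mid
        else:
            low = mid + 1
    return low


def fake_reformat(context_lines: int) -> str:
    # Toy model: too little context yields output which changes the AST.
    return "x = 1\ny = 2\n" if context_lines >= 3 else "x = 1\ny = 3\n"


verifier = CachingVerifier("x = 1\ny  =  2\n")
best = smallest_context(verifier, fake_reformat, max_context=16)
```

In this toy run the search checks four context sizes (8, 4, 2 and 3), but only two distinct reformatted texts exist, so only two parses happen. Keying the cache on the reformatted content rather than on the context size is what makes the repeated checks cheap.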

akaihola added the "performance" label (Speed or memory usage improvement) Sep 30, 2021

akaihola added this to the 1.4.0 milestone Sep 30, 2021

akaihola mentioned this issue Sep 30, 2021

If bisect is attempted, performance suffers #205

Closed

akaihola linked a pull request Oct 31, 2021 that will close this issue

ASTVerifier #214

Merged

akaihola closed this as completed Oct 31, 2021
