darker can bisect context lines without a reason #204

rogalski · 2021-09-23T22:21:57Z

This bug is based on my experiments with darker on a closed-source repo.
I am not at liberty to disclose any code, so we'll have to recreate the failure conditions on our own.

Steps to reproduce

Create a file with mixed tabs and spaces (it has to trigger parsing the file in Python 2.7 mode)
Reformat the whole file to build the list of chunks
Observe that the reformatted file is not AST-equivalent to the input file.

If the initial reformat does not produce an equivalent AST, all of the bisection steps will fail anyway.
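Since the original file cannot be shared, here is a minimal sketch of what such an input might look like. The sample source and both helper names are hypothetical, not taken from the reporter's repository or from Darker. It only shows that a block indented with spaces on one line and a tab on the next is rejected by the Python 3 parser, which is why such a file can only be handled in a Python 2 parsing mode.

```python
import ast

# Hypothetical sample: the second line is indented with eight spaces,
# the third with a single tab. Python 2 treats these as the same level.
MIXED_SOURCE = "if True:\n        x = 1\n\ty = 2\n"


def has_mixed_indentation(source: str) -> bool:
    """Return True if both tabs and spaces occur in leading whitespace."""
    kinds = set()
    for line in source.splitlines():
        indent = line[: len(line) - len(line.lstrip(" \t"))]
        kinds.update(indent)
    return {" ", "\t"} <= kinds


def parses_with_python3(source: str) -> bool:
    """Return True if the running Python 3 interpreter can parse the source."""
    try:
        ast.parse(source)
    except SyntaxError:  # TabError is a subclass of SyntaxError
        return False
    return True
```

Writing `MIXED_SOURCE` to a file should give a starting point for reproducing the report, although it is not guaranteed to trigger the same failure as the reporter's file.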


akaihola · 2021-09-24T07:09:29Z

Thanks @rogalski!

I created mixed.py (had to rename to .txt to attach here) which contains mixed tabs and spaces.

If I run Black on it, I get this:

$ black --diff mixed.py   
error: cannot format mixed.py:
INTERNAL ERROR:
Black produced code that is not equivalent to the source on pass 1.
Please report a bug on https://github.com/psf/black/issues.
This diff might be helpful: /tmp/blk_e5me14ox.log
Oh no! 💥 💔 💥
1 file would fail to reformat.

Here's blk_e5me14ox.log from Black.

So I presume this is what happens when Darker processes this file as well. The call to black.format_str() succeeds, but the AST verification fails. In this situation, Darker has no choice but to assume that the failure isn't caused by Black but by Darker's own diff-matching algorithm, which can fail on complex and ambiguous edits in the file. It then proceeds to expand the edited chunks in order to apply the fewest possible extra reformattings that preserve the AST.

I've seen AST verification failures happen on real-life files which don't contain mixed tabs and spaces, and the chunk expansion algorithm is a valid work-around for those situations.
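To make the cost of the work-around concrete, here is a rough sketch of a bisection over the amount of extra context to reformat. It is an illustration under stated assumptions, not Darker's actual code: `smallest_passing_context` and the `passes` callback are hypothetical names, and the search assumes that once some amount of context passes verification, any larger amount passes too.

```python
from typing import Callable, Optional


def smallest_passing_context(
    max_context: int, passes: Callable[[int], bool]
) -> Optional[int]:
    """Bisect for the smallest context size in [0, max_context] that passes.

    ``passes(n)`` is a hypothetical callback that reformats the edited
    chunks expanded by ``n`` context lines and reports whether the AST is
    preserved. Returns None if even ``max_context`` does not pass.
    """
    low, high = 0, max_context
    while low < high:
        mid = (low + high) // 2
        if passes(mid):
            high = mid  # mid works, try to get away with less context
        else:
            low = mid + 1  # mid fails, more context is needed
    return low if passes(low) else None
```

When no amount of context can pass, as in the mixed tabs-and-spaces case, every probe in this loop fails before the search gives up. That is the wasted work this issue is about.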

Could we short-circuit that mechanism here? How do we detect that it's no use even trying to find a minimal set of extra reformattings?

rogalski · 2021-09-24T11:16:25Z

My intuitive reasoning was as follows:

darker is just diff-based black
if black crashes on an input file and bails out, darker can (and should) bail out as well.

This is of course a design decision more than an actual bug, yet it vastly simplifies the implementation and corner-case handling while keeping things relatively straightforward for the customer.

akaihola · 2021-09-25T13:46:01Z

Darker directly calls black.format_str() which doesn't do AST verification and thus succeeds in our example case. Darker then calls darker.verification.verify_ast_unchanged(), and acts based on the result of that.
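For readers unfamiliar with this kind of check, the idea behind an "AST unchanged" verification can be sketched with the standard library alone. This is a stand-in for illustration, not the implementation of darker.verification.verify_ast_unchanged(); the function name `ast_unchanged` is hypothetical, and unlike Black's own check it only works on source that the running Python interpreter can parse.

```python
import ast


def ast_unchanged(original: str, reformatted: str) -> bool:
    """Return True if both sources parse to the same abstract syntax tree.

    ``ast.dump`` omits line and column attributes by default, so pure
    whitespace and layout changes do not affect the comparison.
    """
    return ast.dump(ast.parse(original)) == ast.dump(ast.parse(reformatted))
```

A pure formatting change such as `x=1` to `x = 1` compares equal, while a change that alters a value or the program structure does not.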

What we could add is an extra step:
If AST verification fails when applying reformatting strictly to modified lines only, Darker could try to verify the result with all lines reformatted. If that fails, Darker would bail out early and display an error. If it passes, Darker would continue with bisection as before. Does this match what you have in mind?
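The proposed control flow can be sketched as follows. All names here are hypothetical placeholders passed in as callables; none of them are existing Darker or Black functions, and the sketch only illustrates the order of the checks described above.

```python
from typing import Callable


class NotEquivalentError(Exception):
    """Raised when even a full-file reformat changes the AST."""


def reformat_with_early_bailout(
    original: str,
    reformat_modified_only: Callable[[str], str],
    reformat_all: Callable[[str], str],
    ast_unchanged: Callable[[str, str], bool],
    bisect_chunks: Callable[[str], str],
) -> str:
    # Step 1: reformat strictly the modified lines and verify the result.
    partial = reformat_modified_only(original)
    if ast_unchanged(original, partial):
        return partial

    # Step 2 (the proposed extra step): verify the fully reformatted file.
    full = reformat_all(original)
    if not ast_unchanged(original, full):
        # The formatter itself breaks the file, so bisection cannot help.
        raise NotEquivalentError("full reformat is not AST-equivalent")

    # Step 3: the full reformat is safe, so fall back to bisection as before.
    return bisect_chunks(original)
```

The extra verification costs one additional comparison in the failing path, but it avoids running the whole bisection when no chunk expansion could succeed.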

I drafted a diagram showing the added new behavior:

rogalski · 2021-09-26T20:04:17Z

Sounds good to me 👍

akaihola · 2022-02-18T06:53:42Z

Moving this to milestone 1.5.0, since there are already plenty of bugfixes and documentation improvements coming up for a 1.4.2 bugfix release.

akaihola · 2022-03-30T20:22:59Z

@rogalski & @overratedpro, is there any chance you could review if I start to implement this for release 1.5.0?

rogalski mentioned this issue Sep 23, 2021

Missing --jobs support #178

Closed

akaihola added the duplicate, performance, and question labels and removed the duplicate label Sep 24, 2021

akaihola added this to the 1.3.2 milestone Oct 5, 2021

akaihola modified the milestones: 1.3.2, 1.4.0 Oct 28, 2021

akaihola self-assigned this Oct 31, 2021

akaihola removed the question label Oct 31, 2021

akaihola modified the milestones: 1.4.0, 1.4.1, 1.4.2 Feb 8, 2022

akaihola modified the milestones: 1.4.2, 1.5.0 Feb 18, 2022

akaihola modified the milestones: 1.5.0, 1.5.1 Apr 5, 2022

akaihola modified the milestones: 1.5.1, 1.6.0 Sep 13, 2022

akaihola modified the milestones: 1.6.0, 1.6.1 Dec 19, 2022

akaihola added this to the 1.8.0 milestone Dec 25, 2022

akaihola modified the milestones: Darker 1.8.0 – Flynt compatibility, Darker 1.9.0 – Graylint Mar 26, 2023

akaihola added this to Darker and Graylint development Mar 17, 2024

akaihola mentioned this issue Oct 19, 2024

Support ruff check --fix as a formatter #758

Open
