Formatter fixes #1840

petervdonovan · 2023-06-11T05:22:00Z

This fixes some of the problems in #1832.

(1) address interaction of semicolons with at clauses
(2) fix comment duplication after annotations that precede reactor definitions
(3) fix removal of the contents of parenthesized list initializer expressions in C target
(4) fix lack of idempotency
(5) do not put comments that were not in {= =} into the {= =}
(6) do not eliminate comments that appear at the start of a code block that is an initializer

Point 4 might be merely a result from point 3, since the previous point results from destructive modification of the ECore model that might somehow violate the assumptions of the line breaker. Therefore, it might be solved; I'm not sure.

I am not sure about point (5). Fixing it would likely result in more special cases in the formatter. I would also like to point out that the current behavior is safe and semantics-preserving because comments that appear right after an opening {= do not seem to be included in the resulting Code ECore object. Instead, I think they are treated as LF-level comments. As a result, a program with the following reaction compiles with the C target:

    reaction(t) -> y {= # this is a test
       lf_set(y, self->count);
       self->count++;
    =}

whereas the following does not:

    reaction(t) -> y {= # this is a test
       # this is another test
       lf_set(y, self->count);
       self->count++;
    =}

Note the usage of a # sign, which is not a valid C comment.

(7) do not copy comments to the outside of initializers that use code:

    init_x1 = {= Va_eq * (1.0 - 1.911199519984605 /*a[1]*/ - 0.001916104193780 /*b[1]*/) =},

formats to

    /** a[1] */
    /** b[1] */
    init_x1 = {= Va_eq * (1.0 - 1.911199519984605 /*a[1]*/ - 0.001916104193780 /*b[1]*/) =},

doSwitch is what finds the comments. If you skip doSwitch and call directly into the code that is specific to the given EObject, you will drop comments.

There is some black magic happening in the implementation of EList::add that removes EObjects from their containers when they are added to a new container. This works around that by just not adding the EObjects to a container.

This only applies when the comment is in the body of the code block. According to the grammar, the body does not start until some tokens appear -- that means non-whitespace, non-comment characters.

petervdonovan · 2023-06-13T00:33:04Z

I think the only remaining problem is (5).

I do not disagree with Christian on this, but it is not exactly a trivial change (it might take an afternoon to get it right). Therefore, this PR is awaiting a consensus on problem (5).

~~EDIT: Actually I just realized that I also introduced another issue. Comments are deleted when they appear at the beginning of a code block. This will be fixed easily tomorrow.~~ DONE

This may slow down the formatter. It also is not certain to work because isEqual could contain mistakes. However, it may substantially reduce the likelihood that the formatter silently produces wrong results. It is hard to be sure about this, however, because it is already rare for the formatter to produce wrong results.

cmnrd

Thanks a lot for those fixes. They look good to me. I also tested this version of the formatter on the examples repo and it seems to fix the problems that I discovered previously.

I think we should also add tests based on the corner cases that I listed in #1832. We could add new integration tests, but probably it would be better to add specific formatter unit tests like the simple ones we already have in LffCliTest.java

core/src/main/java/org/lflang/ast/ParsingUtils.java

This avoids inlining comments when the target syntactic element is multiple lines long. In such cases the comment instead gets its own line.

If a code block begins with a multiline comment, the multiline comment should still belong to the code block.

petervdonovan · 2023-06-14T21:02:16Z

I'd like to draw attention to two significant changes other than the bugfixes:

The formatter now does an isEqual check to make sure that its output is semantically equivalent to its input
Comments are no longer inlined if their target syntactic element is more than one line long.

The latter of these two changes is responsible for the big diff in our tests. The other bugfixes did not affect our tests because they only addressed corner cases that were found in the examples repo.

I believe this is the CRLF/LF thing.

test/C/temp.txt

lhstrh

Looks good to me, modulo the .txt file that I commented on.

cmnrd · 2023-06-15T09:03:50Z

Great! Thanks a lot @petervdonovan!

petervdonovan added 7 commits June 10, 2023 17:27

Fix comment duplication issue.

43a0543

Do not destructively modify Ecore model.

a0ffd9c

Do not skip doSwitch.

8a663c4

doSwitch is what finds the comments. If you skip doSwitch and call directly into the code that is specific to the given EObject, you will drop comments.

Try again to fix destructive reads of init list.

b24c0f9

Fix NPE.

a7588fb

Really fix the destructive copying this time.

9be6955

There is some black magic happening in the implementation of EList::add that removes EObjects from their containers when they are added to a new container. This works around that by just not adding the EObjects to a container.

Do not extract comments that are in code blocks.

f44bf4f

This only applies when the comment is in the body of the code block. According to the grammar, the body does not start until some tokens appear -- that means non-whitespace, non-comment characters.

petervdonovan linked an issue Jun 13, 2023 that may be closed by this pull request

Bugs in formatter #1832

Closed

petervdonovan added 6 commits June 12, 2023 17:53

Add semicolon.

c70b5b8

Equality is symmetric.

d32e80f

Fixes in isEqual.

72cfe46

Do not capture certain comments in toText.

7c50334

More tinkering with comments around Code eObjects.

202d425

petervdonovan force-pushed the formatter-fixes branch from 438faf2 to 202d425 Compare June 14, 2023 01:38

cmnrd reviewed Jun 14, 2023

View reviewed changes

core/src/main/java/org/lflang/ast/ParsingUtils.java Outdated Show resolved Hide resolved

petervdonovan added 5 commits June 14, 2023 12:44

Reduce comment inlining.

8b37f2b

This avoids inlining comments when the target syntactic element is multiple lines long. In such cases the comment instead gets its own line.

Rename Utils -> Util.

1dffb58

Do not move multiline comments out.

8b28c94

If a code block begins with a multiline comment, the multiline comment should still belong to the code block.

Add tests.

8ac18cd

Remove files accidentally committed.

907196d

petervdonovan marked this pull request as ready for review June 14, 2023 21:00

Try to pass formatting test on Windows.

bb538e6

I believe this is the CRLF/LF thing.

petervdonovan enabled auto-merge June 14, 2023 22:06

lhstrh reviewed Jun 14, 2023

View reviewed changes

test/C/temp.txt Outdated Show resolved Hide resolved

lhstrh requested changes Jun 14, 2023

View reviewed changes

Remove accidental check-in.

bad3fa2

lhstrh approved these changes Jun 15, 2023

View reviewed changes

petervdonovan added this pull request to the merge queue Jun 15, 2023

Merged via the queue into master with commit 864ac74 Jun 15, 2023

petervdonovan deleted the formatter-fixes branch June 15, 2023 08:04

lhstrh added the bugfix label Aug 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Formatter fixes #1840

Formatter fixes #1840

petervdonovan commented Jun 11, 2023 •

edited

Loading

petervdonovan commented Jun 13, 2023 •

edited

Loading

cmnrd left a comment •

edited

Loading

petervdonovan commented Jun 14, 2023

lhstrh left a comment

cmnrd commented Jun 15, 2023

Formatter fixes #1840

Formatter fixes #1840

Conversation

petervdonovan commented Jun 11, 2023 • edited Loading

petervdonovan commented Jun 13, 2023 • edited Loading

cmnrd left a comment • edited Loading

Choose a reason for hiding this comment

petervdonovan commented Jun 14, 2023

lhstrh left a comment

Choose a reason for hiding this comment

cmnrd commented Jun 15, 2023

petervdonovan commented Jun 11, 2023 •

edited

Loading

petervdonovan commented Jun 13, 2023 •

edited

Loading

cmnrd left a comment •

edited

Loading