[DebugInfo@O2] MachineSink can unsoundly extend variable location ranges #43462

jmorse · 2019-11-22T15:25:46Z


Bugzilla Link	44117
Version	trunk
OS	Linux
Blocks	#38116
CC	@avl-llvm,@adrian-prantl,@gregbedwell,@OCHyams,@pogo59,@Melamoto,@vedantk

Extended Description

This is a bug report to document an edge case to do with the machine-sink pass that I don't think can be easily solved.

Here's a highly contrived reproducer that, when compiled with trunk "-O2 -g -c -fno-unroll-loops", will sink the computation of "a & 0xFFFF" into the final block (where the assignment to global is). It also sinks the (salvaged) DBG_VALUE for the first value of "badgers".

--------8<--------
int global, global2;

int
foo(int a, int b)
{
  int floogie = a & 0xFFFF;
  int badgers = floogie + 12;

  if (a == 1234567) {
    badgers = global2; // body uninteresting, but "badgers" reassigned
    badgers ^= a;
    global2 = badgers + 1;
    if (b == 12)
      return global;
  }

  global = floogie;
  return global;
}
-------->8--------

Normally, in the end block, we would not be able to compute a location for "badgers", because we don't know which side of the "a == 1234567" condition was taken. The location would be empty / optimised out.

However, because the DBG_VALUE for "badgers" sinks into that end block, it specifies the variable location as being "floogie+12", regardless of which side of the condition was taken, which is not a true representation of the original program.
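
To make the mismatch concrete, here is a minimal sketch in plain C (not part of the original reproducer). `badgers_at_end` is a hypothetical instrumented copy of foo() that reports the source-level value of "badgers" on reaching the final block, and `sunk_dbg_value_claim` is a hypothetical helper that evaluates what the sunk DBG_VALUE expression ("floogie+12") would report there. Both names are assumptions made for illustration only.

```c
#include <stdbool.h>

int global, global2;

/* Hypothetical instrumented copy of foo(): stores the source-level value of
   "badgers" on reaching the final block. Returns false when the function
   returns early and never reaches that block. */
bool badgers_at_end(int a, int b, int *badgers_out)
{
    int floogie = a & 0xFFFF;
    int badgers = floogie + 12;

    if (a == 1234567) {
        badgers = global2; /* "badgers" reassigned on this path */
        badgers ^= a;
        global2 = badgers + 1;
        if (b == 12)
            return false;
    }

    global = floogie;
    *badgers_out = badgers;
    return true;
}

/* Hypothetical helper: the value the sunk DBG_VALUE expression
   (floogie plus 12) would report in the final block. */
int sunk_dbg_value_claim(int a)
{
    return (a & 0xFFFF) + 12;
}
```

With global2 == 0, calling `badgers_at_end(1234567, 0, &v)` leaves v at 1234567, while `sunk_dbg_value_claim(1234567)` gives 54931. For any other value of `a` the two agree, which is why the sunk location is only wrong on one path.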

This is actually really hard to solve with our current model. If there were no further assignments to "badgers" on any path from the source to destination block, then the DBG_VALUE sinking would be absolutely fine and desirable. However, discovering whether this is true or not involves examining every block that might be on a path from the source to the destination position, which AFAIUI is expensive. Machine sinking doesn't currently do this level of analysis, so I haven't tried to fix it yet.
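
As a rough illustration of the check described above, here is a sketch on a toy CFG. It is not MachineSink code: `ToyCFG`, `mark_reachable` and `sink_is_sound` are hypothetical names, and the model is deliberately simplified (it ignores assignments inside the source and destination blocks themselves, and treats any DBG_VALUE for the variable as an interfering assignment).

```c
#include <stdbool.h>
#include <string.h>

#define MAX_BLOCKS 8

/* Toy CFG: succ[i][j] is true when block i can branch to block j. */
typedef struct {
    int num_blocks;
    bool succ[MAX_BLOCKS][MAX_BLOCKS];
    bool assigns_var[MAX_BLOCKS]; /* block holds an assignment to the variable */
} ToyCFG;

/* Mark every block reachable from `start`, following edges forwards,
   or backwards when `reverse` is true. */
static void mark_reachable(const ToyCFG *g, int start, bool reverse, bool *seen)
{
    int stack[MAX_BLOCKS];
    int top = 0;

    memset(seen, 0, sizeof(bool) * MAX_BLOCKS);
    seen[start] = true;
    stack[top++] = start;
    while (top > 0) {
        int b = stack[--top];
        for (int n = 0; n < g->num_blocks; n++) {
            bool edge = reverse ? g->succ[n][b] : g->succ[b][n];
            if (edge && !seen[n]) {
                seen[n] = true;
                stack[top++] = n;
            }
        }
    }
}

/* True when no block lying on some path from `from` to `to` (other than
   the two end points) assigns the variable. */
bool sink_is_sound(const ToyCFG *g, int from, int to)
{
    bool fwd[MAX_BLOCKS], bwd[MAX_BLOCKS];

    mark_reachable(g, from, false, fwd);
    mark_reachable(g, to, true, bwd);
    for (int b = 0; b < g->num_blocks; b++) {
        if (b == from || b == to)
            continue;
        if (fwd[b] && bwd[b] && g->assigns_var[b])
            return false;
    }
    return true;
}
```

Assuming a block layout for the reproducer of 0 -> {1, 3}, 1 -> {2, 3}, 2 -> 4 and 3 -> 4, with the reassignment of "badgers" in block 1, `sink_is_sound(&g, 0, 3)` returns false. Even this toy version walks the graph twice per sink candidate, which hints at why the real analysis is costly.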

This technically applies to any pass that does any kind of sinking. Instcombine will only sink where there isn't any control flow present though, so I don't think this is a problem instcombine currently demonstrates.

Time for Jeremy's pet peeve: in a more ideal world, one where the machine-location and the instruction-location were separate, we could record an assignment / location-change in the first block of the program, and the machine-location in the last block, and leave it to a debug post-processor to work these things out, when we actually do a full dataflow analysis.

avl-llvm · 2019-11-29T16:29:06Z

Hi Jeremy, I have a question about that example.

Current behavior:

*** IR Dump After Machine Common Subexpression Elimination ***:

%3:gr32 = COPY $edi
DBG_VALUE %3:gr32, $noreg, !"a", !DIExpression(), debug-location !21; test.c:0 line no:5
%5:gr16 = COPY %3.sub_16bit:gr32, debug-location !22; test.c:7:19
%0:gr32 = MOVZX32rr16 killed %5:gr16, debug-location !22; test.c:7:19
DBG_VALUE %0:gr32, $noreg, !"floogie", !DIExpression(), debug-location !21; test.c:0 line no:7
DBG_VALUE %0:gr32, $noreg, !"badgers", !DIExpression(DW_OP_plus_uconst, 12, DW_OP_stack_value), debug-locatio

*** IR Dump After Machine code sinking ***:

bb.0.entry:
successors: %bb.1(0x40000000), %bb.3(0x40000000); %bb.1(50.00%), %bb.3(50.00%)
liveins: $edi, $esi
%3:gr32 = COPY $edi
DBG_VALUE %3:gr32, $noreg, !"a", !DIExpression(), debug-location !21; test.c:0 line no:5
^^^^^ there is no DBG_VALUE for badgers

bb.3.if.end4:
; predecessors: %bb.0, %bb.1
successors: %bb.4(0x80000000); %bb.4(100.00%)

%5:gr16 = COPY %3.sub_16bit:gr32, debug-location !21; test.c:0
%0:gr32 = MOVZX32rr16 %5:gr16, debug-location !21; test.c:0
DBG_VALUE %0:gr32, $noreg, !"floogie", !DIExpression(), debug-location !21; test.c:0 line no:7
DBG_VALUE %0:gr32, $noreg, !"badgers", !DIExpression(DW_OP_plus_uconst, 12, DW_OP_stack_value), debug-location !21; test.c:0 line no:8
^^^^ incorrect DBG_VALUE for badgers

In this example the machine sinking pass moved all the DBG_VALUEs related to the %0:gr32 value together with the real instructions COPY and MOVZX32rr16.

What if the machine sinking pass moved only the single DBG_VALUE that directly relates to the moved instructions? All the other DBG_VALUEs would be left in their original places, though with salvaged values:

*** IR Dump After Machine code sinking ***:

bb.0.entry:
successors: %bb.1(0x40000000), %bb.3(0x40000000); %bb.1(50.00%), %bb.3(50.00%)
liveins: $edi, $esi
%3:gr32 = COPY $edi
DBG_VALUE %3:gr32, $noreg, !"a", !DIExpression(), debug-location !21; test.c:0 line no:5
DBG_VALUE %3:gr32, $noreg, !"badgers", !DIExpression(DW_OP_plus_uconst, 12, DW_OP_convert, DW_ATE_unsigned_32, DW_OP_convert, DW_ATE_unsigned_16, DW_OP_stack_value), debug-location !21; test.c:0 line no:8

bb.3.if.end4:
; predecessors: %bb.0, %bb.1
successors: %bb.4(0x80000000); %bb.4(100.00%)

%5:gr16 = COPY %3.sub_16bit:gr32, debug-location !21; test.c:0
%0:gr32 = MOVZX32rr16 %5:gr16, debug-location !21; test.c:0
DBG_VALUE %0:gr32, $noreg, !"floogie", !DIExpression(), debug-location !21; test.c:0 line no:7

In that case there would be a proper debug value at the place where "badgers" is defined in the original code, and there would be no incorrect value in bb.3.if.end4.

What do you think?

jmorse · 2019-12-03T12:29:49Z

Hi Alexey,

Alexey wrote:

> ^^^^^ there is no DBG_VALUE for badgers

Good spot -- this is actually what D58238 [0] was about, however it got reverted due to a performance regression tracked in [1]. You're absolutely right that there should be some kind of location there. The fix of [0,1] is to add an undef/$noreg to terminate any earlier location.

> What if Machine sinking pass would move only single DBG_VALUE which directly
> relates to moved instructions ? And all other DBG_VALUEs would be left in
> their original places though with salvaged values:
> [...]
> !DIExpression(DW_OP_plus_uconst, 12, DW_OP_convert, DW_ATE_unsigned_32,
> DW_OP_convert, DW_ATE_unsigned_16, DW_OP_stack_value)
> [...]
> In that case there would be proper debug value at the place where "badgers"
> defined in original code. And also there would not be incorrect value in
> bb.3.if.end4

Just to confirm I understand what you're saying: you've added DW_OP_convert to the expression there, meaning we should recover the effect of the sunk MOVZX32rr16 and encode it in the DIExpression, yes?

That would definitely be desirable, and that's what happens when something gets moved / deleted in LLVM-IR. However, after instruction selection this becomes much more difficult as there are literally thousands of machine instructions, many of which have effects that can't be modelled, and which have many different forms generated from templates. It's too much of a burden (on software engineers) to encode all of this information to happen after isel.

IMO, there are other ways we could recover this information, such as re-analysing the LLVM-IR when a location goes missing in the MachineFunction, but that's out of scope for this ticket :)

[0] https://reviews.llvm.org/D58238
[1] llvm/llvm-bugzilla-archive#43855

avl-llvm · 2019-12-03T21:27:59Z

thank you,

> Just to confirm I understand what you're saying:
> you've added DW_OP_convert to the expression there,
> meaning we should recover the effect of the sunk
> MOVZX32rr16 and encode it in the DIExpression, yes?

Correct, though that is a secondary point. I also agree with the following:

> That would definitely be desirable, and that's what happens
> when something gets moved / deleted in LLVM-IR. However, after
> instruction selection this becomes much more difficult as
> there are literally thousands of machine instructions,
> many of which have effects that can't be modelled, and which
> have many different forms generated from templates. It's too much
> of a burden (on software engineers) to encode all of this
> information to happen after isel.

But my suggestion is not about creating a SalvageValue function based on MIR. That would be nice to have, as would other heuristics that allow the proper value to be calculated (like re-analysing the LLVM-IR when a location goes missing in the MachineFunction).

My main point is that we probably should not sink any DBG_VALUE except the very first one. As far as I understand, [0] and [1] would move/clone not only the first DBG_VALUE but, in some cases, other DBG_VALUEs related to the sunk value. I think that when the MachineSinking pass sees the following code:

DBG_VALUE %3:gr32, $noreg, !"a", !DIExpression(), debug-location !21; test.c:0 line no:5
%5:gr16 = COPY %3.sub_16bit:gr32, debug-location !22; test.c:7:19
%0:gr32 = MOVZX32rr16 killed %5:gr16, debug-location !22; test.c:7:19
DBG_VALUE %0:gr32, $noreg, !"floogie", !DIExpression(), debug-location !21; test.c:0 line no:7
DBG_VALUE %0:gr32, $noreg, !"badgers", !DIExpression(DW_OP_plus_uconst, 12, DW_OP_stack_value), debug-locatio

It should move only the first DBG_VALUE ("floogie") and leave the second one ("badgers") in place. Additionally, if it can salvage "badgers" then it would change its expression accordingly. Otherwise, it would set "badgers" to undef.

The reason for this is that only the first DBG_VALUE relates to the moved value (%0:gr32). The other DBG_VALUEs relate to different values that were optimized out earlier and salvaged in terms of the sunk value (%0:gr32). I think it would be generally incorrect to move such connected values. The machine sinking pass knows that it is safe to move the value for "floogie", but whether "badgers" can be sunk is unknown. The problem described in the current PR [2] is exactly such a case: the DBG_VALUE for "badgers" was sunk even though it could not safely be sunk, because of changes made on another control flow path.

Do you think this solution could work: "When sinking an instruction, move only the first corresponding DBG_VALUE and leave all others in place. For every DBG_VALUE left behind, either salvage its value or set it to undef"?

I think there should be no performance regression in that case, since no DBG_VALUE would be cloned, and no DBG_VALUE would be reported incorrectly.
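
A minimal sketch of the proposed rule, on a toy model rather than real MIR. `ToyDbgValue`, `DbgAction` and `apply_first_only_rule` are hypothetical names, and the `salvageable` flag stands in for a salvaging capability that, as discussed above, largely does not exist after isel.

```c
#include <stdbool.h>
#include <stddef.h>

typedef enum { DV_IN_PLACE, DV_SUNK, DV_SALVAGED, DV_UNDEF } DbgAction;

typedef struct {
    const char *variable; /* e.g. "floogie" */
    int reg;              /* virtual register the DBG_VALUE reads */
    bool salvageable;     /* assumption: can it be re-expressed without reg? */
    DbgAction action;     /* filled in by apply_first_only_rule() */
} ToyDbgValue;

/* Apply the rule to the DBG_VALUEs (in program order) that follow a sunk
   instruction defining `sunk_reg`. Returns the number of DBG_VALUEs sunk. */
size_t apply_first_only_rule(ToyDbgValue *dvs, size_t n, int sunk_reg)
{
    size_t sunk = 0;
    bool seen_first = false;

    for (size_t i = 0; i < n; i++) {
        if (dvs[i].reg != sunk_reg) {
            dvs[i].action = DV_IN_PLACE;   /* unrelated to the sunk value */
        } else if (!seen_first) {
            dvs[i].action = DV_SUNK;       /* first user moves with the instruction */
            seen_first = true;
            sunk++;
        } else {
            /* later users stay put: salvage if possible, otherwise undef */
            dvs[i].action = dvs[i].salvageable ? DV_SALVAGED : DV_UNDEF;
        }
    }
    return sunk;
}
```

For the three DBG_VALUEs in the dump above ("a" on %3, then "floogie" and "badgers" on %0), sinking %0 would mark "a" as left in place, "floogie" as sunk, and "badgers" as salvaged or undef depending on the flag.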

[0] https://reviews.llvm.org/D58238
[1] https://reviews.llvm.org/D70672
[2] #43462

jmorse · 2019-12-10T11:52:51Z

> My main point is that we probably should not sink any DBG_VALUE except
> very first one.

Ah, I missed that, sorry. I don't think it's a direction we should pursue: we don't currently track whether a DBG_VALUE refers to something that got optimised or not, and there are code sequences that can be salvaged / common-subexpression-eliminated without changing the DIExpression, which would look like an un-touched DBG_VALUE.

Plus, if there weren't the other interfering assignment to "badgers", I think we would want to sink all the DBG_VALUEs. In that case, the sinking optimisation is only shortening the range that the location is defined over.

> I think it would be generally incorrect to move such connected values.
> [...]
> Do you think that solution could work: "When sinking instruction, move
> only first corresponding DBG_VALUE, leave all others in place. For all
> left DBG_VALUEs - either salvage value either set it to undef"?

I think this hinges on being able to distinguish between the "original" assignment and "connected values" -- unfortunately I don't think we can safely recover that information this far into compilation, variable locations can have been hoisted / CSEd / sunk to such an extent that no dbg.value reflects the "original" assignment.

~

In addition, I think my example might be slightly misleading, because the salvaging of the first assignment to "badgers" isn't really necessary for the fault I'm describing, it's just part of the test I had to hand. The key requirement is that there are two "dead" variable assignments in different blocks. Because they're dead, they don't receive a PHI instruction that would get its own dbg.value. We then enter this situation where sinking a DBG_VALUE would change the instructions that it dominates, and where it's expensive to identify when this happens.

(Note that the original source doesn't necessarily need to have "dead" assignments -- various optimisations like dead store elimination might make a use dead along the way).

avl-llvm · 2019-12-13T15:57:56Z

> Ah, I missed that sorry. I don't think it's a direction we should pursue: we
> don't currently track whether or not a DBG_VALUE refers to something that got
> optimised or not, there are code sequences that can be salvaged /
> common-subexpression-eliminated without changing the DIExpression, and would
> look like an un-touched DBG_VALUE.

Agreed that using the DIExpression as a marker of whether a DBG_VALUE refers to something that got optimized is not suitable. That probably does not make the idea of separating DBG_VALUEs useless, though. When sinking an instruction, we already know that it is safe to sink the value created by that instruction, and we do not know whether it is safe to sink all the other connected values. Thus it would probably be useful to understand which value is directly defined by the instruction and which is not, so that they can be handled differently.

Instead of analyzing the DIExpression, we could probably enforce the rule that the first DBG_VALUE is the one that describes the value defined by the instruction. That seems logical: if other DBG_VALUEs use the value created by that instruction, they cannot appear before the value is created.

If this does not work, we could probably use another way of separating the values (adding a link to the DBG_VALUE into the instruction?)...

> Plus, if there weren't the other interfering assignment to "badgers", I think
> we would want to sink all the DBG_VALUEs. In that case, the sinking
> optimisation is only shortening the range that the location is defined over.

Right, but it is not known whether the other interfering assignment exists.
Also, I think that if we can salvage the value then we do not want to sink all the DBG_VALUEs. For example:

x = 10
...
x++
printf("hello");
x--;

mov r1, 10 // x = 10
dbg.value r1, "x", DIExpression()
mov r2, r1
dbg.value r2, "x", DIExpression()
....
dbg.value r2, "x", DIExpression(DW_OP_plus_uconst, 1, DW_OP_stack_value) // x++
....
printf("hello")
dbg.value r2, "x", DIExpression() // x--

Let's assume that "mov r2, r1" is to be sunk:

mov r1, 10 // x = 10
dbg.value r1, "x", DIExpression()
....
dbg.value r2, "x", DIExpression(DW_OP_plus_uconst, 1, DW_OP_stack_value) // x++
^^^^^^^^^ it is better to not sink it down
^^^^^^^^^ but make it "dbg.value r1, x, DIExpression(DW_OP_plus_uconst, 1, DW_OP_stack_value)"
....
printf("hello")
dbg.value r2, "x", DIExpression() // x--
^^^^^^^^^ it is better to not sink it down
^^^^^^^^^ but make it "dbg.value r1, x, DIExpression()"
....
mov r2, r1
dbg.value r2, "x", DIExpression()

Though we sank the copy of x from r1 to r2, it is still better to see ++ and -- at their original places, so that when we stop the debugger at "printf" we see the x++ value. (For the example from this PR, it would mean that we would see the value of "badgers" closer to its source code location.)

Thus we want to sink a DBG_VALUE only if:

1) we could not salvage it (in the above example, that is when we could not replace dbg.value r2, "x" with dbg.value r1, "x");
2) it is the last value for that variable, i.e. instead of sinking all three DBG_VALUEs:

dbg.value r2, "x", DIExpression()
dbg.value r2, "x", DIExpression(DW_OP_plus_uconst, 1, DW_OP_stack_value) // x++
dbg.value r2, "x", DIExpression() // x--

we sink only the DBG_VALUE related to the last effective value of variable "x":

dbg.value r2, "x", DIExpression()

3) we can prove that sinking did not change the order of assignments.

That would generally require performing a full dataflow analysis for debug values, which could be expensive. Instead of a full dataflow analysis we could probably try cheaper alternatives:
a) do not do that analysis for the original assignment value (its safety is already proved by the optimization).
b) use a simple, fast heuristic for connected values: instead of checking the data flow, just check that there are no other DBG_VALUEs for the variable in question (that would work for the example from this PR).
c) if neither a) nor b) applies, do not sink the value and set it to undef.
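
The three steps above could be sketched roughly as follows. This is a toy model under stated assumptions: `HeurDbgValue`, `SinkDecision` and `decide_sink` are hypothetical names, and the `is_original_assignment` flag is information that MIR does not record today.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

typedef enum { SINK_WITH_INSTRUCTION, LEAVE_AS_UNDEF } SinkDecision;

typedef struct {
    const char *variable;
    bool is_original_assignment; /* assumption: not tracked in MIR today */
} HeurDbgValue;

/* Decide what to do with dvs[candidate], given every DBG_VALUE in the
   function (dvs[0..n-1]). */
SinkDecision decide_sink(const HeurDbgValue *dvs, size_t n, size_t candidate)
{
    /* (a) original assignment: the sinking pass already proved it safe. */
    if (dvs[candidate].is_original_assignment)
        return SINK_WITH_INSTRUCTION;

    /* (b) connected value: sink only when no other DBG_VALUE names the
       same variable anywhere in the function. */
    for (size_t i = 0; i < n; i++) {
        if (i != candidate &&
            strcmp(dvs[i].variable, dvs[candidate].variable) == 0)
            return LEAVE_AS_UNDEF; /* (c) neither (a) nor (b) holds */
    }
    return SINK_WITH_INSTRUCTION;
}
```

For the reproducer, "badgers" has a second DBG_VALUE in the if-body, so the salvaged one would be left as undef, while "floogie" would sink with its instruction.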

> I think this hinges on being able to distinguish between the "original"
> assignment and "connected values" -- unfortunately I don't think we can safely
> recover that information this far into compilation, variable locations can have
> been hoisted / CSEd / sunk to such an extent that no dbg.value reflects the
> "original" assignment.

If we adopted the rule that the first DBG_VALUE shows the original assignment, then we could probably require a DBG_VALUE undef to come first in that case.

> In addition, I think my example might be slightly misleading, because the
> salvaging of the first assignment to "badgers" isn't really necessary for the
> fault I'm describing, it's just part of the test I had to hand. The key
> requirement is that there are two "dead" variable assignments in different
> blocks. Because they're dead, they don't receive a PHI instruction that would
> get its own dbg.value. We then enter this situation where sinking a DBG_VALUE
> would change the instructions that it dominates, and where it's expensive to
> identify when this happens.
>
> (Note that the original source doesn't necessarily need to have "dead"
> assignments -- various optimisations like dead store elimination might make a
> use dead along the way).

Yeah, it is an example where we need an expensive analysis to understand how the DBG_VALUE should be handled. It seems that separating values into "original assignment" and "connected values" could probably help here.
If the "dead variable assignment" is the "original assignment", then we can safely sink the DBG_VALUE with the instruction.
If the "dead variable assignment" is a "connected value", then we leave it at its original place and either salvage it or set it to undef.
As a result, no wrong value would be propagated.

jmorse mentioned this issue Aug 30, 2018

[meta][DebugInfo] Umbrella bug for poor debug experiences #38116

Open

llvmbot transferred this issue from llvm/llvm-bugzilla-archive Dec 10, 2021
