
[LangRef] Clarify semantics of masked vector load/store #82469

Merged
1 commit merged into llvm:main on Aug 3, 2024

Conversation

@RalfJung (Contributor) commented Feb 21, 2024

Basically, these operations are equivalent to a loop that iterates all elements and then does a getelementptr (without inbounds!) plus load/store only for the masked-on elements.
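As an editorial illustration of this equivalence (not part of the commit; the function name and the 2-lane width are made up), here is the per-element loop from the description, unrolled for a 2-lane `llvm.masked.load`: each masked-on lane does a plain (non-inbounds) `getelementptr` plus a scalar `load`, and masked-off lanes keep the passthru value and touch no memory.

```llvm
define <2 x i32> @masked_load_equiv(ptr %p, <2 x i1> %mask, <2 x i32> %passthru) {
entry:
  %m0 = extractelement <2 x i1> %mask, i32 0
  br i1 %m0, label %load0, label %next0

load0:
  %p0 = getelementptr i32, ptr %p, i64 0      ; lane 0 address (no 'inbounds')
  %v0 = load i32, ptr %p0
  %ins0 = insertelement <2 x i32> %passthru, i32 %v0, i32 0
  br label %next0

next0:
  %acc0 = phi <2 x i32> [ %ins0, %load0 ], [ %passthru, %entry ]
  %m1 = extractelement <2 x i1> %mask, i32 1
  br i1 %m1, label %load1, label %done

load1:
  %p1 = getelementptr i32, ptr %p, i64 1      ; lane 1 address (no 'inbounds')
  %v1 = load i32, ptr %p1
  %ins1 = insertelement <2 x i32> %acc0, i32 %v1, i32 1
  br label %done

done:
  %res = phi <2 x i32> [ %ins1, %load1 ], [ %acc0, %next0 ]
  ret <2 x i32> %res
}
```

The plain (non-inbounds) `getelementptr` matters because it keeps the address computation for masked-off lanes well-defined even when those lanes lie outside the allocation.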

@llvmbot (Member) commented Feb 21, 2024

@llvm/pr-subscribers-llvm-ir

Author: Ralf Jung (RalfJung)

Changes

This is based on what I think has to follow from the statement about preventing exceptions. But I don't actually know what LLVM IR passes will do with these intrinsics, so this requires careful review by someone who does. :)

@nikic do you know these passes / know who knows these passes to do the review?

Also, there's an open question that remains: for the purpose of noalias, do these operations access the masked-off lanes or not? I sure hope they don't, but I realized that while data races are mentioned, noalias is not.


Full diff: https://github.com/llvm/llvm-project/pull/82469.diff

1 file affected:

  • (modified) llvm/docs/LangRef.rst (+2)
diff --git a/llvm/docs/LangRef.rst b/llvm/docs/LangRef.rst
index fd2e3aacd0169c..496773c4d3d386 100644
--- a/llvm/docs/LangRef.rst
+++ b/llvm/docs/LangRef.rst
@@ -23752,6 +23752,7 @@ Semantics:
 
 The '``llvm.masked.load``' intrinsic is designed for conditional reading of selected vector elements in a single IR operation. It is useful for targets that support vector masked loads and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar load operations.
 The result of this operation is equivalent to a regular vector load instruction followed by a 'select' between the loaded and the passthru values, predicated on the same mask. However, using this intrinsic prevents exceptions on memory access to masked-off lanes.
+In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
 
 
 ::
@@ -23794,6 +23795,7 @@ Semantics:
 
 The '``llvm.masked.store``' intrinsics is designed for conditional writing of selected vector elements in a single IR operation. It is useful for targets that support vector masked store and allows vectorizing predicated basic blocks on these targets. Other targets may support this intrinsic differently, for example by lowering it into a sequence of branches that guard scalar store operations.
 The result of this operation is equivalent to a load-modify-store sequence. However, using this intrinsic prevents exceptions and data races on memory access to masked-off lanes.
+In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
 
 ::
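To make the added sentence concrete, here is a minimal editorial LLVM IR sketch (not part of the patch; the function name and buffer size are made up): a 4-lane masked store over a 3-element buffer, where only the three masked-on lanes need to be inbounds of the allocation.

```llvm
declare void @llvm.masked.store.v4i32.p0(<4 x i32>, ptr, i32 immarg, <4 x i1>)

define void @store_first_three(<4 x i32> %val) {
  %buf = alloca [3 x i32]                      ; room for lanes 0-2 only
  ; Lane 3 would fall past the end of %buf, but it is masked off and therefore
  ; never accessed; lanes 0-2 are masked on and inbounds of %buf.
  call void @llvm.masked.store.v4i32.p0(<4 x i32> %val, ptr %buf, i32 4,
                                        <4 x i1> <i1 1, i1 1, i1 1, i1 0>)
  ret void
}
```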
 

Review thread on llvm/docs/LangRef.rst, on the sentence added to the llvm.masked.load semantics ("In particular, this means that only the masked-on lanes of the vector need to be inbounds of an allocation ..."):
@RalfJung (Contributor, Author):
Is "masked-on" the opposite of "masked-off"? Or is there some other term I could use?

nikic requested a review from topperc on February 21, 2024, 08:07.
@nikic (Contributor) commented Feb 21, 2024

I would rephrase this in terms of something like this:

> However, these intrinsics behave as if the masked-off lanes are not accessed.

That should tell us everything necessary about their semantics. We can then continue to clarify that this means no exceptions / data races / etc.

@RalfJung (Contributor, Author):
That doesn't quite say everything -- there's the question of whether this Rust PR should say offset (aka getelementptr inbounds) or offset_wrapping (aka getelementptr) when describing how the pointers to the individual elements being loaded are computed.

@programmerjake (Contributor):
> That doesn't quite say everything -- there's the question of whether this Rust PR should say offset (aka getelementptr inbounds)

There is the additional caveat that LLVM is allowed to create a poison value without UB (which is what happens with getelementptr inbounds and an out-of-bounds index), but Rust defines an out-of-bounds offset to be immediate UB, rather than deferring the UB to the load/store.

A major difference between the two choices is that doing a masked load on a pointer before the beginning of its allocation is disallowed with inbounds, but allowed without inbounds, as long as the vector elements are masked off until the offset is large enough to be within the allocation's bounds.
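For concreteness, a minimal LLVM IR sketch of the inbounds distinction described above (an editorial illustration, not from the thread; the function name is made up and %base is assumed to point into a small allocation, so index 100 is out of bounds for it):

```llvm
define void @gep_contrast(ptr %base) {
  ; 'inbounds' GEP whose result is out of bounds: yields poison,
  ; which is UB only if the poison pointer is actually dereferenced.
  %a = getelementptr inbounds i32, ptr %base, i64 100
  ; Plain GEP: a well-defined (if dangling) pointer value.
  %b = getelementptr i32, ptr %base, i64 100
  ret void
}
```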

@RalfJung (Contributor, Author) commented Feb 21, 2024

> A major difference between the two choices is that doing a masked load on a pointer before the beginning of its allocation is disallowed with inbounds, but allowed without inbounds, as long as the vector elements are masked off until the offset is large enough to be within the allocation's bounds.

Yes, that is indeed the key point: if the first half of the vector is masked off, and that first half is actually out-of-bounds, then the pointer itself is conceptually out-of-bounds, and "computing the pointer to the actually loaded element" would be a non-inbounds pointer computation. I expect this use case to be allowed, which is why I added the following in this PR:

> Only the masked-on lanes of the vector need to be inbounds of an allocation (but all these lanes need to be inbounds of the same allocation).
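A hypothetical LLVM IR example of exactly this scenario (editorial, not from the thread; names and sizes are made up): the base pointer itself points before the allocation, but every masked-on lane lands inside it.

```llvm
declare <4 x i32> @llvm.masked.load.v4i32.p0(ptr, i32 immarg, <4 x i1>, <4 x i32>)

define <4 x i32> @load_tail_only() {
  %buf = alloca [2 x i32]
  %p = getelementptr i32, ptr %buf, i64 -2     ; points before %buf; no 'inbounds'
  ; Lanes 0 and 1 would be out of bounds, so they are masked off; lanes 2 and 3
  ; fall inside %buf (their values are merely uninitialized here).
  %v = call <4 x i32> @llvm.masked.load.v4i32.p0(ptr %p, i32 4,
                                                 <4 x i1> <i1 0, i1 0, i1 1, i1 1>,
                                                 <4 x i32> zeroinitializer)
  ret <4 x i32> %v
}
```

Under the wording added by this PR, such a call would be allowed, since the masked-off lanes are never accessed and all masked-on lanes are inbounds of the same allocation.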

RalfJung force-pushed the vector-masked branch 2 times, most recently from 32f4ea4 to 9c21fa7, on May 2, 2024, 07:07.
@RalfJung (Contributor, Author) commented May 2, 2024

@nikic I have updated the wording to:

> The result of this operation is equivalent to a regular vector load instruction followed by a 'select' between the loaded and the passthru values, predicated on the same mask, except that the masked-off lanes are not accessed.

This is followed by clarifications regarding exceptions, noalias, and data races. Does that work for you?

@nikic (Contributor) left a review:
LGTM, but a second opinion wouldn't hurt.

nikic changed the title from "clarify semantics of masked vector load/store" to "[LangRef] Clarify semantics of masked vector load/store" on May 3, 2024.
nikic requested a review from preames on May 3, 2024, 03:32.
@RalfJung (Contributor, Author):
@llvm/pr-subscribers-llvm-ir this PR has one review but "a second opinion wouldn't hurt" -- would be nice if someone could take a look. :)

@RalfJung (Contributor, Author) commented Jul 22, 2024

@nikic any recommendation for how one could get a second opinion for this PR? I don't know how to navigate the LLVM review process to move this PR forwards...

@RalfJung (Contributor, Author) commented Aug 2, 2024

@nunoplopes any chance you could take a look at this? :)

@appujee (Contributor) commented Aug 2, 2024

cc: @fhahn and @alexey-bataev, who are code owners of the autovectorizer

@alexey-bataev (Member):
No objections from my side

@RalfJung (Contributor, Author) commented Aug 3, 2024

Thanks! Could someone merge this please then? :)

@nikic (Contributor) commented Aug 3, 2024

> Thanks! Could someone merge this please then? :)

Sure, but you need to update the PR description first, which becomes the commit message.

@RalfJung (Contributor, Author) commented Aug 3, 2024

@nikic I updated the description, does that work?

@nunoplopes (Member):
LGTM

nikic merged commit 79f7630 into llvm:main on Aug 3, 2024.
5 checks passed
banach-space pushed a commit to banach-space/llvm-project that referenced this pull request on Aug 7, 2024.
kstoimenov pushed a commit to kstoimenov/llvm-project that referenced this pull request on Aug 15, 2024.