[mlir][vector] Improve vector.gather description #153278

newling · 2025-08-12T20:29:57Z

Hopefully this change to the semantics example will make it clearer what vector.gather does. It wasn't clear to me how indexing worked until I looked at lit lowering tests.

llvmbot · 2025-08-12T20:30:32Z

@llvm/pr-subscribers-mlir-vector

@llvm/pr-subscribers-mlir

Author: James Newling (newling)

Changes

Hopefully this change to the semantics example will make it clearer what vector.gather does. It wasn't clear to me how indexing worked until I looked at lit lowering tests.

Full diff: https://github.com/llvm/llvm-project/pull/153278.diff

1 Files Affected:

(modified) mlir/include/mlir/Dialect/Vector/IR/VectorOps.td (+16-12)

diff --git a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
index 30c1d97ba58f1..db5de0c70d0d0 100644
--- a/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
+++ b/mlir/include/mlir/Dialect/Vector/IR/VectorOps.td
@@ -2058,23 +2058,27 @@ def Vector_GatherOp :
     Results<(outs AnyVectorOfNonZeroRank:$result)> {
 
   let summary = [{
-    gathers elements from memory or ranked tensor into a vector as defined by an
-    index vector and a mask vector
+    Gathers elements from memory or ranked tensor into a vector as defined by an
+    index vector and a mask vector.
   }];
 
   let description = [{
     The gather operation returns an n-D vector whose elements are either loaded
-    from memory or ranked tensor, or taken from a pass-through vector, depending
+    from a k-D memref or tensor, or taken from an n-D pass-through vector, depending
     on the values of an n-D mask vector.
-    If a mask bit is set, the corresponding result element is defined by the base
-    with indices and the n-D index vector (each index is a 1-D offset on the base).
-    Otherwise, the corresponding element is taken from the n-D pass-through vector.
-    Informally the semantics are:
+
+    If a mask bit is set, the corresponding result element is taken from `base`
+    at an index defined by k `indices` and n-D `index_vec`. Otherwise, the element
+    is taken from the pass-through vector. As an example, suppose that base is
+    3-D and the result is 2-D. The indexing semantics are then,
+
     ```
-    result[0] := if mask[0] then base[index[0]] else pass_thru[0]
-    result[1] := if mask[1] then base[index[1]] else pass_thru[1]
-    etc.
+    result[i,j] := if mask[i,j] then
+                      base[indices[0], indices[1], indices[2] + index_vec[i,j]]
+                   else
+                      pass_thru[i,j]
     ```
+    The index into `base` only varies in the dimension k-1.
 
     If a mask bit is set and the corresponding index is out-of-bounds for the
     given base, the behavior is undefined. If a mask bit is not set, the value
@@ -2082,8 +2086,8 @@ def Vector_GatherOp :
     allowed to be out-of-bounds.
 
     The gather operation can be used directly where applicable, or can be used
-    during progressively lowering to bring other memory operations closer to
-    hardware ISA support for a gather.
+    during progressive lowering to bring other memory operations closer to hardware
+    ISA support for a gather.
 
     Examples:

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

amd-eochoalo

Good improvements. Thank you!

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

banach-space · 2025-08-14T14:51:09Z

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

+    result[i,j] := if mask[i,j] then base[i0, i1, i2 + index_vec[i,j]]
+                   else pass_thru[i,j]


This could also be written as:

result[i,j] := if mask[i,j] then base[i0, i1, i2] + index_vec[i,j] else pass_thru[i,j]

As in, base[i0, i1, i2] provides the base address and then index_vec[i,j] is the "element" index, similarly to how pointer arithmetic works in C.

I wanted to bring it up to make sure that our interpretations are consistent. If that's the case, then I would consider rephrasing:

The index into `base` only varies in the innermost ((k-1)-th) dimension.

(which assumes one interpretation) as

The index vector defines the indices from the base address as defined by the offsets.

This is a bit tricky/nuanced though, as Tensors have no notion of "base address" 😅

Taking a step back, we should probably rename the input arguments as:

index -> offsets

index_vec -> indices

Have you thought about it?

@banach-space Thanks for the feedback, and apologies for landing this faster than necessary. Let me know if you think this can be improved further and I'll definitely make a follow-up PR.

With respect to

result[i,j] := if mask[i,j] then base[i0, i1, i2] + index_vec[i,j] else pass_thru[i,j]

I find interpreting base as a pointer less clear.

This is a bit tricky/nuanced though, as Tensors have no notion of "base address" 😅

Exactly!

I'll add that memrefs can be strided (see this test so should strides be included?

Another subtle difference is what 'out of bounds' means. Current lowering ends up as vector.loads of single elements

[...] %foo = vector.load %base[%i, %j] : memref<100x100xf32>, vector<1xf32> [...]

There is nothing in the vector.load definition about out-of-bounds, but I assume the natural definition there would be that if %j excedes 99 above, it's out of bounds a UB. Which I think is more inline with the current definition of adding index_vec[i,j] to i2.

@banach-space Thanks for the feedback, and apologies for landing this faster than necessary.

No worries - this was in review for two days and two reviewers approved it, so it’s totally expected that you landed it. But since post-commit reviews are a thing in LLVM, and this is interesting... 😅

I find interpreting base as a pointer less clear.

Fair enough!

I'll add that memrefs can be strided (see this test so should strides be included?

Hm, not at the vector.gather nor Vector level, no. Are they?

Another subtle difference is what 'out of bounds' means.

UB sounds about right. Masks should take care of "out-of-bounds". If they don't, it would be a UB, yes. Admittedly, we haven't paid that much attention to gathers/scatters - performance is not great and we try to avoid them.

I'll add that memrefs can be strided (see this test so should strides be included?

Hm, not at the vector.gather nor Vector level, no. Are they?

I meant, in that test (copied below)

%0 = vector.gather %base[%c0][%v], %mask, %pass_thru : memref<4xf32, strided<[2]>>, vector<1xindex>, vector<1xi1>, vector<1xf32> into vector<1xf32>

if the explanation of vector.gather was pointer based:

result[i,j] := if mask[i,j] then base[i0, i1, i2] + index_vec[i,j] else pass_thru[i,j]

then we should probably include a stride in the above. i.e. it should + index_vec[i,j] should be + stride * index_vec[i,j]

This is one reason why I think the pointer-based definition of gather a bit less clear. With the tensor-based definition (just add index_vec[i,j] to another index, not a pointer) this question doesn't come up.

Sorry for not replying earlier, I was OOO.

then we should probably include a stride in the above. i.e. it should + index_vec[i,j] should be + stride * index_vec[i,j]

I view this differently. To me, base[i0, i1, i2] + index_vec[i,j] is the vector abstraction and that's all we care about here. Later, these vector level indices are interpreted at either the memref or tensor abstraction levels - that's when "stride" would matter. Put differently, I agree that the actual meaning of this will depend on what base is:

base[i0, i1, i2] + index_vec[i,j]

However, to me that should not be a concern at the vector level.

Anyway, this is a side point - it's obviously totally fine to see things differently. Your change is a much appreciated improvement, lets leave it as is.

newling · 2025-08-14T15:44:15Z

Taking a step back, we should probably rename the input arguments as:

index -> offsets
index_vec -> indices
Have you thought about it?

I hadn't considered renaming variables, but those names would definitely be an improvement.

banach-space · 2025-08-14T18:27:18Z

Taking a step back, we should probably rename the input arguments as:

index -> offsets
index_vec -> indices
Have you thought about it?

I hadn't considered renaming variables, but those names would definitely be an improvement.

#153640

Let me know what you think :)

tighten the example

7b9cff1

newling requested a review from kuhar as a code owner August 12, 2025 20:29

llvmbot added mlir:vectorops mlir mlir:vector labels Aug 12, 2025

kuhar requested review from banach-space, dcaballe and amd-eochoalo August 12, 2025 20:31

kuhar reviewed Aug 12, 2025

View reviewed changes

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td Outdated Show resolved Hide resolved

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td Outdated Show resolved Hide resolved

newling added 2 commits August 12, 2025 14:46

add IR snippet for 3-D to 2-D, remove comment about gradual lowering

6b431e3

k-1 clarify

52133ed

amd-eochoalo approved these changes Aug 13, 2025

View reviewed changes

kuhar approved these changes Aug 13, 2025

View reviewed changes

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td Outdated Show resolved Hide resolved

newling added 2 commits August 12, 2025 17:57

indentation fix

8e9936b

Merge branch 'main' into vector_gather_doc_improvement

e8b8de3

newling merged commit 2796336 into llvm:main Aug 13, 2025
9 checks passed

newling deleted the vector_gather_doc_improvement branch August 13, 2025 20:50

banach-space reviewed Aug 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][vector] Improve vector.gather description #153278

[mlir][vector] Improve vector.gather description #153278

Uh oh!

newling commented Aug 12, 2025

Uh oh!

llvmbot commented Aug 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

amd-eochoalo left a comment

Uh oh!

Uh oh!

Uh oh!

banach-space Aug 14, 2025

Uh oh!

newling Aug 14, 2025

Uh oh!

banach-space Aug 14, 2025

Uh oh!

newling Aug 15, 2025 •

edited

Loading

Uh oh!

banach-space Aug 25, 2025

Uh oh!

newling commented Aug 14, 2025

Uh oh!

banach-space commented Aug 14, 2025

Uh oh!

Uh oh!

		result[i,j] := if mask[i,j] then base[i0, i1, i2 + index_vec[i,j]]
		else pass_thru[i,j]

[mlir][vector] Improve vector.gather description #153278

[mlir][vector] Improve vector.gather description #153278

Uh oh!

Conversation

newling commented Aug 12, 2025

Uh oh!

llvmbot commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amd-eochoalo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

banach-space Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

newling Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

banach-space Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

newling Aug 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

banach-space Aug 25, 2025

Choose a reason for hiding this comment

Uh oh!

newling commented Aug 14, 2025

Uh oh!

banach-space commented Aug 14, 2025

Uh oh!

Uh oh!

llvmbot commented Aug 12, 2025 •

edited

Loading

newling Aug 15, 2025 •

edited

Loading