[mlir][vector] Determine vector sizes from the result shape for tensor.pad operation #97248
Conversation
Thank you for submitting a Pull Request (PR) to the LLVM Project! This PR will be automatically labeled and the relevant teams will be notified. If you wish to, you can add reviewers by using the "Reviewers" section on this page. If this is not working for you, it is probably because you do not have write permissions for the repository. If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR. If you have further questions, they may be answered by the LLVM GitHub User Guide. You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.
@llvm/pr-subscribers-mlir-linalg @llvm/pr-subscribers-mlir

Author: xiaohui1.xu (BRUCE11111)

Changes
Full diff: https://github.com/llvm/llvm-project/pull/97248.diff 3 Files Affected:
diff --git a/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp b/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
index 3a75d2ac08157..a826b153e979f 100644
--- a/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
+++ b/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp
@@ -1708,6 +1708,13 @@ vectorizeAsTensorPadOp(RewriterBase &rewriter, tensor::PadOp padOp,
.reifyResultShapes(rewriter, reifiedReturnShapes);
(void)status; // prevent unused variable warning on non-assert builds
assert(succeeded(status) && "failed to reify result shapes");
+
+ // If the input vector sizes are not provided, then the vector sizes are
+ // determined by the result tensor shape.
+ if (inputVectorSizes.empty()) {
+ inputVectorSizes =
+ mlir::dyn_cast<RankedTensorType>(padOp->getResultTypes()[0]).getShape();
+ }
auto maskedRead = vector::createReadOrMaskedRead(
rewriter, loc, padOp.getSource(), inputVectorSizes, padValue,
/*useInBoundsInsteadOfMasking=*/false);
@@ -1915,9 +1922,16 @@ vectorizePadOpPrecondition(tensor::PadOp padOp,
return failure();
}
+ bool satisfyEmptyCond = true;
+ if (inputVectorSizes.empty()) {
+ if (!mlir::dyn_cast<RankedTensorType>(padOp->getResultTypes()[0])
+ .hasStaticShape() ||
+ !padOp.getSourceType().hasStaticShape())
+ satisfyEmptyCond = false;
+ }
ArrayRef<int64_t> resultTensorShape = padOp.getResultType().getShape();
- if (failed(vector::isValidMaskedInputVector(resultTensorShape,
- inputVectorSizes)))
+ if (!satisfyEmptyCond && failed(vector::isValidMaskedInputVector(
+ resultTensorShape, inputVectorSizes)))
return failure();
if (llvm::any_of(padOp.getLow(), [](Value v) {
diff --git a/mlir/test/Dialect/Linalg/vectorization-unsupported.mlir b/mlir/test/Dialect/Linalg/vectorization-unsupported.mlir
index 5d3c07c8e23c1..ea7b81871ede3 100644
--- a/mlir/test/Dialect/Linalg/vectorization-unsupported.mlir
+++ b/mlir/test/Dialect/Linalg/vectorization-unsupported.mlir
@@ -126,3 +126,23 @@ module attributes {transform.with_named_sequence} {
transform.yield
}
}
+
+ // -----
+
+func.func @test_pad_no_vectorize_dynamic_shape(%arg0: tensor<?x?xf32>, %arg1: tensor<4x16xf32>) -> tensor<4x16xf32> {
+ %f0 = arith.constant 0.0 : f32
+ // expected-error @+1 {{Attempted to vectorize, but failed}}
+ %pad = tensor.pad %arg0 low[0, 0] high[1,1] {
+ ^bb0(%arg3: index, %arg4: index):
+ tensor.yield %f0 : f32
+ } : tensor<?x?xf32> to tensor<4x16xf32>
+ return %pad : tensor<4x16xf32>
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["tensor.pad"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ transform.structured.vectorize %0 : !transform.any_op
+ transform.yield
+ }
+}
diff --git a/mlir/test/Dialect/Linalg/vectorization.mlir b/mlir/test/Dialect/Linalg/vectorization.mlir
index bbeccc7fecd68..e5e70ca40f300 100644
--- a/mlir/test/Dialect/Linalg/vectorization.mlir
+++ b/mlir/test/Dialect/Linalg/vectorization.mlir
@@ -1055,3 +1055,35 @@ func.func @test_vectorize_unpack_no_vector_sizes_permute(%source: tensor<4x7x4xf
transform.yield
}
}
+
+ // -----
+
+// CHECK-LABEL: test_vectorize_pad_no_vector_sizes
+func.func @test_vectorize_pad_no_vector_sizes(%arg0: tensor<63x63xf32>) -> tensor<64x64xf32> {
+ %f0 = arith.constant 0.0 : f32
+ %pad = tensor.pad %arg0 low[0, 0] high[1, 1] {
+ ^bb0(%arg1: index, %arg2: index):
+ tensor.yield %f0 : f32
+ } : tensor<63x63xf32> to tensor<64x64xf32>
+ return %pad : tensor<64x64xf32>
+}
+// CHECK-DAG: %[[cst:.*]] = arith.constant 0.000000e+00 : f32
+// CHECK-DAG: %[[c0:.*]] = arith.constant 0 : index
+// CHECK-DAG: %[[c63:.*]] = arith.constant 63 : index
+// CHECK-DAG: %[[c63_0:.*]] = arith.constant 63 : index
+// CHECK: %[[mask:.*]] = vector.create_mask %[[c63]], %[[c63_0]] : vector<64x64xi1>
+// CHECK: %[[read:.*]] = vector.mask %0 { vector.transfer_read {{.*}}, %cst {in_bounds = [true, true]}
+// CHECK-SAME : tensor<63x63xf32>, vector<64x64xf32> } : vector<64x64xi1> -> vector<64x64xf32>
+// CHECK: %[[empty:.*]] = tensor.empty() : tensor<64x64xf32>
+// CHECK: %[[c0_1:.*]] = arith.constant 0 : index
+// CHECK: %[[result:.*]] = vector.transfer_write %[[read]], {{.*}} {in_bounds = [true, true]} :
+// CHECK-SAME : vector<64x64xf32>, tensor<64x64xf32>
+// CHECK: return %[[result:.*]] : tensor<64x64xf32>
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["tensor.pad"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ transform.structured.vectorize %0 : !transform.any_op
+ transform.yield
+ }
+}
// CHECK: %[[mask:.*]] = vector.create_mask %[[c63]], %[[c63_0]] : vector<64x64xi1>
// CHECK: %[[read:.*]] = vector.mask %0 { vector.transfer_read {{.*}}, %cst {in_bounds = [true, true]}
The approach here should use `in_bounds` set to false when input vector sizes are not provided; masking should only be used when input vector sizes are provided. Eventually we may get rid of `in_bounds`, at which point we would have to emulate its behavior with a masked operation.
Hi~ Thanks for the advice~. In order to get a better implementation, I have a question here.
First, I don't see any essential difference between inferring the vector sizes from the result shape and setting the vector sizes directly: both ultimately provide the vector sizes.
Second, I tried to figure out why we prefer `in_bounds` over `mask`, and I may have found some reasons. For `transfer_read` or `transfer_write` operations, when we lower them to LLVM (code link), we find that mask operations are not supported perfectly yet. Sometimes we need a lowering pattern that can lower a masked read to a plain read.
But when the input vector sizes are provided, why can we use mask operations?
A more precise question: under what conditions can we safely use `vector.mask` on an operation when writing code?
Thanks~
> First I don't see any essential difference between inferring the vectorize size from the result shape and setting the vectorize size directly. They both provide vector size finally.

How do you infer sizes when the input shapes are dynamic?

> I try to figure out why we prefer to use `in_bounds` other than `mask`, I may find some reasons here.

There's a lot of discussion about the semantics of `in_bounds` ATM. I am trying to clarify some of it here:

> But when the input vector sizes are provided why we can use mask operation?

I do believe that masks and `in_bounds` model slightly different things, but the distinction is not well documented ATM.
This is lacking motivation. Why not use `GenericPadOpVectorizationPattern` and lower to `linalg.fill` instead? That's already available upstream.
This PR is about the method discussed here and here. This PR is just a fix for that; no other motivation.
Maybe it should be: in the empty input sizes case, the shapes must be static: https://github.com/BRUCE11111/llvm-project/blob/0653099322a7e96a693c6739cf9b640a4c1fca19/mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp#L1926C3-L1931C4
Great work. I will read it carefully.
I think so. But I still can't understand why, in the current method, the `tensor.pad` operation also needs to support an empty `inputVectorSizes` to do the vectorization.