[mlir][vector] Update `castAwayContractionLeadingOneDim` to omit transposes solely on leading unit dims. #85694

KoolJBlack · 2024-03-18T20:31:30Z

Updates castAwayContractionLeadingOneDim to check for leading unit dimensions before inserting vector.transpose ops.

Currently, castAwayContractionLeadingOneDim removes all leading unit dims based on the accumulator and transposes any subsequent operands to match the accumulator indexing. It does not check whether the transpose is strictly necessary. For instance, given this vector-matrix contract:

  %result = vector.contract {indexing_maps = [affine_map<(d0, d1, d2, d3) -> (d0, d1, d3)>, affine_map<(d0, d1, d2, d3) -> (d0, d2, d3)>, affine_map<(d0, d1, d2, d3) -> (d1, d2)>], iterator_types = ["parallel", "parallel", "parallel", "reduction"], kind = #vector.kind<add>} %lhs, %rhs, %acc : vector<1x1x8xi32>, vector<1x8x8xi32> into vector<1x8xi32>

Passing this through the castAwayContractionLeadingOneDim pattern produces the following:

    %0 = vector.transpose %arg0, [1, 0, 2] : vector<1x1x8xi32> to vector<1x1x8xi32>
    %1 = vector.extract %0[0] : vector<1x8xi32> from vector<1x1x8xi32>
    %2 = vector.extract %arg2[0] : vector<8xi32> from vector<1x8xi32>
    %3 = vector.contract {indexing_maps = [affine_map<(d0, d1, d2) -> (d0, d2)>, affine_map<(d0, d1, d2) -> (d0, d1, d2)>, affine_map<(d0, d1, d2) -> (d1)>], iterator_types = ["parallel", "parallel", "reduction"], kind = #vector.kind<add>} %1, %arg1, %2 : vector<1x8xi32>, vector<1x8x8xi32> into vector<8xi32>
    %4 = vector.broadcast %3 : vector<8xi32> to vector<1x8xi32>

The introduced vector.transpose does not affect the underlying data layout (it is effectively a no-op), but it cannot be folded automatically. This change avoids inserting transposes when only leading unit dimensions are involved.
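To illustrate why such a transpose is a no-op, here is a minimal standalone sketch (plain C++, no MLIR dependencies) that brute-force checks whether a permutation keeps every element at the same row-major offset. The helper names `linearOffset` and `transposePreservesLayout` are hypothetical and not part of this patch. The sketch assumes the convention that result dim `i` takes source dim `perm[i]`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Row-major linear offset of `index` within a vector of the given `shape`.
int64_t linearOffset(const std::vector<int64_t> &shape,
                     const std::vector<int64_t> &index) {
  int64_t offset = 0;
  for (size_t d = 0; d < shape.size(); ++d)
    offset = offset * shape[d] + index[d];
  return offset;
}

// Hypothetical helper (not from the patch): returns true if transposing a
// vector of `shape` by `perm` leaves every element at the same row-major
// offset. Assumes result dim i takes source dim perm[i].
bool transposePreservesLayout(const std::vector<int64_t> &shape,
                              const std::vector<int64_t> &perm) {
  const size_t rank = shape.size();

  // Shape of the transposed vector.
  std::vector<int64_t> resultShape(rank);
  for (size_t i = 0; i < rank; ++i)
    resultShape[i] = shape[perm[i]];

  int64_t numElements = 1;
  for (int64_t s : shape)
    numElements *= s;

  std::vector<int64_t> srcIndex(rank, 0), dstIndex(rank, 0);
  for (int64_t n = 0; n < numElements; ++n) {
    // Delinearize the source offset n into a multi-dimensional index.
    int64_t rem = n;
    for (size_t d = rank; d-- > 0;) {
      srcIndex[d] = rem % shape[d];
      rem /= shape[d];
    }
    // Position of the same element in the transposed vector.
    for (size_t i = 0; i < rank; ++i)
      dstIndex[i] = srcIndex[perm[i]];
    if (linearOffset(resultShape, dstIndex) != n)
      return false;
  }
  return true;
}
```

Under these assumptions, `transposePreservesLayout({1, 1, 8}, {1, 0, 2})` returns true for the `%lhs` operand above, while a genuine transpose such as `transposePreservesLayout({2, 8}, {1, 0})` returns false.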

Fixes #85691

llvmbot · 2024-03-19T16:47:11Z

@llvm/pr-subscribers-mlir-vector

@llvm/pr-subscribers-mlir

Author: Kojo Acquah (KoolJBlack)

Changes

Fixes #85691

Full diff: https://github.com/llvm/llvm-project/pull/85694.diff

2 Files Affected:

(modified) mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp (+17-2)
(modified) mlir/test/Dialect/Vector/vector-dropleadunitdim-transforms.mlir (+22)

diff --git a/mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp b/mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp
index 74382b027c2f48..6b69f5f1932ad7 100644
--- a/mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp
+++ b/mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp
@@ -399,13 +399,28 @@ mlir::vector::castAwayContractionLeadingOneDim(vector::ContractionOp contractOp,
           transposeResults.push_back(targetExpr);
         }
       }
+
+      // Check if the transpose effects outer unit dims only. Such transposes do
+      // not materially effect the underlying vector and can be omitted.
+      bool tranposeNonOuterUnitDims = false;
+      for (int64_t i = 0; i < (int64_t)perm.size(); ++i) {
+        if (perm[i] != i && i != (int64_t)perm.size() - 1) {
+          if (operands[it.index()].getType().cast<ShapedType>().getDimSize(i) !=
+              1) {
+            tranposeNonOuterUnitDims = true;
+          }
+        }
+      }
+
       // Do the tranpose now if needed so that we can drop the
       // correct dim using extract later.
       if (tranposeNeeded) {
         map = AffineMap::get(map.getNumDims(), 0, transposeResults,
                              contractOp.getContext());
-        operands[it.index()] = rewriter.create<vector::TransposeOp>(
-            contractOp.getLoc(), operands[it.index()], perm);
+        if (tranposeNonOuterUnitDims) {
+          operands[it.index()] = rewriter.createOrFold<vector::TransposeOp>(
+              contractOp.getLoc(), operands[it.index()], perm);
+        }
       }
     }
     // We have taken care to have the dim to be dropped be
diff --git a/mlir/test/Dialect/Vector/vector-dropleadunitdim-transforms.mlir b/mlir/test/Dialect/Vector/vector-dropleadunitdim-transforms.mlir
index af6e636245b04e..31b0867c851f58 100644
--- a/mlir/test/Dialect/Vector/vector-dropleadunitdim-transforms.mlir
+++ b/mlir/test/Dialect/Vector/vector-dropleadunitdim-transforms.mlir
@@ -166,6 +166,28 @@ func.func @cast_away_contraction_leading_one_dims_nonleadingunitdim_rank4_acctra
 
 // -----
 
+// CHECK-DAG: #[[$MAP_0:.+]] = affine_map<(d0, d1, d2) -> (d0, d2)>
+// CHECK-DAG: #[[$MAP_1:.+]] = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
+// CHECK-DAG: #[[$MAP_2:.+]] = affine_map<(d0, d1, d2) -> (d1)>
+
+// CHECK-LABEL:   func.func @cast_away_contraction_leading_one_dims_vec_mat(
+// CHECK-SAME:                                 %[[VAL_0:.*]]: vector<1x1x8xi32>,
+// CHECK-SAME:                                 %[[VAL_1:.*]]: vector<1x8x8xi32>,
+// CHECK-SAME:                                 %[[VAL_2:.*]]: vector<1x8xi32>) -> vector<1x8xi32> {
+// CHECK:           %[[VAL_3:.*]] = vector.extract %[[VAL_0]][0] : vector<1x8xi32> from vector<1x1x8xi32>
+// CHECK:           %[[VAL_4:.*]] = vector.extract %[[VAL_2]][0] : vector<8xi32> from vector<1x8xi32>
+// CHECK:           %[[VAL_5:.*]] = vector.contract {indexing_maps = [#[[$MAP_0]], #[[$MAP_1]], #[[$MAP_2]]], iterator_types = ["parallel", "parallel", "reduction"], kind = #vector.kind<add>} %[[VAL_3]], %[[VAL_1]], %[[VAL_4]] : vector<1x8xi32>, vector<1x8x8xi32> into vector<8xi32>
+// CHECK:           %[[VAL_6:.*]] = vector.broadcast %[[VAL_5]] : vector<8xi32> to vector<1x8xi32>
+// CHECK:           return %[[VAL_6]] : vector<1x8xi32>
+// CHECK:         }
+func.func @cast_away_contraction_leading_one_dims_vec_mat(%lhs: vector<1x1x8xi32>,
+                          %rhs: vector<1x8x8xi32>,
+                          %acc: vector<1x8xi32>) -> vector<1x8xi32> {
+  %result = vector.contract {indexing_maps = [affine_map<(d0, d1, d2, d3) -> (d0, d1, d3)>, affine_map<(d0, d1, d2, d3) -> (d0, d2, d3)>, affine_map<(d0, d1, d2, d3) -> (d1, d2)>], iterator_types = ["parallel", "parallel", "parallel", "reduction"], kind = #vector.kind<add>} %lhs, %rhs, %acc : vector<1x1x8xi32>, vector<1x8x8xi32> into vector<1x8xi32>
+  return %result : vector<1x8xi32>
+}
+
+// -----
 // CHECK-DAG: #[[MAP0:.*]] = affine_map<(d0, d1, d2, d3) -> (d0, d1, d3)>
 // CHECK-DAG: #[[MAP1:.*]] = affine_map<(d0, d1, d2, d3) -> (d0, d3, d2)>
 // CHECK-DAG: #[[MAP2:.*]] = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2)>

banach-space

Sorry, forgot to submit this earlier :(

Similar comments to what @hanhanW has posted. I would also ask for some justification in the commit summary (see https://mlir.llvm.org/getting_started/Contributing/#commit-messages). A reference to the IREE issue is very helpful, but the commit summary should be self-contained - could you add a brief overview?

banach-space

Thanks for the updates - a few more comments inline.

hanhanW · 2024-03-25T23:08:25Z

mlir/lib/Dialect/Vector/Transforms/VectorDropLeadUnitDim.cpp

+      // Checks if only the outer, unit dimensions (of size 1) are permuted.
+      // Such transposes do not materially effect the underlying vector and can
+      // be omitted. EG: perm [1, 0, 2] applied to vector<1x1x8xi32>
+      bool tranposeNonOuterUnitDims = false;
+      for (auto [index, dim] :


Thanks for the example, I now understand why we need this. Instead of adding the ad-hoc logic here, would it make sense to add a canonicalization pattern to the vector.transpose op?

I considered this. I wasn't certain if it was 100% safe to remove transposes like this in every scenario (for instance, if there is some pass that creates transposes like this in one pattern and consumes them in a subsequent pattern).

On the flip side, since this was an acute issue created in this pass, it is pretty straightforward to handle these transposes here, given an understanding of the entire pass.

I would be in favor of using this modification for now and can look into a transpose canonicalizer in a follow-up, unless you feel differently?

The follow-up SGTM as well. Having that canonicalization pattern makes sense, but given that we already have logic to decide whether a transpose should be generated, it would make sense to extend that logic to support this extra case (and hopefully avoid a canonicalization pass just for this).

Going to commit this for now and start work on a broader transpose fold.
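As a rough sketch of the condition a broader transpose fold might check, the standalone C++ below tests whether a permutation only moves unit dims, meaning the non-unit dims keep their relative order. This is an assumption about what such a fold could look like, not the upstream implementation. The name `permutesOnlyUnitDims` is hypothetical, and the sketch assumes result dim `i` takes source dim `perm[i]`.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper (not an MLIR API): returns true if `perm` only moves
// unit dims of `shape`, i.e. the non-unit source dims appear in the result
// in their original relative order. Assumes result dim i takes source dim
// perm[i].
bool permutesOnlyUnitDims(const std::vector<int64_t> &shape,
                          const std::vector<int64_t> &perm) {
  int64_t lastNonUnitSrcDim = -1;
  for (int64_t srcDim : perm) {
    // Unit dims can move freely without changing the data order.
    if (shape[srcDim] == 1)
      continue;
    // A non-unit dim that jumps ahead of an earlier non-unit dim reorders
    // elements, so the transpose is a real data movement.
    if (srcDim < lastNonUnitSrcDim)
      return false;
    lastNonUnitSrcDim = srcDim;
  }
  return true;
}
```

For example, `permutesOnlyUnitDims({1, 1, 8}, {1, 0, 2})` returns true and `permutesOnlyUnitDims({1, 8, 8}, {0, 2, 1})` returns false. Note that the predicate also accepts cases such as shape `{1, 2, 8}` with perm `{1, 0, 2}`, where the result type differs from the source type. A fold there would presumably need to rewrite the transpose into a shape-changing op rather than drop it outright.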

KoolJBlack changed the title ~~Update castAwayContractionLeadingOneDim to omit transposes solely on leading unit dims.~~ [mlir][vector] Update castAwayContractionLeadingOneDim to omit transposes solely on leading unit dims. Mar 18, 2024

KoolJBlack requested a review from dcaballe March 19, 2024 16:46

KoolJBlack marked this pull request as ready for review March 19, 2024 16:46

KoolJBlack requested review from hanhanW and nicolasvasilache as code owners March 19, 2024 16:46

llvmbot added the mlir:vectorops, mlir, and mlir:vector labels Mar 19, 2024

KoolJBlack force-pushed the vector_drop_unit_transpose branch from b1ff4b0 to a561286 Compare March 19, 2024 17:04

dcaballe requested a review from banach-space March 19, 2024 17:12

hanhanW reviewed Mar 19, 2024

banach-space reviewed Mar 19, 2024

KoolJBlack force-pushed the vector_drop_unit_transpose branch from a561286 to 163ea73 Compare March 21, 2024 22:04

KoolJBlack requested review from banach-space and hanhanW March 21, 2024 22:04

dcaballe approved these changes Mar 22, 2024

banach-space reviewed Mar 25, 2024

hanhanW reviewed Mar 25, 2024

KoolJBlack force-pushed the vector_drop_unit_transpose branch 2 times, most recently from f40efe1 to bfd2fc4 Compare April 3, 2024 21:28

KoolJBlack requested review from dcaballe, hanhanW and banach-space April 3, 2024 21:52

KoolJBlack force-pushed the vector_drop_unit_transpose branch from bfd2fc4 to 55af0b0 Compare April 3, 2024 23:21

KoolJBlack added 2 commits April 3, 2024 23:26

only tranpose non leading unit dims

7975d0b

review comments

d36279f

KoolJBlack force-pushed the vector_drop_unit_transpose branch from 55af0b0 to d36279f Compare April 3, 2024 23:26

KoolJBlack merged commit 66fed33 into llvm:main Apr 3, 2024

KoolJBlack deleted the vector_drop_unit_transpose branch April 3, 2024 23:29
