[MLIR] Add pattern to bubble up tensor.extract_slice #126898
@llvm/pr-subscribers-mlir-linalg @llvm/pr-subscribers-mlir

Author: ofri frishman (ofri-frishman)

Changes: Add a pattern that bubbles up tensor.extract_slice through tensor.expand_shape.

Full diff: https://github.com/llvm/llvm-project/pull/126898.diff

5 Files Affected:
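In IR terms, the rewrite looks roughly like the following sketch (modeled on the doc comment in the new file; the concrete shapes and offsets here are illustrative, not taken from the patch):

```mlir
// Before: the slice is taken from the expanded tensor.
%expand = tensor.expand_shape %in [[0, 1, 2]] output_shape [2, 3, 10]
    : tensor<60xf32> into tensor<2x3x10xf32>
%slice = tensor.extract_slice %expand[0, 1, 0] [1, 1, 5] [1, 1, 1]
    : tensor<2x3x10xf32> to tensor<1x1x5xf32>

// After: the slice is taken directly from the source, with the offset
// linearized as (0 * 3 + 1) * 10 + 0 = 10, and then re-expanded.
%slice2 = tensor.extract_slice %in[10] [5] [1]
    : tensor<60xf32> to tensor<5xf32>
%expand2 = tensor.expand_shape %slice2 [[0, 1, 2]] output_shape [1, 1, 5]
    : tensor<5xf32> into tensor<1x1x5xf32>
```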
diff --git a/mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h b/mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
index ae695e0326ca1..dc4558a605a59 100644
--- a/mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
+++ b/mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
@@ -58,6 +58,12 @@ void populateFoldTensorSubsetIntoVectorTransferPatterns(
void populateMergeConsecutiveInsertExtractSlicePatterns(
RewritePatternSet &patterns);
+/// Appends patterns that are used to bubble up tensor.extract_slice op above
+/// its producer. When used as cleanup patterns of tile and fuse, enables fusing
+/// the producer with the consumer even if the producer does not implement the
+/// tiling interface.
+void populateBubbleUpExtractSliceOpPatterns(RewritePatternSet &patterns);
+
/// Populates `patterns` with patterns that drop redundant tensor.insert_slice
/// rank expansions.
void populateDropRedundantInsertSliceRankExpansionPatterns(
diff --git a/mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp b/mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
index 51d1df52598c7..5146bebe0108e 100644
--- a/mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
+++ b/mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp
@@ -582,6 +582,7 @@ transform::FuseOp::apply(transform::TransformRewriter &rewriter,
RewritePatternSet patterns(context);
tensor::ExtractSliceOp::getCanonicalizationPatterns(patterns, context);
tensor::populateMergeConsecutiveInsertExtractSlicePatterns(patterns);
+ tensor::populateBubbleUpExtractSliceOpPatterns(patterns);
tileAndFuseOptions.cleanupPatterns = std::move(patterns);
}
diff --git a/mlir/lib/Dialect/Tensor/Transforms/BubbleUpExtractSlice.cpp b/mlir/lib/Dialect/Tensor/Transforms/BubbleUpExtractSlice.cpp
new file mode 100644
index 0000000000000..a0d3c6d25bbe8
--- /dev/null
+++ b/mlir/lib/Dialect/Tensor/Transforms/BubbleUpExtractSlice.cpp
@@ -0,0 +1,207 @@
+//===- BubbleUpExtractSlice.cpp ---------------------===//
+//
+// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+// See https://llvm.org/LICENSE.txt for license information.
+// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+//
+//===----------------------------------------------------------------------===//
+//
+// Swap a `tensor.extract_slice` with the producer of the source in some cases
+// where that is valid. When used as cleanup patterns of tile and fuse, enables
+// fusing the producer with the consumer even if the producer does not implement
+// the tiling interface.
+//
+//===----------------------------------------------------------------------===//
+
+#include "mlir/Dialect/Affine/IR/AffineOps.h"
+#include "mlir/Dialect/Arith/Utils/Utils.h"
+#include "mlir/Dialect/Tensor/Transforms/Transforms.h"
+#include "mlir/Dialect/Tensor/Utils/Utils.h"
+#include "mlir/IR/BuiltinTypes.h"
+#include "mlir/IR/OpDefinition.h"
+#include "mlir/IR/PatternMatch.h"
+#include "mlir/Interfaces/ValueBoundsOpInterface.h"
+
+using namespace mlir;
+using namespace mlir::tensor;
+
+/// Converts `tensor.extract_slice(tensor.expand_shape)` to
+/// `tensor.expand_shape(tensor.extract_slice)`.
+/// For this transformation to be possible, the slice must be fully contiguous
+/// within each reassociation group of the expand_shape. If the transformation
+/// is not possible, or if the slice is rank reducing, the function returns
+/// failure.
+///
+/// Example:
+/// ```
+/// %reshape = tensor.expand_shape %in [[0, 1], [2, 3], [4, 5, 6]]
+/// tensor<8x16x32xf32> to tensor<2x4x2x8x4x2x4xf32>
+/// %slice = tensor.extract_slice %reshape ...
+/// tensor<2x4x2x8x4x2x4xf32> to tensor<2x4x1x5x1x1x4xf32>
+///
+/// // The transformation is possible because each reassociation group has a
+/// // contiguous slice. (i.e., [2x4->2x4], [2x8->1x5], [4x2x4->1x1x4])
+/// // After the transformation:
+///
+/// %slice = tensor.extract_slice %in ...
+/// tensor<8x16x32xf32> to tensor<8x5x4xf32>
+/// %reshape = tensor.expand_shape %slice [[0, 1], [2, 3], [4, 5, 6]]
+/// tensor<8x5x4xf32> to tensor<2x4x1x5x1x1x4xf32>
+/// ```
+static LogicalResult
+swapExpandShapeWithSlice(RewriterBase &rewriter,
+ tensor::ExpandShapeOp expandShapeOp,
+ tensor::ExtractSliceOp sliceOp) {
+ SmallVector<OpFoldResult> offsets = sliceOp.getMixedOffsets();
+ SmallVector<OpFoldResult> sizes = sliceOp.getMixedSizes();
+
+ if (static_cast<size_t>(sliceOp.getResultType().getRank()) != sizes.size()) {
+ return rewriter.notifyMatchFailure(sliceOp,
+ "unimplemented: rank reducing slice");
+ }
+
+ // Helper variables and function for accumulating the new offset and length
+ // values.
+ Location loc = expandShapeOp->getLoc();
+ AffineExpr d0, d1, d2;
+ bindDims(rewriter.getContext(), d0, d1, d2);
+ // Multiply two integers.
+ auto mul = [&](OpFoldResult v1, OpFoldResult v2) {
+ auto mulMap = AffineMap::get(2, 0, {d0 * d1});
+ return affine::makeComposedFoldedAffineApply(rewriter, loc, mulMap,
+ {v1, v2});
+ };
+
+ SmallVector<OpFoldResult> outputShape =
+ getMixedValues(expandShapeOp.getStaticOutputShape(),
+ expandShapeOp.getOutputShape(), rewriter);
+
+ auto isZeroOffsetAndFullSize = [](OpFoldResult offset, OpFoldResult sliceSize,
+ OpFoldResult size) {
+ if (!isConstantIntValue(offset, 0))
+ return false;
+ FailureOr<bool> maybeEqual =
+ ValueBoundsConstraintSet::areEqual(sliceSize, size);
+ return llvm::succeeded(maybeEqual) && maybeEqual.value();
+ };
+
+ // First verify that this is a full slice of the expanded tensor.
+ for (const ReassociationIndices &indices :
+ expandShapeOp.getReassociationIndices()) {
+ int64_t i = 0;
+ int64_t e = indices.size();
+ // Find the first expanded dim after the first dim with non-unit extracted
+ // size.
+ for (; i < e; ++i) {
+ if (!isConstantIntValue(sizes[indices[i]], 1)) {
+ // +1 to skip the first non-unit size dim.
+ i++;
+ break;
+ }
+ }
+
+ // Verify that all subsequent dimensions extract the full size of the
+ // source tensor.
+ for (; i < e; ++i) {
+ int64_t expandedDim = indices[i];
+ if (!isZeroOffsetAndFullSize(offsets[expandedDim], sizes[expandedDim],
+ outputShape[expandedDim])) {
+ return rewriter.notifyMatchFailure(
+ sliceOp, "Not a contiguous slice of the expanded tensor.");
+ }
+ }
+ }
+
+ // Compute new offsets, lengths, and strides.
+ SmallVector<OpFoldResult> newOffsets, newLengths, newStrides;
+ for (const ReassociationIndices &indices :
+ expandShapeOp.getReassociationIndices()) {
+ OpFoldResult newSize = rewriter.getIndexAttr(1);
+ SmallVector<OpFoldResult> basis, delinOffsets;
+
+ int64_t i = 0;
+ int64_t e = indices.size();
+ // Offset = cumulative product of leading unit extracted dims.
+ for (; i < e; ++i) {
+ int64_t expandedDim = indices[i];
+ if (!isConstantIntValue(sizes[expandedDim], 1))
+ break;
+
+ basis.push_back(outputShape[expandedDim]);
+ delinOffsets.push_back(offsets[expandedDim]);
+ }
+
+ if (i != e) {
+ int64_t expandedDim = indices[i];
+ basis.push_back(outputShape[expandedDim]);
+ delinOffsets.push_back(offsets[expandedDim]);
+ newSize = sizes[expandedDim];
+ i++;
+ }
+
+ for (; i < e; ++i) {
+ OpFoldResult fullSize = outputShape[indices[i]];
+ basis.push_back(fullSize);
+ delinOffsets.push_back(rewriter.getIndexAttr(0));
+ newSize = mul(newSize, fullSize);
+ }
+ SmallVector<Value> offsetVals =
+ llvm::map_to_vector(delinOffsets, [&](OpFoldResult ofr) {
+ return getValueOrCreateConstantIndexOp(rewriter, loc, ofr);
+ });
+ OpFoldResult newOffset = rewriter
+ .create<affine::AffineLinearizeIndexOp>(
+ loc, offsetVals, basis, /*disjoint=*/true)
+ .getResult();
+ newOffsets.push_back(newOffset);
+ newLengths.push_back(newSize);
+
+ // Only unit stride supported.
+ newStrides.push_back(rewriter.getIndexAttr(1));
+ }
+
+ // The shape of the result can be obtained from the sizes passed in.
+ SmallVector<Value> dynDims;
+ SmallVector<int64_t> shape;
+ dispatchIndexOpFoldResults(sizes, dynDims, shape);
+ RankedTensorType resultType = RankedTensorType::get(
+ shape, expandShapeOp.getResultType().getElementType());
+
+ // Create a new ExtractSliceOp and ExpandShapeOp.
+ Value newSliceOp = rewriter.create<tensor::ExtractSliceOp>(
+ loc, expandShapeOp.getSrc(), newOffsets, newLengths, newStrides);
+ auto newExpandShapeOp = rewriter.create<tensor::ExpandShapeOp>(
+ loc, resultType, newSliceOp, expandShapeOp.getReassociationIndices(),
+ sizes);
+ rewriter.replaceOp(sliceOp, newExpandShapeOp);
+ return success();
+}
+
+namespace {
+
+struct SwapExpandShapeWithSlicePattern
+ : public OpRewritePattern<tensor::ExtractSliceOp> {
+ using OpRewritePattern<tensor::ExtractSliceOp>::OpRewritePattern;
+
+ LogicalResult matchAndRewrite(tensor::ExtractSliceOp sliceOp,
+ PatternRewriter &rewriter) const override {
+ auto expandOp = sliceOp.getSource().getDefiningOp<tensor::ExpandShapeOp>();
+ if (!expandOp) {
+ return failure();
+ }
+
+ if (!sliceOp.hasUnitStride()) {
+ return rewriter.notifyMatchFailure(sliceOp,
+ "unsupported: non-unit stride");
+ }
+
+ return swapExpandShapeWithSlice(rewriter, expandOp, sliceOp);
+ }
+};
+
+} // namespace
+
+void mlir::tensor::populateBubbleUpExtractSliceOpPatterns(
+ RewritePatternSet &patterns) {
+ patterns.add<SwapExpandShapeWithSlicePattern>(patterns.getContext());
+}
diff --git a/mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt b/mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt
index cc6275fee671a..634cc93a08352 100644
--- a/mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt
+++ b/mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt
@@ -11,6 +11,7 @@ add_mlir_dialect_library(MLIRTensorTransforms
RewriteAsConstant.cpp
SwapExtractSliceWithProducerPatterns.cpp
SubsetInsertionOpInterfaceImpl.cpp
+ BubbleUpExtractSlice.cpp
ADDITIONAL_HEADER_DIRS
${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Tensor/Transforms
diff --git a/mlir/test/Dialect/Linalg/transform-op-fuse.mlir b/mlir/test/Dialect/Linalg/transform-op-fuse.mlir
index ac1ca9319d335..22796611c5934 100644
--- a/mlir/test/Dialect/Linalg/transform-op-fuse.mlir
+++ b/mlir/test/Dialect/Linalg/transform-op-fuse.mlir
@@ -278,3 +278,141 @@ module attributes {transform.with_named_sequence} {
transform.yield
}
}
+
+// -----
+
+// CHECK-LABEL: func.func @swap_expand_shape_with_extract_slice
+// CHECK: scf.for %[[X:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[Y:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[Z:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: %[[LINEAR_IDX:.+]] = affine.linearize_index disjoint [%[[X]], %[[Y]], %[[Z]]] by (2, 3, 10)
+// CHECK: %[[SLICE:.+]] = tensor.extract_slice %{{.*}}[%[[LINEAR_IDX]]] [5] [1] : tensor<60xf32> to tensor<5xf32>
+// CHECK: %[[EXPAND:.+]] = tensor.expand_shape %[[SLICE]] {{\[\[}}0, 1, 2]] output_shape [1, 1, 5]
+// CHECK: linalg.exp ins(%[[EXPAND]]
+func.func @swap_expand_shape_with_extract_slice(%0: tensor<60xf32>) -> tensor<2x3x10xf32> {
+ %expand = tensor.expand_shape %0 [[0, 1, 2]] output_shape [2, 3, 10] : tensor<60xf32> into tensor<2x3x10xf32>
+ %empty = tensor.empty() : tensor<2x3x10xf32>
+ %exp = linalg.exp ins(%expand : tensor<2x3x10xf32>) outs(%empty : tensor<2x3x10xf32>) -> tensor<2x3x10xf32>
+ return %exp : tensor<2x3x10xf32>
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["linalg.exp"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ %transformed, %loops:3 = transform.structured.fuse %0 [1, 1, 5] interchange [0, 1, 2] apply_cleanup = true :
+ (!transform.any_op) -> (!transform.any_op, !transform.op<"scf.for">, !transform.any_op, !transform.any_op)
+ transform.yield
+ }
+}
+
+// -----
+
+// CHECK-LABEL: func.func @swap_expand_shape_with_extract_slice_full_inner_dim
+// CHECK: scf.for %[[X:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[Y:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: %[[LINEAR_IDX:.+]] = affine.linearize_index disjoint [%[[X]], %[[Y]]{{.*}} by (3, 4, 10)
+// CHECK: %[[SLICE:.+]] = tensor.extract_slice %{{.*}}[%[[LINEAR_IDX]]] [20] [1] : tensor<120xf32> to tensor<20xf32>
+// CHECK: %[[EXPAND:.+]] = tensor.expand_shape %[[SLICE]] {{\[\[}}0, 1, 2]] output_shape [1, 2, 10]
+// CHECK: linalg.exp ins(%[[EXPAND]]
+func.func @swap_expand_shape_with_extract_slice_full_inner_dim(%0: tensor<120xf32>) -> tensor<3x4x10xf32> {
+ %expand = tensor.expand_shape %0 [[0, 1, 2]] output_shape [3, 4, 10] : tensor<120xf32> into tensor<3x4x10xf32>
+ %empty = tensor.empty() : tensor<3x4x10xf32>
+ %exp = linalg.exp ins(%expand : tensor<3x4x10xf32>) outs(%empty : tensor<3x4x10xf32>) -> tensor<3x4x10xf32>
+ return %exp : tensor<3x4x10xf32>
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["linalg.exp"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ %transformed, %loops:2 = transform.structured.fuse %0 [1, 2, 0] interchange [0, 1, 2] apply_cleanup = true :
+ (!transform.any_op) -> (!transform.any_op, !transform.op<"scf.for">, !transform.any_op)
+ transform.yield
+ }
+}
+
+// -----
+
+// CHECK-LABEL: func.func @swap_expand_shape_with_extract_slice_full_inner_dim
+// CHECK: tensor.expand_shape
+// CHECK: scf.for
+// CHECK: scf.for
+// CHECK: scf.for
+// CHECK: linalg.exp
+func.func @swap_expand_shape_with_extract_slice_full_inner_dim(%0: tensor<120xf32>) -> tensor<3x4x10xf32> {
+ %expand = tensor.expand_shape %0 [[0, 1, 2]] output_shape [3, 4, 10] : tensor<120xf32> into tensor<3x4x10xf32>
+ %empty = tensor.empty() : tensor<3x4x10xf32>
+ %exp = linalg.exp ins(%expand : tensor<3x4x10xf32>) outs(%empty : tensor<3x4x10xf32>) -> tensor<3x4x10xf32>
+ return %exp : tensor<3x4x10xf32>
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["linalg.exp"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ %transformed, %loops:3 = transform.structured.fuse %0 [1, 2, 5] interchange [0, 1, 2] apply_cleanup = true :
+ (!transform.any_op) -> (!transform.any_op, !transform.op<"scf.for">, !transform.any_op, !transform.any_op)
+ transform.yield
+ }
+}
+
+// -----
+
+// CHECK-LABEL: func.func @swap_expand_shape_with_extract_slice_multiple_expanded_dims
+// CHECK: %[[C0:.+]] = arith.constant 0 : index
+// CHECK: scf.for %[[X:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[Y:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[Z:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: scf.for %[[W:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: %[[LINEAR_IDX0:.+]] = affine.linearize_index disjoint [%[[X]], %[[Y]], %[[C0]]] by (3, 4, 10)
+// CHECK: %[[LINEAR_IDX1:.+]] = affine.linearize_index disjoint [%[[Z]], %[[W]]] by (7, 8)
+// CHECK: %[[SLICE:.+]] = tensor.extract_slice %{{.*}}[%[[LINEAR_IDX0]], %[[LINEAR_IDX1]]] [20, 4] [1, 1] : tensor<120x56xf32> to tensor<20x4xf32>
+// CHECK: %[[EXPAND:.+]] = tensor.expand_shape %[[SLICE]] {{\[\[}}0, 1, 2], [3, 4]] output_shape [1, 2, 10, 1, 4]
+// CHECK: linalg.exp ins(%[[EXPAND]]
+module {
+ func.func @swap_expand_shape_with_extract_slice_multiple_expanded_dims(%0: tensor<120x56xf32>) -> tensor<3x4x10x7x8xf32> {
+ %expand = tensor.expand_shape %0 [[0, 1, 2], [3, 4]] output_shape [3, 4, 10, 7, 8] : tensor<120x56xf32> into tensor<3x4x10x7x8xf32>
+ %empty = tensor.empty() : tensor<3x4x10x7x8xf32>
+ %exp = linalg.exp ins(%expand : tensor<3x4x10x7x8xf32>) outs(%empty : tensor<3x4x10x7x8xf32>) -> tensor<3x4x10x7x8xf32>
+ return %exp : tensor<3x4x10x7x8xf32>
+ }
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["linalg.exp"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ %transformed, %loops:4 = transform.structured.fuse %0 [1, 2, 0, 1, 4] interchange [0, 1, 2, 3, 4] apply_cleanup = true :
+ (!transform.any_op) -> (!transform.any_op, !transform.op<"scf.for">, !transform.any_op, !transform.any_op, !transform.any_op)
+ transform.yield
+ }
+}
+
+// -----
+
+// CHECK: scf.for %[[X:[A-Za-z0-9]+]] = {{.*}}
+// CHECK: %[[LINEAR_IDX:.+]] = affine.linearize_index disjoint [%[[X]], {{.*}} by (8, 32)
+// CHECK: %[[SLICE:.+]] = tensor.extract_slice %{{.*}}[0, 0, %[[LINEAR_IDX]]] [1, 1800, 32] [1, 1, 1] : tensor<1x1800x256xf32> to tensor<1x1800x32xf32>
+// CHECK: %[[ABS:.+]] = linalg.abs ins(%[[SLICE]]
+// CHECK: %[[EXPAND:.+]] = tensor.expand_shape %[[ABS]] {{\[\[}}0], [1], [2, 3]] output_shape [1, 1800, 1, 32]
+// CHECK: linalg.exp ins(%[[EXPAND]]
+module {
+ func.func @swap_expand_shape_with_extract_slice_and_fuse_with_expand_producer(%0: tensor<1x1800x256xf32>) -> tensor<1x1800x8x32xf32> {
+ %empty1 = tensor.empty() : tensor<1x1800x256xf32>
+ %exp1 = linalg.abs ins(%0 : tensor<1x1800x256xf32>) outs(%empty1 : tensor<1x1800x256xf32>) -> tensor<1x1800x256xf32>
+ %expand = tensor.expand_shape %exp1 [[0], [1], [2, 3]] output_shape [1, 1800, 8, 32] : tensor<1x1800x256xf32> into tensor<1x1800x8x32xf32>
+ %empty2 = tensor.empty() : tensor<1x1800x8x32xf32>
+ %exp2 = linalg.exp ins(%expand : tensor<1x1800x8x32xf32>) outs(%empty2 : tensor<1x1800x8x32xf32>) -> tensor<1x1800x8x32xf32>
+ return %exp2 : tensor<1x1800x8x32xf32>
+ }
+}
+
+module attributes {transform.with_named_sequence} {
+ transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
+ %0 = transform.structured.match ops{["linalg.exp"]} in %arg0 : (!transform.any_op) -> !transform.any_op
+ %transformed, %loops:1 = transform.structured.fuse %0 [0, 0, 1, 0] interchange [0, 1, 2, 3] apply_cleanup = true :
+ (!transform.any_op) -> (!transform.any_op, !transform.op<"scf.for">)
+ transform.yield
+ }
+}
+
+
+
+
There are very similar patterns in mlir/lib/Dialect/Tensor/Transforms/ReshapePatterns.cpp. How are they related? Should the new pattern also live in that file?
I agree that the patterns in mlir/lib/Dialect/Tensor/Transforms/ReshapePatterns.cpp are similar. The pattern I added is meant to swap tensor.expand_shape and tensor.extract_slice, which, when used as a cleanup pattern for the tile-and-fuse utility, enables pulling the expand_shape into a loop nest even though it does not implement the tiling interface.
Ok, makes sense to me. Thanks for the analysis! Discoverability has been a big headache for me: there are many useful patterns, but it's really hard to find whether a certain pattern already exists.
Let me suggest a partial solution. We should be able to identify relevant transformations by simply "grep"-ing for "interesting" ops in the test directory. For this to work, we need to make sure that:
As a stepping stone, @ofri-frishman, please make sure that the new pattern is tested in isolation. As for the right location, there seem to be two good candidates:
This pattern feels like "bubbling up" to me, though "swap" might actually be more fitting. Just to avoid bike-shedding, I suggest "BubbleUpExtractSlice.cpp", but let's note (in comments) that this is basically a "swap". We can revisit later.
Just a general question: on GitHub I am currently listed as a contributor to the llvm project, so I cannot add reviewers to the PR and will not be able to land it once approved. Do you know who I can ask to give me the relevant permissions?
Hey, https://llvm.org/docs/DeveloperPolicy.html#obtaining-commit-access should have the relevant information.
Re: landing the PR. I don't know if the policy has changed, but typically for first-time contributors a reviewer can land the PR once it's approved/ready.
Thanks for contributing this! I wrote most of the code here, so it might be worth getting final approval from someone else. +1 to adding a testing op separate from transform.structured.fuse.
I was actually wondering how best to go about that and wasn't aware of the option to add test ops; I was thinking about adding a test pass. Do you think the best way to have a test that isolates the pattern is via a test op (I guess you mean a transform op)? If so, any chance you could point me to an example of such an op?
I amended the commit and added a separate transform op to apply the pattern directly, plus tests that isolate the pattern.
Gentle reminder @banach-space @qedawkins
Nice, thanks for the updates! And for the ping - this had slipped off my radar.
I’ve left some minor suggestions inline. I’ll take another look later.
Thanks for the additional review; I've made changes according to your suggestions.
Thanks for the updates!
The tests are incredibly helpful for understanding the code—very nicely formatted and consistent, too. Thank you!
I have a few minor points, but nothing major.
@qedawkins, what are your thoughts? Also, could you take a quick look at "transform-op-fuse.mlir" with your expert eye?
Tests look great, just a couple minor comments. Thanks for contributing this upstream!
(Make sure @banach-space's concerns are addressed before landing).
LGTM % some final nits
@ofri-frishman, you've done a really great job refactoring, documenting, and testing this, thank you 🙏🏻 Honestly, I find it so much easier to follow the code after your changes.
I've left some minor nits/suggestions. Nothing major and hopefully not too much work.
Regarding the negative test for …
Thanks for your time on this. I made some final updates based on your suggestions.
LGTM, thanks!
@qedawkins has already approved, and I’m the only other reviewer, so this should be good to land.
Since it’s Friday, we can wait until Monday if you prefer—just in case the bots start complaining. 🙂 (They shouldn’t if pre-commit is 🟢, but you never know!)
Thanks for putting in all the effort on this - great work! 🙏🏻
@banach-space I made a few final tweaks. Since I don't have write permissions yet, could you land this PR for me?
@mgehre-amd @banach-space @qedawkins thank you very much for the time spent reviewing this PR; it is greatly appreciated.
A bit of book-keeping before I land this. From your summary:
Also, when I try to merge it, GitHub adds this note:
It also "prints" this note:
Please make sure that you have your e-mail address correctly set up; I suspect that's the reason for all the extra noise from GitHub. Thanks!
Add a pattern that bubbles up tensor.extract_slice through tensor.expand_shape, and add a transform op to the tensor dialect to directly use this pattern. This pattern enables tiling and fusing op chains which contain tensor.expand_shape if added as a cleanup pattern of the tile-and-fuse utility. Without this pattern that would not be possible, as tensor.expand_shape does not implement the tiling interface. In addition, this registers the pattern as a cleanup pattern for transform.structured.fuse. The pattern was first implemented in the IREE project by Quinn Dawkins and is being upstreamed.

---------

Co-authored-by: Quinn Dawkins <[email protected]>
Thanks! Just a heads-up: you’ve updated the commit message instead of the PR summary. When I press "Squash and merge" on your behalf, GitHub will use the PR summary rather than the commit message (*). This is covered in the LLVM documentation (https://llvm.org/docs/GitHub.html#github-reviews):
We should probably add a clear note about using the PR summary - there’s no way for a new contributor to see this info in the UI 😅 (I only know it because I’ve done it a few times). EDIT: See #129497. Additionally, I noticed you’ve been force-pushing instead of using fix-up commits. LLVM's policy recommends using fix-up commits instead of force-pushing:
On Co-Authors in Commit Messages
I had assumed this was already covered by our code-review policy, but it seems like it isn't:
Regardless, it's a good practice to follow - we should ensure all contributions are properly attributed (not to take anything away from your fantastic effort here!).

—Andrzej

(*) Otherwise, in the case of multiple commits, GitHub wouldn't know which commit message to use (all of them?).
Thanks for all the important info.
TBH, finding that info is not the easiest thing ATM. I know that some folks are working hard to improve that, but most of these improvements are hobby projects and these tend to take time. That said, you are a new contributor and it's for the more established contributors to mentor you :) So, mea culpa for not pointing these things out earlier! I am landing this shortly, thanks for bearing with me 🙏🏻
Add a pattern that bubbles up tensor.extract_slice through tensor.collapse_shape (#131982). The pattern is registered in a pattern population function that is used by the transform op transform.apply_patterns.tensor.bubble_up_extract_slice and by the transform op transform.structured.fuse as a cleanup pattern. This pattern enables tiling and fusing op chains which contain tensor.collapse_shape if added as a cleanup pattern of the tile-and-fuse utility. Without this pattern that would not be possible, as tensor.collapse_shape does not implement the tiling interface. This is an additional pattern to the one added in PR #126898.
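For reference, applying the pattern in isolation through that transform op would look roughly like the sketch below (the func.func matching and the payload it runs on are illustrative, not taken from the PR's tests):

```mlir
// Sketch: standalone application of the bubble-up patterns via the
// transform dialect, outside of transform.structured.fuse.
module attributes {transform.with_named_sequence} {
  transform.named_sequence @__transform_main(%arg0: !transform.any_op {transform.readonly}) {
    %func = transform.structured.match ops{["func.func"]} in %arg0
        : (!transform.any_op) -> !transform.any_op
    transform.apply_patterns to %func {
      transform.apply_patterns.tensor.bubble_up_extract_slice
    } : !transform.any_op
    transform.yield
  }
}
```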