[mlir][tosa] Fix lowering of tosa.conv2d #73240

sabauma · 2023-11-23T13:11:30Z

The lowering of tosa.conv2d produces an illegal tensor.empty operation where the number of inputs do not match the number of dynamic dimensions in the output type.

The fix is to base the generation of tensor.dim operations off the result type of the conv2d operation, rather than the input type. The problem and fix are very similar to this fix

#72724

but for convolution.

llvmbot · 2023-11-23T13:12:00Z

@llvm/pr-subscribers-mlir

@llvm/pr-subscribers-mlir-linalg

Author: Spenser Bauman (sabauma)

Changes

The lowering of tosa.conv2d produces an illegal tensor.empty operation where the number of inputs do not match the number of dynamic dimensions in the output type.

The fix is to base the generation of tensor.dim operations off the result type of the conv2d operation, rather than the input type. The problem and fix are very similar to this fix

#72724

but for convolution.

Full diff: https://github.com/llvm/llvm-project/pull/73240.diff

2 Files Affected:

(modified) mlir/lib/Conversion/TosaToLinalg/TosaToLinalgNamed.cpp (+2-2)
(modified) mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-named.mlir (+23)

diff --git a/mlir/lib/Conversion/TosaToLinalg/TosaToLinalgNamed.cpp b/mlir/lib/Conversion/TosaToLinalg/TosaToLinalgNamed.cpp
index 9e374be534985e5..328fdac461e3de4 100644
--- a/mlir/lib/Conversion/TosaToLinalg/TosaToLinalgNamed.cpp
+++ b/mlir/lib/Conversion/TosaToLinalg/TosaToLinalgNamed.cpp
@@ -136,7 +136,7 @@ static SmallVector<Value> inferDynamicDimsForConv(
   for (uint32_t i = 0, s = inputSizeDims.size(); i < s; ++i) {
     int64_t inputDim = inputSizeDims[i];
     int64_t kernelDim = kernelSizeDims[i];
-    if (inputTy.isDynamicDim(inputDim)) {
+    if (resultTy.isDynamicDim(inputDim)) {
       auto padTop = padAttr[i * 2];
       auto padBottom = padAttr[i * 2 + 1];
       auto stride = strideAttr[i];
@@ -153,7 +153,7 @@ static SmallVector<Value> inferDynamicDimsForConv(
 
   // Get the batch/channels dimensions.
   for (int i = 0; i < inputRank; i++) {
-    if (inputTy.isDynamicDim(i) && !dynDims[i])
+    if (resultTy.isDynamicDim(i) && !dynDims[i])
       dynDims[i] = rewriter.create<tensor::DimOp>(loc, input, i);
   }
 
diff --git a/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-named.mlir b/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-named.mlir
index 4edc75331932803..6bbaf6dacdb53e0 100644
--- a/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-named.mlir
+++ b/mlir/test/Conversion/TosaToLinalg/tosa-to-linalg-named.mlir
@@ -497,6 +497,29 @@ func.func @conv2d_dyn_w_h(%input: tensor<1x?x?x27xf32>, %weights: tensor<28x3x3x
 
 // -----
 
+// CHECK: [[$MAP1:.+]] = affine_map<(d0, d1, d2, d3) -> (d3)>
+// CHECK: [[$MAP2:.+]] = affine_map<(d0, d1, d2, d3) -> (d0, d1, d2, d3)>
+
+func.func @conv2d_dyn_output(%input: tensor<2x6x5x4xf32>, %weights: tensor<4x3x3x4xf32>, %bias: tensor<4xf32>) {
+  // %[[C0:.+]] = arith.constant 0 : index
+  // %[[DIM0:.+]] = tensor.dim %input, %[[C0]] : tensor<2x6x5x4xf32>
+  // %[[INIT_CONV:.+]] = tensor.empty(%[[DIM0]]) : tensor<?x4x3x4xf32>
+  // %[[ZERO:.+]] = arith.constant 0.000000e+00 : f32
+  // %[[FILL:.+]] = linalg.fill
+  // %[[INIT_GENERIC:.+]] = tensor.empty([[DIM0]]) : tensor<?x4x3x4xf32>
+
+  // %[[CONV:.+]] = linalg.conv_2d_nhwc_fhwc {dilations = dense<1> : tensor<2xi64>, strides = dense<1> : tensor<2xi64>} ins(%arg0, %arg1 : tensor<2x6x5x4xf32>, tensor<4x3x3x4xf32>) outs(%[[INIT_CONV]] : tensor<?x4x3x4xf32>) -> tensor<?x4x3x4xf32>
+  // linalg.generic {indexing_maps = [#[[MAP1]], #[[MAP2]], #[[MAP2]]], iterator_types = ["parallel", "parallel", "parallel", "parallel"]} ins(%arg2, %[[CONV]] : tensor<4xf32>, tensor<?x4x3x4xf32>) outs(%[[INIT_GENERIC]] : tensor<?x4x3x4xf32>) {
+  //   %[[ADD:.+]] = arith.addf
+  //   linalg.yield %[[ADD]] : f32
+  // } -> tensor<?x4x3x4xf32>
+
+  %0 = tosa.conv2d %input, %weights, %bias {dilation = array<i64: 1, 1>, pad = array<i64: 0, 0, 0, 0>, stride = array<i64: 1, 1>} : (tensor<2x6x5x4xf32    >, tensor<4x3x3x4xf32>, tensor<4xf32>) -> tensor<?x4x3x4xf32>
+  return
+}
+
+// -----
+
 // CHECK-LABEL: @conv2d_padded_f32
 func.func @conv2d_padded_f32(%input: tensor<1x47x40x28xf32>, %weights: tensor<28x3x3x28xf32>, %bias: tensor<28xf32>) -> () {
   // CHECK: %[[C0:.+]] = arith.constant 0

sabauma · 2023-11-28T14:29:48Z

@eric-k256 @jpienaar @rsuderman

eric-k256

How are you generating the operators where the outputs are dynamic, but the input dimensions are known. TOSA defines the calculations for the relation between the two, so it seems like you would always be able to calculate the output dimension at construction if the inputs are known.

sabauma · 2023-11-29T13:07:56Z

These operators come out of the TFL->TOSA lowerings where we end up with a mixture of TOSA and other dialects in the IR. The pattern looks something like the following:

Start with a fully dynamic tosa.conv operator:

%producer = tosa.something ... : tensor<?x?x?x?xf32>
%conv = tosa.conv2d %p ... : tensor<?x?x?x?xf32>
%consumer = non_tosa.op %conv

tosa-infer-shapes then resolves the dimensions of some of the inputs to the tosa.conv2d operator.

%producer = tosa.something ... : tensor<1x2x3x4xf32>
%conv = tosa.conv2d %p ... : tensor<?x?x?x?xf32>
%consumer = non_tosa.op %conv

tosa-infer-shapes will not update the output type of tosa.conv2d in this case because the consumer op does not have a shape inference related interface and is not a func.return op.

#72715 attempts to address this limitation in the tosa-infer-shapes pass.

It seemed desirable to address this issue in the tosa-to-linalg-named lowering as well, as the lowering currently produces invalid IR for inputs that pass all the verifiers.

sabauma · 2023-11-29T20:24:59Z

Possibly worth mentioning that this same issues occurs for tosa.fully_connected as well: #73049

eric-k256

Ok, thanks for the explanation. Looks good.

The lowering of tosa.conv2d produces an illegal tensor.empty operation where the number of inputs do not match the number of dynamic dimensions in the output type. The fix is to base the generation of tensor.dim operations off the result type of the conv2d operation, rather than the input type. The problem and fix are very similar to this fix llvm#72724 but for convolution.

llvmbot added mlir:linalg mlir labels Nov 23, 2023

sabauma force-pushed the conv2d-lowering branch from d45ff5b to fb27011 Compare November 23, 2023 14:15

eric-k256 reviewed Nov 29, 2023

View reviewed changes

eric-k256 approved these changes Nov 29, 2023

View reviewed changes

sabauma force-pushed the conv2d-lowering branch from fb27011 to e141181 Compare December 1, 2023 15:12

sabauma merged commit f58fb8c into llvm:main Dec 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][tosa] Fix lowering of tosa.conv2d #73240

[mlir][tosa] Fix lowering of tosa.conv2d #73240

Uh oh!

sabauma commented Nov 23, 2023

Uh oh!

llvmbot commented Nov 23, 2023 •

edited

Loading

Uh oh!

sabauma commented Nov 28, 2023

Uh oh!

eric-k256 left a comment

Uh oh!

sabauma commented Nov 29, 2023 •

edited

Loading

Uh oh!

sabauma commented Nov 29, 2023

Uh oh!

eric-k256 left a comment

Uh oh!

Uh oh!

[mlir][tosa] Fix lowering of tosa.conv2d #73240

[mlir][tosa] Fix lowering of tosa.conv2d #73240

Uh oh!

Conversation

sabauma commented Nov 23, 2023

Uh oh!

llvmbot commented Nov 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sabauma commented Nov 28, 2023

Uh oh!

eric-k256 left a comment

Choose a reason for hiding this comment

Uh oh!

sabauma commented Nov 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sabauma commented Nov 29, 2023

Uh oh!

eric-k256 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

llvmbot commented Nov 23, 2023 •

edited

Loading

sabauma commented Nov 29, 2023 •

edited

Loading