[mlir][nvvm] Fix the PTX lowering of wgmma.mma_async #76150

apaszke · 2023-12-21T13:07:33Z

The default layout of A and B matrices is row- and column-major respectively, meaning that the transpose flags have opposite meanings between those two operands.
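To make the asymmetry concrete, here is a minimal sketch of how the two transpose flags could be derived from the operand layouts. The `Layout` enum and both helper names are hypothetical stand-ins rather than the dialect's actual API, and the numeric values (row = 0, col = 1) are an assumption inferred from the cast in the diff.

```cpp
#include <cassert>

// Hypothetical stand-in for the layout attribute; values are assumed
// (row = 0, col = 1), not copied from the dialect definition.
enum class Layout : int { Row = 0, Col = 1 };

// A defaults to row-major, so the flag is set only for a column-major A.
constexpr int transposeFlagA(Layout a) { return static_cast<int>(a); }

// B defaults to column-major, so the flag is set only for a row-major B,
// i.e. the inverse of the mapping used for A.
constexpr int transposeFlagB(Layout b) { return 1 - static_cast<int>(b); }

// The default layouts need no transpose on either operand.
static_assert(transposeFlagA(Layout::Row) == 0, "row-major A is the default");
static_assert(transposeFlagB(Layout::Col) == 0, "column-major B is the default");
```

Under these assumptions, the same layout value yields opposite flags for A and B, which is why the B operand needs the `1 - ...` inversion.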

llvmbot · 2023-12-21T13:08:02Z

@llvm/pr-subscribers-mlir-llvm

@llvm/pr-subscribers-mlir

Author: Adam Paszke (apaszke)

Changes

The default layout of A and B matrices is row- and column-major respectively, meaning that the transpose flags have opposite meanings between those two operands.

Full diff: https://github.com/llvm/llvm-project/pull/76150.diff

1 Files Affected:

(modified) mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp (+1-1)

diff --git a/mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp b/mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
index 4f5d71e10f68c1..a4de89d928e1be 100644
--- a/mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
+++ b/mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
@@ -1003,7 +1003,7 @@ void NVVM::WgmmaMmaAsyncOp::getAsmValues(
         {makeConstantI32(rewriter, static_cast<int>(getLayoutA())),
          mlir::NVVM::PTXRegisterMod::Read});
     asmValues.push_back(
-        {makeConstantI32(rewriter, static_cast<int>(getLayoutB())),
+        {makeConstantI32(rewriter, 1 - static_cast<int>(getLayoutB())),
          mlir::NVVM::PTXRegisterMod::Read});
   }
 }

joker-eph · 2023-12-21T16:58:11Z

Can this be exercised by a unit test?

apaszke · 2023-12-21T18:01:59Z

Yeah, it seems like Conversion/NVVMToLLVM/nvvm-to-llvm.mlir is catching this change, so I'll need to update that as well.

apaszke · 2023-12-21T18:19:10Z

OK, the test should be updated now. It does a col-col matmul, so the correct transpose args are 1, 0, not the 1, 1 it produced previously.
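The difference for the col-col case can be checked with a small standalone sketch that compares the previous and the fixed computation of the two flags. `OperandLayout`, `TransposeArgs`, `loweringBeforeFix`, and `loweringAfterFix` are hypothetical names used only for illustration, and the enum values (row = 0, col = 1) are an assumption consistent with the numbers quoted above.

```cpp
#include <cassert>

// Hypothetical model of the operand layout; values are assumed.
enum class OperandLayout : int { Row = 0, Col = 1 };

// The pair of transpose immediates passed to the PTX instruction.
struct TransposeArgs {
  int a;
  int b;
};

// Before the fix: both layouts were cast directly to the flag value.
constexpr TransposeArgs loweringBeforeFix(OperandLayout a, OperandLayout b) {
  return {static_cast<int>(a), static_cast<int>(b)};
}

// After the fix: the B flag is inverted because B defaults to column-major.
constexpr TransposeArgs loweringAfterFix(OperandLayout a, OperandLayout b) {
  return {static_cast<int>(a), 1 - static_cast<int>(b)};
}

// The col-col case from the updated test: 1, 1 before and 1, 0 after.
static_assert(loweringBeforeFix(OperandLayout::Col, OperandLayout::Col).b == 1,
              "old lowering transposed a column-major B");
static_assert(loweringAfterFix(OperandLayout::Col, OperandLayout::Col).b == 0,
              "fixed lowering leaves a column-major B untouched");
```

Only the B flag changes between the two versions; the A flag is 1 in both because a column-major A still differs from its row-major default.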

grypp · 2023-12-21T21:21:48Z

Good catch.
We need to change the lowering of nvgpu.warpgroup.mma as well.

apaszke · 2023-12-22T12:04:29Z

Seems like the Windows failure is unrelated to this PR?

grypp · 2023-12-22T13:46:10Z

Yes, let me merge this

llvm#76150 fixed the meaning of `transposeB` in the NVVM dialect, which was initially implemented with the opposite meaning. This PR fixes the lowering of `nvgpu.warpgroup.mma` to the NVVM dialect. This will fix two integration tests: gemm_f32_f16_f16_128x128x128.mlir and gemm_pred_f32_f16_f16_128x128x128.mlir

llvmbot added mlir:llvm mlir labels Dec 21, 2023

apaszke force-pushed the wgmma-transpose-b branch from 5eef4c0 to 0aff6ae Compare December 21, 2023 13:07

[mlir][nvvm] Fix the PTX lowering of wgmma.mma_async

3c22209

apaszke force-pushed the wgmma-transpose-b branch from 0aff6ae to 3c22209 Compare December 21, 2023 18:17

grypp self-requested a review December 21, 2023 21:19

grypp approved these changes Dec 21, 2023

grypp merged commit 85b2327 into llvm:main Dec 22, 2023

grypp mentioned this pull request Jan 24, 2024

[mlir][nvgpu] Fix transposeB in nvgpu.warpgroup.mma #79271

Merged
