[mlir][nvgpu] Fix `transposeB` in `nvgpu.warpgroup.mma` #79271

grypp · 2024-01-24T10:44:54Z

The #76150 fixed meaning of transposeB in NVVM dialect which was initially implemented with opposite meaning.

This PR fixes the lowering of nvgpu.warpgroup.mma to NVVM dialect.

This will fix two integration tests:
gemm_f32_f16_f16_128x128x128.mlir
gemm_pred_f32_f16_f16_128x128x128.mlir

The llvm#76150 fixed meaning of `transposeB` in NVVM dialect which was initially implemented with opposite meaning. This PR fixes the lowering of `nvgpu.warpgroup.mma` to NVVM dialect. This will fix two integration tests: gemm_f32_f16_f16_128x128x128.mlir gemm_pred_f32_f16_f16_128x128x128.mlir

llvmbot · 2024-01-24T10:45:25Z

@llvm/pr-subscribers-mlir-gpu

@llvm/pr-subscribers-mlir

Author: Guray Ozen (grypp)

Changes

The #76150 fixed meaning of transposeB in NVVM dialect which was initially implemented with opposite meaning.

This PR fixes the lowering of nvgpu.warpgroup.mma to NVVM dialect.

This will fix two integration tests:
gemm_f32_f16_f16_128x128x128.mlir
gemm_pred_f32_f16_f16_128x128x128.mlir

Full diff: https://github.com/llvm/llvm-project/pull/79271.diff

1 Files Affected:

(modified) mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp (+1-1)

diff --git a/mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp b/mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
index 43d05b872a4fbc8..5080956a4589828 100644
--- a/mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
+++ b/mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
@@ -1407,7 +1407,7 @@ struct NVGPUWarpgroupMmaOpLowering
       NVVM::WGMMAScaleOutAttr scaleOut = generateScaleOut();
       NVVM::WGMMAScaleInAttr scaleIn = generateScaleIn();
       NVVM::MMALayoutAttr layoutA = generateWgmmaLayout(op.getTransposeA());
-      NVVM::MMALayoutAttr layoutB = generateWgmmaLayout(op.getTransposeB());
+      NVVM::MMALayoutAttr layoutB = generateWgmmaLayout(!op.getTransposeB());
 
       auto overflow = NVVM::MMAIntOverflowAttr::get(
           op->getContext(), NVVM::MMAIntOverflow::wrapped);

grypp · 2024-01-25T08:25:37Z

I am submitting this PR without reviews as it is fixes the sm_90 integration tests. The nightly tests did not catch them, because we don't test sm_90 targets yet.

llvmbot added mlir:gpu mlir labels Jan 24, 2024

grypp added 2 commits January 24, 2024 12:33

fix test

7de3c17

fix the right test

ab9f67c

grypp merged commit fa13c3e into llvm:main Jan 25, 2024

grypp deleted the fix-test branch January 25, 2024 08:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[mlir][nvgpu] Fix `transposeB` in `nvgpu.warpgroup.mma` #79271

[mlir][nvgpu] Fix `transposeB` in `nvgpu.warpgroup.mma` #79271

Uh oh!

grypp commented Jan 24, 2024

Uh oh!

llvmbot commented Jan 24, 2024 •

edited

Loading

Uh oh!

grypp commented Jan 25, 2024

Uh oh!

Uh oh!

[mlir][nvgpu] Fix transposeB in nvgpu.warpgroup.mma #79271

[mlir][nvgpu] Fix transposeB in nvgpu.warpgroup.mma #79271

Uh oh!

Conversation

grypp commented Jan 24, 2024

Uh oh!

llvmbot commented Jan 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

grypp commented Jan 25, 2024

Uh oh!

Uh oh!

[mlir][nvgpu] Fix `transposeB` in `nvgpu.warpgroup.mma` #79271

[mlir][nvgpu] Fix `transposeB` in `nvgpu.warpgroup.mma` #79271

llvmbot commented Jan 24, 2024 •

edited

Loading