[MLIR][NVGPU] Fix the cga_cluster.mlir test #112191

durga4github · 2024-10-14T12:34:41Z

This patch fixes the sm90 cluster test by:

Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op conversion pattern should actually be for the
ClusterDimBlocks Op. This addresses the compilation error for this test.
The grid-size should be (4,4,1) instead of (2,2,1). This passes the scf-if check against the threshold of 3 below and actually
generates the required prints from the GPU.

llvmbot · 2024-10-14T12:35:17Z

@llvm/pr-subscribers-mlir-gpu

@llvm/pr-subscribers-mlir

Author: Durgadoss R (durga4github)

Changes

This patch fixes the sm90 cluster test by:

Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op conversion pattern should actually be for the
ClusterDimBlocks Op. This addresses the compilation error for this test.
The grid-size should be (4,4,1) instead of (2,2,1). This passes the scf-if check against the threshold of 3 below and actually
generates the required prints from the GPU.

Full diff: https://github.com/llvm/llvm-project/pull/112191.diff

2 Files Affected:

(modified) mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp (+2-2)
(modified) mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir (+1-1)

diff --git a/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp b/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
index e83574b7342725..8638ae603a0bea 100644
--- a/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
+++ b/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
@@ -373,8 +373,8 @@ void mlir::populateGpuToNVVMConversionPatterns(
       NVVM::BlockInClusterIdYOp, NVVM::BlockInClusterIdZOp>>(
       converter, IndexKind::Other, IntrType::Id);
   patterns.add<gpu::index_lowering::OpLowering<
-      gpu::ClusterDimOp, NVVM::ClusterDimXOp, NVVM::ClusterDimYOp,
-      NVVM::ClusterDimZOp>>(converter, IndexKind::Other, IntrType::Dim);
+      gpu::ClusterDimBlocksOp, NVVM::ClusterDimBlocksXOp, NVVM::ClusterDimBlocksYOp,
+      NVVM::ClusterDimBlocksZOp>>(converter, IndexKind::Other, IntrType::Dim);
   patterns.add<gpu::index_lowering::OpLowering<
       gpu::BlockIdOp, NVVM::BlockIdXOp, NVVM::BlockIdYOp, NVVM::BlockIdZOp>>(
       converter, IndexKind::Grid, IntrType::Id);
diff --git a/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir b/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
index 5c11d80178f727..c70c940564a264 100644
--- a/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
+++ b/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
@@ -18,7 +18,7 @@ module attributes {gpu.container_module} {
     return
   }
   gpu.module @gpumodule {
-    gpu.func @kernel_cluster() kernel attributes {gpu.known_block_size = array<i32: 1, 1, 1>, gpu.known_grid_size = array<i32: 2, 2, 1>} {
+    gpu.func @kernel_cluster() kernel attributes {gpu.known_block_size = array<i32: 1, 1, 1>, gpu.known_grid_size = array<i32: 4, 4, 1>} {
       %cidX = gpu.cluster_id  x
       %cidY = gpu.cluster_id  y
       %cidZ = gpu.cluster_id  z

durga4github · 2024-10-14T12:35:52Z

@grypp, Please help review this change.

github-actions · 2024-10-14T12:38:13Z

✅ With the latest revision this PR passed the C/C++ code formatter.

This patch fixes the sm90 cluster test by: * Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op conversion pattern should actually be for the ClusterDimBlocks Op. This addresses the compilation error for this test. * The grid-size should be (4,4,1) instead of (2,2,1). This passes the scf-if check against the threshold of 3 below and actually generates the required prints from the GPU. Signed-off-by: Durgadoss R <[email protected]>

durga4github · 2024-10-14T13:03:02Z

Addressed clang-format issues,

durga4github · 2024-10-14T14:13:41Z

Builds are clean, merging this.

This patch fixes the sm90 cluster test by: * Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op conversion pattern should actually be for the ClusterDimBlocks Op. This addresses the compilation error for this test. * The grid-size should be (4,4,1) instead of (2,2,1). This passes the scf-if check against the threshold of 3 below and actually generates the required prints from the GPU. Signed-off-by: Durgadoss R <[email protected]>

durga4github requested a review from grypp as a code owner October 14, 2024 12:34

llvmbot added mlir:gpu mlir labels Oct 14, 2024

grypp approved these changes Oct 14, 2024

View reviewed changes

durga4github force-pushed the durgadossr/nvgpu_test_fix2 branch from fc903a4 to 56fcfdf Compare October 14, 2024 13:02

durga4github merged commit a8b5115 into llvm:main Oct 14, 2024
8 checks passed

durga4github deleted the durgadossr/nvgpu_test_fix2 branch October 14, 2024 14:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MLIR][NVGPU] Fix the cga_cluster.mlir test #112191

[MLIR][NVGPU] Fix the cga_cluster.mlir test #112191

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

llvmbot commented Oct 14, 2024 •

edited

Loading

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

github-actions bot commented Oct 14, 2024 •

edited

Loading

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

Uh oh!

Uh oh!

[MLIR][NVGPU] Fix the cga_cluster.mlir test #112191

[MLIR][NVGPU] Fix the cga_cluster.mlir test #112191

Uh oh!

Conversation

durga4github commented Oct 14, 2024

Uh oh!

llvmbot commented Oct 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

github-actions bot commented Oct 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

durga4github commented Oct 14, 2024

Uh oh!

Uh oh!

Uh oh!

llvmbot commented Oct 14, 2024 •

edited

Loading

github-actions bot commented Oct 14, 2024 •

edited

Loading