[MLIR][GPU] Add gpu.cluster_dim_blocks and gpu.cluster_block_id Ops #95245

schwarzschild-radius · 2024-06-12T13:11:28Z

This commit adds the `gpu.cluster_dim_blocks` and `gpu.cluster_block_id` ops, which represent the number of blocks per cluster and the block id within a cluster, respectively. It also fixes the description of the `gpu.cluster_dim` op and updates the `cga_cluster.mlir` test file to use `gpu.cluster_dim_blocks`.
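
For readers new to clusters, the relationship between these quantities can be sketched in plain C++ for a single dimension. This is only an illustrative model, not code from the PR: the struct and function names are hypothetical, and it assumes that the grid size in blocks is a multiple of the cluster size and that clusters tile the grid contiguously.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical one-dimensional model of a clustered launch; none of these
// names exist in MLIR. Assumption: the grid size in blocks is a multiple of
// the cluster size, and clusters tile the grid contiguously.
struct ClusterLaunch {
  uint32_t gridDimBlocks;    // blocks per grid (what gpu.grid_dim reports)
  uint32_t clusterDimBlocks; // blocks per cluster (gpu.cluster_dim_blocks)
};

// Clusters per grid, the quantity gpu.cluster_dim is documented to return.
constexpr uint32_t clusterDim(ClusterLaunch l) {
  return l.gridDimBlocks / l.clusterDimBlocks;
}

// Index of the cluster that contains a given block (gpu.cluster_id).
constexpr uint32_t clusterId(ClusterLaunch l, uint32_t blockId) {
  return blockId / l.clusterDimBlocks;
}

// Index of a block inside its own cluster (gpu.cluster_block_id).
constexpr uint32_t clusterBlockId(ClusterLaunch l, uint32_t blockId) {
  return blockId % l.clusterDimBlocks;
}

// Compile-time check of the model: 8 blocks in clusters of 2 gives 4 clusters.
static_assert(clusterDim(ClusterLaunch{8, 2}) == 4, "clusters per grid");
```

Under these assumptions, a block's grid-wide id is recovered as `clusterId * clusterDimBlocks + clusterBlockId`, which is why the blocks-per-cluster and clusters-per-grid quantities need separate ops.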

llvmbot · 2024-06-12T13:12:01Z

@llvm/pr-subscribers-mlir
@llvm/pr-subscribers-mlir-llvm
@llvm/pr-subscribers-mlir-gpu

Author: Pradeep Kumar (schwarzschild-radius)

Changes

This commit adds the `gpu.cluster_dim_blocks` op to represent the number of blocks per cluster and updates the description of the `gpu.cluster_dim` op. It also updates the `cga_cluster.mlir` test file to use `gpu.cluster_dim_blocks`.

Full diff: https://github.com/llvm/llvm-project/pull/95245.diff

5 Files Affected:

(modified) mlir/include/mlir/Dialect/GPU/IR/GPUOps.td (+14-1)
(modified) mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td (+3-3)
(modified) mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp (+2)
(modified) mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp (+5)
(modified) mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir (+3-3)

diff --git a/mlir/include/mlir/Dialect/GPU/IR/GPUOps.td b/mlir/include/mlir/Dialect/GPU/IR/GPUOps.td
index eb81b6469746f..e7e55ab42d51f 100644
--- a/mlir/include/mlir/Dialect/GPU/IR/GPUOps.td
+++ b/mlir/include/mlir/Dialect/GPU/IR/GPUOps.td
@@ -70,7 +70,7 @@ class GPU_IndexOp<string mnemonic, list<Trait> traits = []> :
 
 def GPU_ClusterDimOp : GPU_IndexOp<"cluster_dim"> {
   let description = [{
-    Returns the number of thread blocks in the cluster along
+    Returns the number of cluster identifiers per grid along
     the x, y, or z `dimension`.
 
     Example:
@@ -81,6 +81,19 @@ def GPU_ClusterDimOp : GPU_IndexOp<"cluster_dim"> {
   }];
 }
 
+def GPU_ClusterDimBlocksOp : GPU_IndexOp<"cluster_dim_blocks"> {
+  let description = [{
+    Returns the number of thread blocks in the cluster along
+    the x, y, or z `dimension`.
+
+    Example:
+
+    ```mlir
+    %cDimBlocksX = gpu.cluster_dim_blocks x
+    ```
+  }];
+}
+
 def GPU_ClusterIdOp : GPU_IndexOp<"cluster_id"> {
   let description = [{
     Returns the cluster id, i.e. the index of the current cluster within the
diff --git a/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td b/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
index 4daeeab093863..4d48b3de7a57e 100644
--- a/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
+++ b/mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
@@ -160,9 +160,9 @@ def NVVM_ClusterDimZOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.nclusterid.z">;
 def NVVM_BlockInClusterIdXOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.ctaid.x">;
 def NVVM_BlockInClusterIdYOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.ctaid.y">;
 def NVVM_BlockInClusterIdZOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.ctaid.z">;
-def NVVM_GridInClusterDimXOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.x">;
-def NVVM_GridInClusterDimYOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.y">;
-def NVVM_GridInClusterDimZOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.z">;
+def NVVM_ClusterDimBlocksXOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.x">;
+def NVVM_ClusterDimBlocksYOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.y">;
+def NVVM_ClusterDimBlocksZOp : NVVM_SpecialRegisterOp<"read.ptx.sreg.cluster.nctaid.z">;
 
 //===----------------------------------------------------------------------===//
 // CTA index and across Cluster dimensions
diff --git a/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp b/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
index b95fba20a00cb..811f9efb62951 100644
--- a/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
+++ b/mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
@@ -344,6 +344,8 @@ void mlir::populateGpuToNVVMConversionPatterns(LLVMTypeConverter &converter,
                                   NVVM::ClusterIdYOp, NVVM::ClusterIdZOp>,
       GPUIndexIntrinsicOpLowering<gpu::ClusterDimOp, NVVM::ClusterDimXOp,
                                   NVVM::ClusterDimYOp, NVVM::ClusterDimZOp>,
+      GPUIndexIntrinsicOpLowering<gpu::ClusterDimBlocksOp, NVVM::ClusterDimBlocksXOp,
+                                  NVVM::ClusterDimBlocksYOp, NVVM::ClusterDimBlocksZOp>,
       GPUIndexIntrinsicOpLowering<gpu::BlockIdOp, NVVM::BlockIdXOp,
                                   NVVM::BlockIdYOp, NVVM::BlockIdZOp>,
       GPUIndexIntrinsicOpLowering<gpu::GridDimOp, NVVM::GridDimXOp,
diff --git a/mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp b/mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp
index 69017efb9a0e6..80ea102c03bd2 100644
--- a/mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp
+++ b/mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp
@@ -86,6 +86,11 @@ static std::optional<uint64_t> getKnownLaunchDim(Op op, LaunchDims type) {
 
 void ClusterDimOp::inferResultRanges(ArrayRef<ConstantIntRanges>,
                                      SetIntRangeFn setResultRange) {
+  setResultRange(getResult(), getIndexRange(1, kMaxDim));
+}
+
+void ClusterDimBlocksOp::inferResultRanges(ArrayRef<ConstantIntRanges>,
+                                     SetIntRangeFn setResultRange) {
   setResultRange(getResult(), getIndexRange(1, kMaxClusterDim));
 }
 
diff --git a/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir b/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
index 025282ec0d688..5c11d80178f72 100644
--- a/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
+++ b/mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
@@ -22,9 +22,9 @@ module attributes {gpu.container_module} {
       %cidX = gpu.cluster_id  x
       %cidY = gpu.cluster_id  y
       %cidZ = gpu.cluster_id  z
-      %cdimX = gpu.cluster_dim  x
-      %cdimY = gpu.cluster_dim  y
-      %cdimZ = gpu.cluster_dim  z
+      %cdimX = gpu.cluster_dim_blocks  x
+      %cdimY = gpu.cluster_dim_blocks  y
+      %cdimZ = gpu.cluster_dim_blocks  z
       %bidX = gpu.block_id  x
       %bidY = gpu.block_id  y
       %bidZ = gpu.block_id  z
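
The range-inference hunk above gives the two ops different upper bounds: `gpu.cluster_dim` is bounded by `kMaxDim`, while `gpu.cluster_dim_blocks` is bounded by `kMaxClusterDim`. The following standalone sketch shows why a tighter bound can matter to a range analysis. It does not use MLIR's API: the `Range` struct and helper names are hypothetical, and the constant values are placeholders chosen for illustration, so the real definitions in `InferIntRangeInterfaceImpls.cpp` should be consulted for the actual limits.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Placeholder bounds for illustration only; the real kMaxDim and
// kMaxClusterDim are defined in InferIntRangeInterfaceImpls.cpp and may differ.
constexpr uint64_t kAssumedMaxDim = std::numeric_limits<uint32_t>::max();
constexpr uint64_t kAssumedMaxClusterDim = 8;

// Closed interval [min, max], a stand-in for the range the diff builds
// with getIndexRange(1, bound).
struct Range {
  uint64_t min;
  uint64_t max;
};

// Mirror the shape of the two inferResultRanges bodies in the diff.
constexpr Range clusterDimRange() { return {1, kAssumedMaxDim}; }
constexpr Range clusterDimBlocksRange() { return {1, kAssumedMaxClusterDim}; }

// True when every value in `r` is strictly below `limit`, the kind of fact
// a range analysis could use to fold a comparison.
constexpr bool alwaysLessThan(Range r, uint64_t limit) { return r.max < limit; }
```

With these placeholder values, a comparison such as "blocks per cluster < 16" is provable from the range alone, whereas the same comparison on the clusters-per-grid value is not.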

github-actions · 2024-06-12T13:14:27Z

✅ With the latest revision this PR passed the C/C++ code formatter.

schwarzschild-radius · 2024-06-12T13:23:32Z

CC += @durga4github for viz

grypp

Looks great. I left some comments

durga4github

Looks good to me.

schwarzschild-radius · 2024-06-14T03:44:51Z

@durga4github can you please merge it?

…lvm#95245)

This commit adds support for `gpu.cluster_dim_blocks` and `gpu.cluster_block_id` Ops to represent number of blocks per cluster and block id inside a cluster respectively. Also, fixed the description of `gpu.cluster_dim` Op and updated the `cga_cluster.mlir` test file to use `gpu.cluster_dim_blocks`

Co-authored-by: pradeepku <[email protected]>
Co-authored-by: Guray Ozen <[email protected]>

schwarzschild-radius requested a review from grypp as a code owner June 12, 2024 13:11

llvmbot added mlir:llvm mlir:gpu mlir labels Jun 12, 2024

schwarzschild-radius force-pushed the gpu_cluster_dim_op_fix branch from cd6d4cf to 4430b67 Compare June 12, 2024 13:21

grypp approved these changes Jun 12, 2024

schwarzschild-radius force-pushed the gpu_cluster_dim_op_fix branch from 4430b67 to 87b384b Compare June 12, 2024 15:19

schwarzschild-radius changed the title ~~[MLIR][GPU] Add gpu.cluster_dim_blocks Op to represent number of blocks per cluster~~ [MLIR][GPU] Add gpu.cluster_dim_blocks and gpu.cluster_block_id Ops Jun 12, 2024

grypp reviewed Jun 13, 2024

grypp approved these changes Jun 13, 2024

durga4github reviewed Jun 13, 2024

durga4github reviewed Jun 13, 2024

Update mlir/lib/Dialect/GPU/IR/InferIntRangeInterfaceImpls.cpp

4590d49

Co-authored-by: Guray Ozen <[email protected]>

durga4github merged commit bd6568c into llvm:main Jun 14, 2024
7 checks passed

jpienaar mentioned this pull request Jun 17, 2024

[mlirc] Add missing extern C #95829

Merged
