AMDGPU: Make sqrt and rsq intrinsics propagate poison #130914

arsenm · 2025-03-12T07:13:42Z

No description provided.

arsenm · 2025-03-12T07:13:59Z

This stack of pull requests is managed by Graphite.

llvmbot · 2025-03-12T07:15:14Z

@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-amdgpu

Author: Matt Arsenault (arsenm)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/130914.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp (+2)
(modified) llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll (+24)

diff --git a/llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp b/llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp
index 6f6556365ebf6..5314738b2b8ac 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp
@@ -548,6 +548,8 @@ GCNTTIImpl::instCombineIntrinsic(InstCombiner &IC, IntrinsicInst &II) const {
   case Intrinsic::amdgcn_sqrt:
   case Intrinsic::amdgcn_rsq: {
     Value *Src = II.getArgOperand(0);
+    if (isa<PoisonValue>(Src))
+      return IC.replaceInstUsesWith(II, Src);
 
     // TODO: Move to ConstantFolding/InstSimplify?
     if (isa<UndefValue>(Src)) {
diff --git a/llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll b/llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
index 42ddc71dab848..fca3860240294 100644
--- a/llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
+++ b/llvm/test/Transforms/InstCombine/AMDGPU/amdgcn-intrinsics.ll
@@ -89,6 +89,14 @@ declare half @llvm.amdgcn.sqrt.f16(half) nounwind readnone
 declare float @llvm.amdgcn.sqrt.f32(float) nounwind readnone
 declare double @llvm.amdgcn.sqrt.f64(double) nounwind readnone
 
+define half @test_constant_fold_sqrt_f16_poison() nounwind {
+; CHECK-LABEL: @test_constant_fold_sqrt_f16_poison(
+; CHECK-NEXT:    ret half poison
+;
+  %val = call half @llvm.amdgcn.sqrt.f16(half poison) nounwind readnone
+  ret half %val
+}
+
 define half @test_constant_fold_sqrt_f16_undef() nounwind {
 ; CHECK-LABEL: @test_constant_fold_sqrt_f16_undef(
 ; CHECK-NEXT:    ret half 0xH7E00
@@ -97,6 +105,14 @@ define half @test_constant_fold_sqrt_f16_undef() nounwind {
   ret half %val
 }
 
+define float @test_constant_fold_sqrt_f32_poison() nounwind {
+; CHECK-LABEL: @test_constant_fold_sqrt_f32_poison(
+; CHECK-NEXT:    ret float poison
+;
+  %val = call float @llvm.amdgcn.sqrt.f32(float poison) nounwind readnone
+  ret float %val
+}
+
 define float @test_constant_fold_sqrt_f32_undef() nounwind {
 ; CHECK-LABEL: @test_constant_fold_sqrt_f32_undef(
 ; CHECK-NEXT:    ret float 0x7FF8000000000000
@@ -234,6 +250,14 @@ define double @test_amdgcn_sqrt_f64(double %arg) {
 
 declare float @llvm.amdgcn.rsq.f32(float) nounwind readnone
 
+define float @test_constant_fold_rsq_f32_poison() nounwind {
+; CHECK-LABEL: @test_constant_fold_rsq_f32_poison(
+; CHECK-NEXT:    ret float poison
+;
+  %val = call float @llvm.amdgcn.rsq.f32(float poison) nounwind readnone
+  ret float %val
+}
+
 define float @test_constant_fold_rsq_f32_undef() nounwind {
 ; CHECK-LABEL: @test_constant_fold_rsq_f32_undef(
 ; CHECK-NEXT:    ret float 0x7FF8000000000000
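A note on the constants in the CHECK lines above: LLVM's textual IR prints a `float` constant that has no exact decimal form as the hexadecimal bits of the same value widened to `double`, which is why the f32 tests expect `0x7FF8000000000000` rather than a 32-bit pattern. `0xH7E00` is the half-precision quiet NaN. The sketch below is plain C++ with no LLVM dependency and reproduces the widening. The helper names `bitsOf`, `asIRHex` and `checkQuietNaNBits` are made up for illustration, and the exact NaN bits assume a typical IEEE-754 target such as x86-64 or AArch64.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <limits>

// Reinterpret the bits of a float as a 32-bit unsigned integer.
std::uint32_t bitsOf(float f) {
  static_assert(sizeof(float) == sizeof(std::uint32_t), "expects 32-bit float");
  std::uint32_t u;
  std::memcpy(&u, &f, sizeof u);
  return u;
}

// Reinterpret the bits of a double as a 64-bit unsigned integer.
std::uint64_t bitsOf(double d) {
  static_assert(sizeof(double) == sizeof(std::uint64_t), "expects 64-bit double");
  std::uint64_t u;
  std::memcpy(&u, &d, sizeof u);
  return u;
}

// Hypothetical helper: the 64-bit pattern that IR text shows for a float
// constant, obtained by widening the value to double first.
std::uint64_t asIRHex(float f) { return bitsOf(static_cast<double>(f)); }

// Self-check under the platform assumption stated above.
void checkQuietNaNBits() {
  const float qnan = std::numeric_limits<float>::quiet_NaN();
  assert(bitsOf(qnan) == 0x7FC00000u);            // 32-bit quiet NaN
  assert(asIRHex(qnan) == 0x7FF8000000000000ull); // as printed in the f32 tests
}
```

Calling `checkQuietNaNBits()` from a test driver is enough to confirm that the 32-bit quiet NaN and the 64-bit pattern in the f32 CHECK lines denote the same value.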

shiltian · 2025-03-12T13:20:07Z

llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp

@@ -548,6 +548,8 @@ GCNTTIImpl::instCombineIntrinsic(InstCombiner &IC, IntrinsicInst &II) const {
  case Intrinsic::amdgcn_sqrt:
  case Intrinsic::amdgcn_rsq: {
    Value *Src = II.getArgOperand(0);
+    if (isa<PoisonValue>(Src))
+      return IC.replaceInstUsesWith(II, Src);


Why does undef give QNaN while poison gives poison?

arsenm · 2025-03-12

We've done this for a while for FP ops. I think the reasoning is that if the original value could have been a signaling NaN or a denormal, it would have gone through canonicalization. By returning a qNaN we still guarantee a canonical value (technically we don't guarantee this for generic math ops, but I guess we can maintain it for target intrinsics).
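The difference discussed in this thread can be sketched as a toy model in plain C++. This is not LLVM's API: `Kind`, `Operand`, `foldSqrtLike` and the other names are hypothetical. It assumes only what the patch shows plus one fact about LLVM's class hierarchy, namely that `PoisonValue` derives from `UndefValue`, so an `isa<UndefValue>` check also matches poison. That is why the new poison check sits before the existing undef check.

```cpp
#include <cassert>
#include <cmath>
#include <limits>
#include <optional>

// Hypothetical toy model: an operand is poison, undef, or a concrete float.
enum class Kind { Poison, Undef, Concrete };

struct Operand {
  Kind kind;
  float value; // only meaningful when kind == Kind::Concrete
};

bool isPoison(const Operand &o) { return o.kind == Kind::Poison; }

// Models isa<UndefValue>: true for undef and, by inheritance, for poison.
bool isUndefOrPoison(const Operand &o) { return o.kind != Kind::Concrete; }

// Mirrors the order of checks in the patch. Returns the folded result, or
// std::nullopt when neither rule applies.
std::optional<Operand> foldSqrtLike(const Operand &src) {
  if (isPoison(src))
    return src; // poison propagates unchanged
  if (isUndefOrPoison(src)) // only plain undef reaches this point
    return Operand{Kind::Concrete, std::numeric_limits<float>::quiet_NaN()};
  return std::nullopt;
}

// Self-check of the three cases.
void checkFoldModel() {
  auto p = foldSqrtLike(Operand{Kind::Poison, 0.0f});
  assert(p && p->kind == Kind::Poison);

  auto u = foldSqrtLike(Operand{Kind::Undef, 0.0f});
  assert(u && u->kind == Kind::Concrete && std::isnan(u->value));

  assert(!foldSqrtLike(Operand{Kind::Concrete, 4.0f}));
}
```

If the two checks were swapped in this model, poison would take the undef path and fold to a quiet NaN, which is the behaviour the new tests rule out.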

arsenm · 2025-03-13T02:54:03Z

Merge activity

Mar 12, 10:54 PM EDT: A user started a stack merge that includes this pull request via Graphite.
Mar 12, 10:59 PM EDT: Graphite rebased this pull request as part of a merge.
Mar 12, 11:01 PM EDT: A user merged this pull request with Graphite.

This was referenced Mar 12, 2025

AMDGPU: Make rcp intrinsic propagate poison #130913

Merged

AMDGPU: Make frexp_exp and frexp_mant intrinsics propagate poison #130915

Merged

arsenm added the backend:AMDGPU label Mar 12, 2025 — with Graphite App

arsenm requested review from jayfoad, Pierre-vh, rovka and shiltian March 12, 2025 07:15

arsenm marked this pull request as ready for review March 12, 2025 07:15

llvmbot added the llvm:instcombine and llvm:transforms labels Mar 12, 2025

shiltian reviewed Mar 12, 2025

shiltian approved these changes Mar 13, 2025

arsenm force-pushed the users/arsenm/amdgpu/make-rcp-intrinsic-propagate-poison branch from 8fd7cd3 to 2b21ffa Compare March 13, 2025 02:55

Base automatically changed from users/arsenm/amdgpu/make-rcp-intrinsic-propagate-poison to main March 13, 2025 02:58

AMDGPU: Make sqrt and rsq intrinsics propagate poison

97c02c4

arsenm force-pushed the users/arsenm/amdgpu/make-sqrt-rsq-intrinsics-propagate-poison branch from 9babf2e to 97c02c4 Compare March 13, 2025 02:59

arsenm merged commit d8f17b3 into main Mar 13, 2025
5 of 9 checks passed

arsenm deleted the users/arsenm/amdgpu/make-sqrt-rsq-intrinsics-propagate-poison branch March 13, 2025 03:01

frederik-h pushed a commit to frederik-h/llvm-project that referenced this pull request Mar 18, 2025

AMDGPU: Make sqrt and rsq intrinsics propagate poison (llvm#130914)

2047ce1
