[AMDGPU] Fix mode register pass for constrained FP operations #90085

abhigargrepo · 2024-04-25T16:56:03Z

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

llvmbot · 2024-04-25T16:56:37Z

@llvm/pr-subscribers-backend-amdgpu

Author: Abhinav Garg (abhigargrepo)

Changes

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

Full diff: https://github.com/llvm/llvm-project/pull/90085.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIModeRegister.cpp (+3)
(modified) llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll (+2-4)

diff --git a/llvm/lib/Target/AMDGPU/SIModeRegister.cpp b/llvm/lib/Target/AMDGPU/SIModeRegister.cpp
index c01b1266a5530a..32a889279763a9 100644
--- a/llvm/lib/Target/AMDGPU/SIModeRegister.cpp
+++ b/llvm/lib/Target/AMDGPU/SIModeRegister.cpp
@@ -430,6 +430,9 @@ void SIModeRegister::processBlockPhase3(MachineBasicBlock &MBB,
 }
 
 bool SIModeRegister::runOnMachineFunction(MachineFunction &MF) {
+  const Function &F = MF.getFunction();
+  if (F.hasFnAttribute(llvm::Attribute::StrictFP))
+    return Changed;
   BlockInfo.resize(MF.getNumBlockIDs());
   const GCNSubtarget &ST = MF.getSubtarget<GCNSubtarget>();
   const SIInstrInfo *TII = ST.getInstrInfo();
diff --git a/llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll b/llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll
index 2403aeaa4428ad..edfaa7debe2f84 100644
--- a/llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll
+++ b/llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll
@@ -9,8 +9,7 @@ define double @ignoreStrictfp(double noundef %a, double noundef %b) #0 {
 ; GCN:       ; %bb.0:
 ; GCN-NEXT:    s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
 ; GCN-NEXT:    s_setreg_imm32_b32 hwreg(HW_REG_MODE, 2, 2), 1
-; GCN-NEXT:    s_nop 1
-; GCN-NEXT:    s_setreg_imm32_b32 hwreg(HW_REG_MODE, 2, 1), 0
+; GCN-NOT:     s_setreg_imm32_b32 hwreg(HW_REG_MODE, 2, 1), 0
 ; GCN-NEXT:    v_add_f64 v[0:1], v[0:1], v[2:3]
 ; GCN-NEXT:    s_setpc_b64 s[30:31]
   tail call void @llvm.amdgcn.s.setreg(i32 2177, i32 1)
@@ -24,8 +23,7 @@ define double @set_fpenv(double noundef %a, double noundef %b) #0 {
 ; GCN-NEXT:    s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
 ; GCN-NEXT:    s_setreg_imm32_b32 hwreg(HW_REG_MODE, 0, 23), 4
 ; GCN-NEXT:    s_setreg_imm32_b32 hwreg(HW_REG_TRAPSTS, 0, 5), 0
-; GCN-NEXT:    s_nop 0
-; GCN-NEXT:    s_setreg_imm32_b32 hwreg(HW_REG_MODE, 2, 1), 0
+; GCN-NOT:     s_setreg_imm32_b32 hwreg(HW_REG_MODE, 2, 1), 0
 ; GCN-NEXT:    v_add_f64 v[0:1], v[0:1], v[2:3]
 ; GCN-NEXT:    s_setpc_b64 s[30:31]
 entry:

llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll

llvm/lib/Target/AMDGPU/SIModeRegister.cpp

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

github-actions · 2024-04-25T20:01:40Z

✅ With the latest revision this PR passed the C/C++ code formatter.

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

[AMDGPU] Fix mode register pass for constrained FP operations

341ac99

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

llvmbot added the backend:AMDGPU label Apr 25, 2024

arsenm reviewed Apr 25, 2024

View reviewed changes

llvm/test/CodeGen/AMDGPU/mode-register-fpconstrain.ll Outdated Show resolved Hide resolved

arsenm reviewed Apr 25, 2024

View reviewed changes

llvm/lib/Target/AMDGPU/SIModeRegister.cpp Show resolved Hide resolved

abhigargrepo added 2 commits April 26, 2024 01:24

Merge branch 'main' into fixBug-siMode

91bc013

[AMDGPU] Fix mode register pass for constrained FP operations

3f5392d

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

abhigargrepo added 2 commits April 30, 2024 10:38

Merge branch 'main' into fixBug-siMode

74ca2cf

[AMDGPU] Fix mode register pass for constrained FP operations

a7f5bca

This PR will fix the si-mode-register pass which is inserting an extra setreg instruction in case of constrained FP operations. This pass will be ignored for strictfp functions.

arsenm approved these changes May 3, 2024

View reviewed changes

arsenm merged commit 76508dc into llvm:main May 3, 2024

abhigargrepo mentioned this pull request Dec 19, 2024

Request Commit Access For abhigargrepo #120604

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU] Fix mode register pass for constrained FP operations #90085

[AMDGPU] Fix mode register pass for constrained FP operations #90085

Uh oh!

abhigargrepo commented Apr 25, 2024

Uh oh!

llvmbot commented Apr 25, 2024

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Apr 25, 2024 •

edited

Loading

Uh oh!

Uh oh!

[AMDGPU] Fix mode register pass for constrained FP operations #90085

[AMDGPU] Fix mode register pass for constrained FP operations #90085

Uh oh!

Conversation

abhigargrepo commented Apr 25, 2024

Uh oh!

llvmbot commented Apr 25, 2024

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Apr 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Apr 25, 2024 •

edited

Loading