[X86] Do not combine LRINT and TRUNC #125848

phoebewang · 2025-02-05T12:49:43Z

Per to discussions in #125324, most participants are opposed to this optimization. So remove the combination to address the concerns.

Fixes #125324

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324

llvmbot · 2025-02-05T12:50:19Z

@llvm/pr-subscribers-backend-x86

Author: Phoebe Wang (phoebewang)

Changes

Per to discussions in #125324, most participants are opposed to this optimization. So remove the combination to address the concerns.

Fixes #125324

Full diff: https://github.com/llvm/llvm-project/pull/125848.diff

2 Files Affected:

(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (-5)
(modified) llvm/test/CodeGen/X86/lrint-conv-i64.ll (+18)

diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 6cf6061deba702..5686d8bcbe85cd 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -53906,11 +53906,6 @@ static SDValue combineTruncate(SDNode *N, SelectionDAG &DAG,
       return DAG.getNode(X86ISD::MMX_MOVD2W, DL, MVT::i32, BCSrc);
   }
 
-  // Try to combine (trunc (vNi64 (lrint x))) to (vNi32 (lrint x)).
-  if (Src.getOpcode() == ISD::LRINT && VT.getScalarType() == MVT::i32 &&
-      Src.hasOneUse())
-    return DAG.getNode(ISD::LRINT, DL, VT, Src.getOperand(0));
-
   return SDValue();
 }
 
diff --git a/llvm/test/CodeGen/X86/lrint-conv-i64.ll b/llvm/test/CodeGen/X86/lrint-conv-i64.ll
index 01b0af2f807f20..38fa09085e1898 100644
--- a/llvm/test/CodeGen/X86/lrint-conv-i64.ll
+++ b/llvm/test/CodeGen/X86/lrint-conv-i64.ll
@@ -45,6 +45,24 @@ entry:
   ret i64 %0
 }
 
+define i32 @PR125324(float %x) {
+; SSE-LABEL: PR125324:
+; SSE:       # %bb.0: # %entry
+; SSE-NEXT:    cvtss2si %xmm0, %rax
+; SSE-NEXT:    # kill: def $eax killed $eax killed $rax
+; SSE-NEXT:    retq
+;
+; AVX-LABEL: PR125324:
+; AVX:       # %bb.0: # %entry
+; AVX-NEXT:    vcvtss2si %xmm0, %rax
+; AVX-NEXT:    # kill: def $eax killed $eax killed $rax
+; AVX-NEXT:    retq
+entry:
+  %0 = tail call i64 @llvm.lrint.i64.f32(float %x)
+  %1 = trunc i64 %0 to i32
+  ret i32 %1
+}
+
 declare i64 @llvm.lrint.i64.f32(float) nounwind readnone
 declare i64 @llvm.lrint.i64.f64(double) nounwind readnone
 declare i64 @llvm.lrint.i64.f80(x86_fp80) nounwind readnone

topperc · 2025-02-05T17:42:12Z

llvm/test/CodeGen/X86/lrint-conv-i64.ll

@@ -45,6 +45,24 @@ entry:
  ret i64 %0
 }

+define i32 @PR125324(float %x) {


Does that mean we had no tests for this combine before?

Yes, that's true 🤦‍♀️

topperc

LGTM

phoebewang · 2025-02-06T03:40:12Z

/cherry-pick 8c222c1

llvmbot · 2025-02-06T03:48:35Z

/pull-request #125995

Try to improve performance after llvm#125848

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324 (cherry picked from commit 8c222c1)

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324 (cherry picked from commit 8c222c1)

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324 (cherry picked from commit 8c222c1) Co-authored-by: Phoebe Wang <[email protected]>

[X86] Do not combine LRINT and TRUNC

38398b3

Per to discussions in llvm#125324, most participants are opposed to this optimization. So remove the combination to address the concerns. Fixes llvm#125324

phoebewang requested review from nikic, RKSimon, topperc, andykaylor and jcranmer-intel February 5, 2025 12:49

llvmbot added the backend:X86 label Feb 5, 2025

phoebewang mentioned this pull request Feb 5, 2025

[clang] [x86-64] lrint()/lrintf() using instruction writing to 32-bit register if assigned to 32-bit int even though long is 64-bit #125324

Closed

nikic approved these changes Feb 5, 2025

View reviewed changes

RKSimon approved these changes Feb 5, 2025

View reviewed changes

topperc reviewed Feb 5, 2025

View reviewed changes

topperc approved these changes Feb 5, 2025

View reviewed changes

jcranmer-intel approved these changes Feb 5, 2025

View reviewed changes

phoebewang merged commit 8c222c1 into llvm:main Feb 6, 2025
10 checks passed

phoebewang deleted the lrint branch February 6, 2025 02:58

phoebewang added this to the LLVM 20.X Release milestone Feb 6, 2025

phoebewang added a commit to phoebewang/llvm-project that referenced this pull request Feb 7, 2025

[X86] Combine LRINT/LLRINT and TRUNC when nuw/nsw

cf29bc4

Try to improve performance after llvm#125848

phoebewang mentioned this pull request Feb 7, 2025

[X86] Combine LRINT/LLRINT and TRUNC when TRUNC has nsw flag #126217

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[X86] Do not combine LRINT and TRUNC #125848

[X86] Do not combine LRINT and TRUNC #125848

Uh oh!

phoebewang commented Feb 5, 2025

Uh oh!

llvmbot commented Feb 5, 2025

Uh oh!

topperc Feb 5, 2025

Uh oh!

phoebewang Feb 6, 2025

Uh oh!

topperc left a comment

Uh oh!

Uh oh!

phoebewang commented Feb 6, 2025

Uh oh!

llvmbot commented Feb 6, 2025

Uh oh!

Uh oh!

[X86] Do not combine LRINT and TRUNC #125848

[X86] Do not combine LRINT and TRUNC #125848

Uh oh!

Conversation

phoebewang commented Feb 5, 2025

Uh oh!

llvmbot commented Feb 5, 2025

Uh oh!

topperc Feb 5, 2025

Choose a reason for hiding this comment

Uh oh!

phoebewang Feb 6, 2025

Choose a reason for hiding this comment

Uh oh!

topperc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

phoebewang commented Feb 6, 2025

Uh oh!

llvmbot commented Feb 6, 2025

Uh oh!

Uh oh!