
[llvm][InstCombine] Fold select to cmp for weak and inverted inequalities #143445


Merged
merged 2 commits into llvm:main from fold-scmp-x-0 on Jun 13, 2025

Conversation

yashnator
Contributor

Currently:

int signum(int x) {
    if (x < 0) return -1;
    if (x > 0) return +1;
    return 0;
}

is not identified as scmp(x,0).

This patch adds folding for the edge case: (x > -1) ? zext(x != 0) : -1, which is generated for the above signum and is equivalent to scmp(x, 0). For other constants/variables, the fold is already optimised.

Alive2 proof (taken from issue)

Resolves #143259

@yashnator yashnator requested a review from nikic as a code owner June 9, 2025 21:45
@llvmbot llvmbot added llvm:instcombine Covers the InstCombine, InstSimplify and AggressiveInstCombine passes llvm:transforms labels Jun 9, 2025
@llvmbot
Member

llvmbot commented Jun 9, 2025

@llvm/pr-subscribers-llvm-transforms

Author: Yash Solanki (yashnator)

Changes

Currently:

int signum(int x) {
    if (x < 0) return -1;
    if (x > 0) return +1;
    return 0;
}

is not identified as scmp(x,0).

This patch adds folding for the edge case: (x > -1) ? zext(x != 0) : -1, which is generated for the above signum and is equivalent to scmp(x, 0). For other constants/variables, the fold is already optimised.

Alive2 proof (taken from issue)

Resolves #143259


Full diff: https://github.com/llvm/llvm-project/pull/143445.diff

2 Files Affected:

  • (modified) llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp (+12)
  • (modified) llvm/test/Transforms/InstCombine/scmp.ll (+14)
diff --git a/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp b/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
index 8f46ae304353d..ec0cda18a6492 100644
--- a/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
+++ b/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp
@@ -3603,6 +3603,18 @@ Instruction *InstCombinerImpl::foldSelectToCmp(SelectInst &SI) {
        ICmpInst::getSwappedPredicate(ExtendedCmpPredicate) == Pred))
     Replace = true;
 
+  // Handle the edge case (x > -1) ? zext(x != 0), -1
+  if (IsSigned && ICmpInst::isGT(Pred) && match(FV, m_AllOnes()) &&
+      match(TV, m_ZExt(m_c_ICmp(ExtendedCmpPredicate, m_Specific(LHS),
+                                m_Zero()))) &&
+      (ExtendedCmpPredicate == ICmpInst::ICMP_NE ||
+       ICmpInst::getSwappedPredicate(ExtendedCmpPredicate) == Pred)) {
+    Value *Zero = ConstantInt::get(LHS->getType(), 0);
+    return replaceInstUsesWith(
+        SI,
+        Builder.CreateIntrinsic(SI.getType(), Intrinsic::scmp, {LHS, Zero}));
+  }
+
   // (x == y) ? 0 : (x > y ? 1 : -1)
   CmpPredicate FalseBranchSelectPredicate;
   const APInt *InnerTV, *InnerFV;
diff --git a/llvm/test/Transforms/InstCombine/scmp.ll b/llvm/test/Transforms/InstCombine/scmp.ll
index 2140a59de3fa9..b685ca20998bd 100644
--- a/llvm/test/Transforms/InstCombine/scmp.ll
+++ b/llvm/test/Transforms/InstCombine/scmp.ll
@@ -473,3 +473,17 @@ define i8 @scmp_from_select_eq_and_gt_neg3(i32 %x, i32 %y) {
   %r = select i1 %eq, i8 0, i8 %sel1
   ret i8 %r
 }
+
+; Fold (x > -1) ? zext(x != 0), -1 to scmp(x, 0)
+define i32 @scmp_x_0_from_gt_minus_1(i32 noundef %0) local_unnamed_addr #0 {
+; CHECK-LABEL: define i32 @scmp_x_0_from_gt_minus_1(
+; CHECK-SAME: i32 noundef [[TMP0:%.*]]) local_unnamed_addr {
+; CHECK-NEXT:    [[TMP2:%.*]] = call i32 @llvm.scmp.i32.i32(i32 [[TMP0]], i32 0)
+; CHECK-NEXT:    ret i32 [[TMP2]]
+;
+  %2 = icmp ne i32 %0, 0
+  %3 = zext i1 %2 to i32
+  %4 = icmp sgt i32 %0, -1
+  %5 = select i1 %4, i32 %3, i32 -1
+  ret i32 %5
+}

Member

@el-ev el-ev left a comment


Is there any test for the change?

@yashnator
Contributor Author

@el-ev I added a test in scmp.ll: scmp_x_0_from_gt_minus_1

@el-ev
Member

el-ev commented Jun 10, 2025

@el-ev I added a test in scmp.ll: scmp_x_0_from_gt_minus_1

Could you please add additional test cases and pre-commit the tests?

Please check https://llvm.org/docs/InstCombineContributorGuide.html

@AZero13
Contributor

AZero13 commented Jun 10, 2025

Irrelevant to this PR itself, but I noticed something when running both src and tgt through llc: why does llc lower the src from the posted Alive2 better than the intrinsic?

@topperc
Collaborator

topperc commented Jun 10, 2025

Irrelevant to this PR itself, but I noticed something when running both src and tgt through llc: why does llc lower the src from the posted Alive2 better than the intrinsic?

I'm guessing you mean on X86. It looks neutral on RISC-V and scmp looks better for AArch64.

@yashnator yashnator force-pushed the fold-scmp-x-0 branch 2 times, most recently from 1d1ad2b to a9e0877 on June 11, 2025 17:09
@yashnator
Contributor Author

@el-ev I have added more tests. I also added changes to handle more general predicates.

@AZero13
Contributor

AZero13 commented Jun 11, 2025

Irrelevant to this PR itself, but I noticed something when running both src and tgt through llc: why does llc lower the src from the posted Alive2 better than the intrinsic?

I'm guessing you mean on X86. It looks neutral on RISC-V and scmp looks better for AArch64.

Yeah, might have to work on that.

I mean human written asm for this in particular is:

  add edi, edi
  setnz cl
  sbb eax, eax
  or al, cl
  ret

but I do not know if I can get the compiler to lower that to this.

@topperc
Collaborator

topperc commented Jun 11, 2025

Irrelevant to this PR itself, but I noticed something when running both src and tgt through llc: why does llc lower the src from the posted Alive2 better than the intrinsic?

I'm guessing you mean on X86. It looks neutral on RISC-V and scmp looks better for AArch64.

Yeah, might have to work on that.

I mean human written asm for this in particular is:

  add edi, edi
  setnz cl
  sbb eax, eax
  or al, cl
  ret

but I do not know if I can get the compiler to lower that to this.

add edi, edi seems wrong there

@AZero13
Contributor

AZero13 commented Jun 11, 2025

add edi, edi seems wrong there

Why?

@topperc
Collaborator

topperc commented Jun 12, 2025

add edi, edi seems wrong there

Why?

Nevermind, I think it's ok, but there's one partial register write and one false dependency in that code.

@AZero13
Contributor

AZero13 commented Jun 12, 2025

add edi, edi seems wrong there

Why?

Nevermind, I think it's ok, but there's one partial register write and one false dependency in that code.

partial register write, yes, deliberate
false dependency? Where?

@AZero13
Contributor

AZero13 commented Jun 12, 2025

if you're looking at sbb eax, eax, then (1) that's probably special-cased by the CPU, and (2) even if not, eax isn't touched before that point, so it's very unlikely that the value is still in flight

I don't know if LLVM is aware of special cases like that though. Or if it's worth teaching LLVM that

Contributor

@nikic nikic left a comment


We have a getFlippedStrictnessPredicateAndConstant() helper for cases like this. I think we can handle this generically by doing a swap of the select arms to get the constant into TV, inverting the predicate, and then using getFlippedStrictnessPredicateAndConstant() to convert the now non-strict predicate into a strict predicate.

@topperc
Collaborator

topperc commented Jun 12, 2025

if you're looking at sbb eax, eax, then (1) that's probably special-cased by the CPU, and (2) even if not, eax isn't touched before that point, so it's very unlikely that the value is still in flight

I don't know if LLVM is aware of special cases like that though. Or if it's worth teaching LLVM that

For a long time it was only special cased on AMD CPUs. I don't know if Intel finally fixed it or not. There's a TuningSBBDepBreaking flag in X86Subtarget for it.

Fold select to cmp with constants in non-canonical inequalities

For constant y, inequalities aren't always canonical
so we need to check the conditions such as
- (x > y - 1) ? zext(x != y) : -1
- (x > y - 1) ? zext(x > y) : -1
- (x < y + 1) ? sext(x != y) : 1
- (x < y + 1) ? sext(x < y) : 1
and similarly for the non-strict inequalities.

Fold select into scmp/ucmp based on signedness of
the comparison predicate.

Resolves llvm#143259
Contributor

@nikic nikic left a comment


LGTM

@yashnator yashnator changed the title [llvm][InstCombine] Fold signum(x) into scmp(x, 0) [llvm][InstCombine] Fold select to cmp for weak and inverted inequalities Jun 13, 2025
@yashnator
Contributor Author

I don't have commit access; can you help merge?

@el-ev el-ev merged commit a361a3d into llvm:main Jun 13, 2025
7 checks passed
tomtor pushed a commit to tomtor/llvm-project that referenced this pull request Jun 14, 2025
Labels
llvm:instcombine Covers the InstCombine, InstSimplify and AggressiveInstCombine passes llvm:transforms
Development

Successfully merging this pull request may close these issues.

Failure to spot scmp(x, 0) idiom
7 participants