
[RISCV] Inhibit DAG folding shl through zext.w pattern with zba #91626


Closed
wants to merge 2 commits

Conversation

preames
Collaborator

@preames preames commented May 9, 2024

If we allow the fold, the zext.w pattern becomes an and by a shifted 32-bit mask. In practice, we can't undo this during ISEL, resulting in worse code in some cases. There is a cost to inhibiting the generic transform -- we lose out on the possibility of folds enabled by pushing the shift earlier.
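For illustration only (not code from the patch; the constants are made up): a minimal standalone sketch of the arithmetic behind the generic fold. Both shapes compute the same value, but the second one masks with a shifted 32-bit constant and no longer looks like zext.w/slli.uw.

#include <cassert>
#include <cstdint>

int main() {
  const uint64_t X = 0x123456789abcdef0; // arbitrary input
  const unsigned C = 4;                  // arbitrary shift amount

  // zext.w-friendly shape: shl (and X, 0xffffffff), C
  uint64_t before = (X & 0xffffffffULL) << C;

  // shape after the generic fold: and (shl X, C), (0xffffffff << C)
  uint64_t after = (X << C) & (0xffffffffULL << C);

  assert(before == after); // same value, but a harder-to-match mask
  return 0;
}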

@preames preames requested review from dtcxzyw and topperc May 9, 2024 17:12
@llvmbot
Member

llvmbot commented May 9, 2024

@llvm/pr-subscribers-backend-risc-v

Author: Philip Reames (preames)

Changes

If we allow the fold, the zext.w pattern becomes an and by a shifted 32-bit mask. In practice, we can't undo this during ISEL, resulting in worse code in some cases. There is a cost to inhibiting the generic transform -- we lose out on the possibility of folds enabled by pushing the shift earlier.


Full diff: https://github.com/llvm/llvm-project/pull/91626.diff

2 Files Affected:

  • (modified) llvm/lib/Target/RISCV/RISCVISelLowering.cpp (+7)
  • (modified) llvm/test/CodeGen/RISCV/rv64zba.ll (+2-4)
diff --git a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
index 846768f6d631e..60b21fb508990 100644
--- a/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+++ b/llvm/lib/Target/RISCV/RISCVISelLowering.cpp
@@ -17141,6 +17141,13 @@ bool RISCVTargetLowering::isDesirableToCommuteWithShift(
         return false;
     }
   }
+
+  // Don't break slli.uw patterns.
+  if (Subtarget.hasStdExtZba() && Ty.isScalarInteger() && N->getOpcode() == ISD::SHL &&
+      N0.getOpcode() == ISD::AND && isa<ConstantSDNode>(N0.getOperand(1)) &&
+      N0.getConstantOperandVal(1) == UINT64_C(0xffffffff))
+    return false;
+
   return true;
 }
 
diff --git a/llvm/test/CodeGen/RISCV/rv64zba.ll b/llvm/test/CodeGen/RISCV/rv64zba.ll
index 8fe221f2a297a..a0a7db538e835 100644
--- a/llvm/test/CodeGen/RISCV/rv64zba.ll
+++ b/llvm/test/CodeGen/RISCV/rv64zba.ll
@@ -2866,8 +2866,7 @@ define ptr @gep_lshr_i32(ptr %0, i64 %1) {
 ;
 ; RV64ZBA-LABEL: gep_lshr_i32:
 ; RV64ZBA:       # %bb.0: # %entry
-; RV64ZBA-NEXT:    slli a1, a1, 2
-; RV64ZBA-NEXT:    srli a1, a1, 4
+; RV64ZBA-NEXT:    srli a1, a1, 2
 ; RV64ZBA-NEXT:    slli.uw a1, a1, 4
 ; RV64ZBA-NEXT:    sh2add a1, a1, a1
 ; RV64ZBA-NEXT:    add a0, a0, a1
@@ -2891,8 +2890,7 @@ define i64 @srli_slliw(i64 %1) {
 ;
 ; RV64ZBA-LABEL: srli_slliw:
 ; RV64ZBA:       # %bb.0: # %entry
-; RV64ZBA-NEXT:    slli a0, a0, 2
-; RV64ZBA-NEXT:    srli a0, a0, 4
+; RV64ZBA-NEXT:    srli a0, a0, 2
 ; RV64ZBA-NEXT:    slli.uw a0, a0, 4
 ; RV64ZBA-NEXT:    ret
 entry:


github-actions bot commented May 9, 2024

✅ With the latest revision this PR passed the C/C++ code formatter.

dtcxzyw added a commit to dtcxzyw/llvm-codegen-benchmark that referenced this pull request May 9, 2024
@topperc
Collaborator

topperc commented May 9, 2024

In practice, we can't undo this during ISEL resulting in worse code in some cases.

Why can't we undo it? Does it move too far away?

@preames
Collaborator Author

preames commented May 9, 2024

In practice, we can't undo this during ISEL resulting in worse code in some cases.

Why can't we undo it? Does it move too far away?

Sorry, I hadn't meant this as "we can't" so much as "we don't". We could write a non-trivial pattern match here, but we end up having to match the whole (and (shl X, C), ShiftedMask) expression and then have to check that the shift amount C and ShiftedMask are compatible. I looked at doing that, but it didn't seem particularly clean in tablegen.
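For what it's worth, the value-level condition such a match would have to verify is just that the mask equals the 32-bit mask shifted left by the shl amount. A hedged C++ sketch of that predicate (isSlliUwCompatible is a made-up name, not an existing LLVM helper, and this is not the actual tablegen pattern):

#include <cassert>
#include <cstdint>

// Hypothetical predicate: does (and (shl X, C), Mask) compute the same value
// as ((X & 0xffffffff) << C), i.e. the slli.uw semantics?
bool isSlliUwCompatible(uint64_t Mask, unsigned C) {
  return C < 64 && Mask == (0xffffffffULL << C);
}

int main() {
  assert(isSlliUwCompatible(0xffffffff0ULL, 4));    // mask == 0xffffffff << 4
  assert(!isSlliUwCompatible(0xffffffff00ULL, 4));  // mask for shift 8, shl by 4
  assert(!isSlliUwCompatible(0xffffffff0ULL, 8));   // mask for shift 4, shl by 8
  return 0;
}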

@@ -2891,8 +2890,7 @@ define i64 @srli_slliw(i64 %1) {
;
; RV64ZBA-LABEL: srli_slliw:
; RV64ZBA: # %bb.0: # %entry
-; RV64ZBA-NEXT:    slli a0, a0, 2
-; RV64ZBA-NEXT:    srli a0, a0, 4
+; RV64ZBA-NEXT:    srli a0, a0, 2
; RV64ZBA-NEXT: slli.uw a0, a0, 4
; RV64ZBA-NEXT: ret
entry:
Collaborator

Isn't this IR non-canonical per InstCombine?

Member

The original test case is canonical: https://godbolt.org/z/ee3YPfY4s

define ptr @test(ptr %0, i64 %1) {
entry:
  %2 = lshr exact i64 %1, 2
  %3 = and i64 %2, 4294967295
  %4 = getelementptr inbounds i8, ptr %0, i64 600
  %5 = getelementptr [80 x i8], ptr %4, i64 %3
  ret ptr %5
}

Collaborator

Yes, with a GEP it's canonical, but with a shl it isn't.

Collaborator Author

Yeah, this was excessive reduction apparently.

The fact that instcombine prefers the opposite form does hint that we should maybe (also?) do the late match. I was really hoping not to have to write that code...

Collaborator

I just posted it. We had the majority of the code already. Is there a 3-shift version of this we should do without Zba?

topperc added a commit to topperc/llvm-project that referenced this pull request May 9, 2024
…srli x, c4-c2), c4).

Where c1 is a shifted mask with 32 set bits and c4 trailing zeros.

This is an alternative to llvm#91626.
@preames
Collaborator Author

preames commented May 9, 2024

Abandoning in favor of Craig's alternative.

@preames preames closed this May 9, 2024
@preames preames deleted the pr-riscv-desirable-shift-op branch May 9, 2024 19:01