[RISCV][CostModel] Correct the cost of some reductions #118072

LiqinWeng · 2024-11-29T09:16:58Z

Reductions include: and/or/max/min

llvmbot · 2024-11-29T09:17:34Z

@llvm/pr-subscribers-llvm-analysis

@llvm/pr-subscribers-backend-risc-v

Author: LiqinWeng (LiqinWeng)

Changes

Reductions include: and/or/max/min

Full diff: https://github.com/llvm/llvm-project/pull/118072.diff

1 Files Affected:

(modified) llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp (+13-6)

diff --git a/llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp b/llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
index 8f0ef69258b165..b098dd0b7613e5 100644
--- a/llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
+++ b/llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp
@@ -1470,27 +1470,27 @@ RISCVTTIImpl::getMinMaxReductionCost(Intrinsic::ID IID, VectorType *Ty,
     llvm_unreachable("Unsupported intrinsic");
   case Intrinsic::smax:
     SplitOp = RISCV::VMAX_VV;
-    Opcodes = {RISCV::VMV_S_X, RISCV::VREDMAX_VS, RISCV::VMV_X_S};
+    Opcodes = {RISCV::VREDMAX_VS, RISCV::VMV_X_S};
     break;
   case Intrinsic::smin:
     SplitOp = RISCV::VMIN_VV;
-    Opcodes = {RISCV::VMV_S_X, RISCV::VREDMIN_VS, RISCV::VMV_X_S};
+    Opcodes = {RISCV::VREDMIN_VS, RISCV::VMV_X_S};
     break;
   case Intrinsic::umax:
     SplitOp = RISCV::VMAXU_VV;
-    Opcodes = {RISCV::VMV_S_X, RISCV::VREDMAXU_VS, RISCV::VMV_X_S};
+    Opcodes = {RISCV::VREDMAXU_VS, RISCV::VMV_X_S};
     break;
   case Intrinsic::umin:
     SplitOp = RISCV::VMINU_VV;
-    Opcodes = {RISCV::VMV_S_X, RISCV::VREDMINU_VS, RISCV::VMV_X_S};
+    Opcodes = {RISCV::VREDMINU_VS, RISCV::VMV_X_S};
     break;
   case Intrinsic::maxnum:
     SplitOp = RISCV::VFMAX_VV;
-    Opcodes = {RISCV::VFMV_S_F, RISCV::VFREDMAX_VS, RISCV::VFMV_F_S};
+    Opcodes = {RISCV::VFREDMAX_VS, RISCV::VFMV_F_S};
     break;
   case Intrinsic::minnum:
     SplitOp = RISCV::VFMIN_VV;
-    Opcodes = {RISCV::VFMV_S_F, RISCV::VFREDMIN_VS, RISCV::VFMV_F_S};
+    Opcodes = {RISCV::VFREDMIN_VS, RISCV::VFMV_F_S};
     break;
   }
   // Add a cost for data larger than LMUL8
@@ -1534,6 +1534,13 @@ RISCVTTIImpl::getArithmeticReductionCost(unsigned Opcode, VectorType *Ty,
              getRISCVInstructionCost(Opcodes, LT.second, CostKind) +
              getCmpSelInstrCost(Instruction::ICmp, ElementTy, ElementTy,
                                 CmpInst::ICMP_EQ, CostKind);
+    } else if (ISD == ISD::XOR) {
+      // Example sequences:
+      //   vsetvli a0, zero, e8, mf8, ta, ma
+      //   vcpop.m a0, v0
+      //   andi a0, a0, 1
+      Opcodes = {RISCV::VCPOP_M};
+      return LT.first + getRISCVInstructionCost(Opcodes, LT.second, CostKind);
     } else {
       // Example sequences:
       //   vsetvli a0, zero, e8, mf8, ta, ma


lukel97 · 2024-11-30T11:06:13Z

llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp

+      //   vcpop.m a0, v0
+      //   andi a0, a0, 1
+      Opcodes = {RISCV::VCPOP_M};
+      return LT.first + getRISCVInstructionCost(Opcodes, LT.second, CostKind);


Shouldn't this be

Suggested change

-      return LT.first + getRISCVInstructionCost(Opcodes, LT.second, CostKind);
+      return LT.first * getRISCVInstructionCost(Opcodes, LT.second, CostKind);

With that said, I'm also not sure why the existing sequences add the legalization cost instead of multiplying by it. We multiply by it everywhere else in RISCVTargetTransformInfo. Maybe this is something to fix in a follow-up patch.

This path deals with the type ElementTy->isIntegerTy(1). Do we really need to use * here? I don't understand the ISD == ISD::AND case: it uses LT.first - 1, and I'm not sure why it subtracts 1.

My cost: LT.first - 1 + getRISCVInstructionCost + 1 (the final 1 is the cost of the andi)

(LT.first - 1) first appeared in this commit.
It seems to be used to calculate the additional cost when the LMUL exceeds 8.
For example,

define zeroext i1 @vreduce_and_nxv1i1(<vscale x 128 x i1> %v) {
; vmand.mm v8, v0, v8    // additional cost when LMUL exceeds 8.
; vmnot.m v8, v8
; vcpop.m a0, v8
; seqz a0, a0
  %red = call i1 @llvm.vector.reduce.and.nxv128i1(<vscale x 128 x i1> %v)
  ret i1 %red
}
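The (LT.first - 1) term described above can be modelled with a few lines of arithmetic. This is only a sketch: the function and parameter names are hypothetical, every instruction is assumed to cost 1, and the example type is assumed to split into two parts. It is not the code in RISCVTargetTransformInfo.cpp.

```cpp
#include <cstdint>

// Toy model of the i1 AND-reduction cost discussed above.
// NumSplits stands in for LT.first; SeqCost and CmpCost are hypothetical
// stand-ins for getRISCVInstructionCost(...) and getCmpSelInstrCost(...).
constexpr std::int64_t maskAndReductionCost(std::int64_t NumSplits,
                                            std::int64_t SeqCost,
                                            std::int64_t CmpCost) {
  // One vmand.mm merges each additional part produced by splitting.
  return (NumSplits - 1) + SeqCost + CmpCost;
}

// Assuming unit costs and two parts for the example above:
// 1 (vmand.mm) + 2 (vmnot.m, vcpop.m) + 1 (seqz) = 4.
static_assert(maskAndReductionCost(2, 2, 1) == 4, "split into two parts");
// A type that needs no splitting pays for no extra vmand.mm.
static_assert(maskAndReductionCost(1, 2, 1) == 3, "no splitting");
```

The checks are evaluated at compile time, so compiling the snippet as a translation unit is enough to exercise it.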

Yeah, LT.first is the cost of type legalization, i.e. splitting. But it's the number of times the operation will be split, so we need to multiply that by the cost of the node being split. I'll open up a PR to fix the existing cases, but for this PR we might as well do it the correct way, i.e. by multiplying. (Although I don't think it will make a difference to the costs, since these mask instructions are always LMUL 1!)
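To make the difference between the two formulas concrete, here is a minimal sketch with plain integers. The names are hypothetical: NumSplits stands in for LT.first and PerOpCost for the value returned by getRISCVInstructionCost. The real code works with LLVM's InstructionCost type rather than raw integers.

```cpp
#include <cstdint>

// Adds the split count to the per-operation cost (the form used by the
// existing sequences in the discussion above).
constexpr std::int64_t addLegalizationCost(std::int64_t NumSplits,
                                           std::int64_t PerOpCost) {
  return NumSplits + PerOpCost;
}

// Scales the per-operation cost by the split count (the suggested form).
constexpr std::int64_t scaleByLegalizationCost(std::int64_t NumSplits,
                                               std::int64_t PerOpCost) {
  return NumSplits * PerOpCost;
}

// With four parts and a per-operation cost of 3 the two forms diverge.
static_assert(addLegalizationCost(4, 3) == 7, "4 + 3");
static_assert(scaleByLegalizationCost(4, 3) == 12, "4 * 3");
```

Under this toy model, the multiplicative form charges the full per-operation cost once for each part, which matches the reading of LT.first as a split count.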

I've gone ahead and done this in df10f1c

arcbbb · 2024-12-02T02:04:12Z

llvm/lib/Target/RISCV/RISCVTargetTransformInfo.cpp

+    } else if (ISD == ISD::XOR) {
+      // Example sequences:
+      //   vsetvli a0, zero, e8, mf8, ta, ma
+      //   vcpop.m a0, v0
+      //   andi a0, a0, 1


Doesn't the else block already cover this case?

Yes, fixed.

XOR doesn't use a seqz though so it shouldn't use getCmpSelInstrCost?

@lukel97 You're right. I initially considered replacing getCmpSelInstrCost with 1 to streamline the code. However, I'm fine if you'd prefer to handle the cost separately.
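For context on why the XOR sequence ends in andi rather than seqz: an XOR reduction over i1 elements is the parity of the number of set bits, so a population count followed by a mask with 1 gives the same result. The sketch below checks that identity on a 64-bit mask. The helper names are hypothetical, and the loops only model what vcpop.m and andi compute; they are not how LLVM lowers the intrinsic.

```cpp
#include <cstdint>

// XOR-reduce the low NumBits bits of Mask one element at a time.
constexpr bool xorReduce(std::uint64_t Mask, unsigned NumBits) {
  bool Acc = false;
  for (unsigned I = 0; I < NumBits; ++I) {
    const bool Bit = ((Mask >> I) & 1u) != 0;
    Acc = (Acc != Bit); // XOR on booleans
  }
  return Acc;
}

// Count the set bits (the role of vcpop.m), then keep the low bit of the
// count (the role of andi a0, a0, 1).
constexpr bool parityViaPopcount(std::uint64_t Mask, unsigned NumBits) {
  unsigned Count = 0;
  for (unsigned I = 0; I < NumBits; ++I)
    Count += static_cast<unsigned>((Mask >> I) & 1u);
  return (Count & 1u) != 0;
}

static_assert(xorReduce(0b1011, 4) == parityViaPopcount(0b1011, 4),
              "both forms agree");
static_assert(parityViaPopcount(0b1011, 4), "three set bits: odd parity");
static_assert(!parityViaPopcount(0b1001, 4), "two set bits: even parity");
```

Because the final step is a bitwise AND with an immediate rather than a comparison against zero, charging it as a compare via getCmpSelInstrCost does not describe the emitted sequence, which is the point raised above.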

arcbbb

LGTM. Please wait for one more approval.

lukel97

LGTM, thanks

LiqinWeng requested a review from arcbbb November 29, 2024 09:16

llvmbot added the backend:RISC-V label Nov 29, 2024

LiqinWeng force-pushed the correct-reduction-cost branch from 429a864 to ffe5b35 Compare November 29, 2024 09:17

[RISCV][CostModel] Correct the cost of some reductions

2a43729

Reductions include: and/or/max/min

LiqinWeng force-pushed the correct-reduction-cost branch from ffe5b35 to 2a43729 Compare November 29, 2024 09:18

llvmbot added the llvm:analysis Includes value tracking, cost tables and constant folding label Nov 29, 2024

lukel97 reviewed Nov 30, 2024

fix the comments

1588e92

LiqinWeng force-pushed the correct-reduction-cost branch from 65d8998 to 1588e92 Compare December 1, 2024 05:24

arcbbb reviewed Dec 2, 2024

LiqinWeng added 3 commits December 2, 2024 10:29

Remove the vxi1 of XOR implement

c3c17c9

Merge branch 'main' into correct-reduction-cost

5acb7d0

add cost of llvm.vector.reduce.xor with nxvxi1 or vxi1

72c9cfc

arcbbb approved these changes Dec 4, 2024

lukel97 approved these changes Dec 4, 2024

LiqinWeng merged commit 46829e5 into llvm:main Dec 4, 2024
8 checks passed

LiqinWeng deleted the correct-reduction-cost branch December 10, 2024 03:39
