[SLP] Allow targets to add cost for nonstandard conditions #95328

jrbyrnes · 2024-06-12T23:04:31Z

There are conditions under which vectorization is profitable but which the current cost model cannot express. For example, the profit from vectorization may depend entirely on properties of the users of the tree entry.

This hook gives targets a chance to express such conditions.

llvmbot · 2024-06-12T23:05:02Z

@llvm/pr-subscribers-backend-amdgpu
@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-llvm-analysis

Author: Jeffrey Byrnes (jrbyrnes)

Changes

There are conditions under which vectorization is profitable but which the current cost model cannot express. For example, the profit from vectorization may depend entirely on properties of the users of the tree entry.

This hook gives targets a chance to express such conditions.

Full diff: https://github.com/llvm/llvm-project/pull/95328.diff

5 Files Affected:

(modified) llvm/include/llvm/Analysis/TargetTransformInfo.h (+16)
(modified) llvm/include/llvm/Analysis/TargetTransformInfoImpl.h (+5)
(modified) llvm/include/llvm/CodeGen/BasicTTIImpl.h (+5)
(modified) llvm/lib/Analysis/TargetTransformInfo.cpp (+5)
(modified) llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp (+8)

diff --git a/llvm/include/llvm/Analysis/TargetTransformInfo.h b/llvm/include/llvm/Analysis/TargetTransformInfo.h
index f55f21c94a85a..49117ca8c74c5 100644
--- a/llvm/include/llvm/Analysis/TargetTransformInfo.h
+++ b/llvm/include/llvm/Analysis/TargetTransformInfo.h
@@ -891,6 +891,11 @@ class TargetTransformInfo {
                                            bool Insert, bool Extract,
                                            TTI::TargetCostKind CostKind) const;
 
+  /// Whether or not there is any target-specific condition that imposes an
+  /// overhead for scalarization
+  bool hasScalarizationOverhead(ArrayRef<Value *> VL,
+                                std::pair<bool, bool> &ScalarizationKind) const;
+
   /// Estimate the overhead of scalarizing an instructions unique
   /// non-constant operands. The (potentially vector) types to use for each of
   /// argument are passes via Tys.
@@ -1921,6 +1926,10 @@ class TargetTransformInfo::Concept {
   getOperandsScalarizationOverhead(ArrayRef<const Value *> Args,
                                    ArrayRef<Type *> Tys,
                                    TargetCostKind CostKind) = 0;
+
+  virtual bool
+  hasScalarizationOverhead(ArrayRef<Value *> VL,
+                           std::pair<bool, bool> &ScalarizationKind) = 0;
   virtual bool supportsEfficientVectorElementLoadStore() = 0;
   virtual bool supportsTailCalls() = 0;
   virtual bool supportsTailCallFor(const CallBase *CB) = 0;
@@ -2456,6 +2465,13 @@ class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {
     return Impl.getScalarizationOverhead(Ty, DemandedElts, Insert, Extract,
                                          CostKind);
   }
+
+  bool
+  hasScalarizationOverhead(ArrayRef<Value *> VL,
+                           std::pair<bool, bool> &ScalarizationKind) override {
+    return Impl.hasScalarizationOverhead(VL, ScalarizationKind);
+  }
+
   InstructionCost
   getOperandsScalarizationOverhead(ArrayRef<const Value *> Args,
                                    ArrayRef<Type *> Tys,
diff --git a/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h b/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
index 7828bdc1f1f43..1d3e6752006d9 100644
--- a/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
+++ b/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
@@ -371,6 +371,11 @@ class TargetTransformInfoImplBase {
     return 0;
   }
 
+  bool hasScalarizationOverhead(ArrayRef<Value *> VL,
+                                std::pair<bool, bool> &ScalarizationKind) {
+    return false;
+  }
+
   InstructionCost
   getOperandsScalarizationOverhead(ArrayRef<const Value *> Args,
                                    ArrayRef<Type *> Tys,
diff --git a/llvm/include/llvm/CodeGen/BasicTTIImpl.h b/llvm/include/llvm/CodeGen/BasicTTIImpl.h
index 9f8d3ded9b3c1..2aa36c724bc03 100644
--- a/llvm/include/llvm/CodeGen/BasicTTIImpl.h
+++ b/llvm/include/llvm/CodeGen/BasicTTIImpl.h
@@ -807,6 +807,11 @@ class BasicTTIImplBase : public TargetTransformInfoImplCRTPBase<T> {
                                              CostKind);
   }
 
+  bool hasScalarizationOverhead(ArrayRef<Value *> VL,
+                                std::pair<bool, bool> &ScalarizationKind) {
+    return false;
+  }
+
   /// Estimate the overhead of scalarizing an instructions unique
   /// non-constant operands. The (potentially vector) types to use for each of
   /// argument are passes via Tys.
diff --git a/llvm/lib/Analysis/TargetTransformInfo.cpp b/llvm/lib/Analysis/TargetTransformInfo.cpp
index 7e721cbc87f3f..81b1e6b181bb0 100644
--- a/llvm/lib/Analysis/TargetTransformInfo.cpp
+++ b/llvm/lib/Analysis/TargetTransformInfo.cpp
@@ -594,6 +594,11 @@ InstructionCost TargetTransformInfo::getScalarizationOverhead(
                                            CostKind);
 }
 
+bool TargetTransformInfo::hasScalarizationOverhead(
+    ArrayRef<Value *> VL, std::pair<bool, bool> &ScalarizeKind) const {
+  return TTIImpl->hasScalarizationOverhead(VL, ScalarizeKind);
+}
+
 InstructionCost TargetTransformInfo::getOperandsScalarizationOverhead(
     ArrayRef<const Value *> Args, ArrayRef<Type *> Tys,
     TTI::TargetCostKind CostKind) const {
diff --git a/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp b/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
index ae0819c964bef..f189e9b6ba14b 100644
--- a/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
+++ b/llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
@@ -9084,6 +9084,14 @@ BoUpSLP::getEntryCost(const TreeEntry *E, ArrayRef<Value *> VectorizedVals,
         E, ScalarTy, *TTI, VectorizedVals, *this, CheckedExtracts);
   }
   InstructionCost CommonCost = 0;
+  std::pair<bool, bool> ScalarizationKind(false, false);
+  if (TTI->hasScalarizationOverhead(VL, ScalarizationKind)) {
+    APInt DemandedElts = APInt::getAllOnes(VL.size());
+    CommonCost -= TTI->getScalarizationOverhead(
+        VecTy, DemandedElts,
+        /*Insert*/ ScalarizationKind.first,
+        /*Extract*/ ScalarizationKind.second, CostKind);
+  }
   SmallVector<int> Mask;
   bool IsReverseOrder = isReverseOrder(E->ReorderIndices);
   if (!E->ReorderIndices.empty() &&
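
The SLPVectorizer.cpp hunk above can be modeled outside of LLVM as a rough sketch. Everything here is a stand-in and an assumption, not LLVM's API: `MockTTI`, `HypotheticalTarget`, `commonCostAdjustment`, the integer `Cost` type, and the flat one-unit-per-element cost are invented for illustration. Only the control flow (query the hook, then subtract the scalarization overhead from the common cost) follows the diff.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Stand-in for llvm::InstructionCost; a plain integer is enough for a sketch.
using Cost = int;

// Stand-in for the two TTI queries used by the SLP change (not LLVM's classes).
struct MockTTI {
  virtual ~MockTTI() = default;

  // Mirrors the default added to TargetTransformInfoImplBase: no overhead.
  virtual bool hasScalarizationOverhead(const std::vector<int> &VL,
                                        std::pair<bool, bool> &Kind) const {
    return false;
  }

  // Assumed flat cost: one unit per element for inserts, one for extracts.
  virtual Cost getScalarizationOverhead(std::size_t NumElts, bool Insert,
                                        bool Extract) const {
    return static_cast<Cost>(NumElts) * ((Insert ? 1 : 0) + (Extract ? 1 : 0));
  }
};

// Hypothetical target: reports insert-side overhead for 4-element bundles.
struct HypotheticalTarget : MockTTI {
  bool hasScalarizationOverhead(const std::vector<int> &VL,
                                std::pair<bool, bool> &Kind) const override {
    if (VL.size() != 4)
      return false;
    Kind = {/*Insert=*/true, /*Extract=*/false};
    return true;
  }
};

// Models the lines added to BoUpSLP::getEntryCost: when the target reports
// overhead, that overhead is subtracted from the entry's common cost.
Cost commonCostAdjustment(const MockTTI &TTI, const std::vector<int> &VL) {
  assert(!VL.empty() && "a bundle must contain at least one value");
  Cost CommonCost = 0;
  std::pair<bool, bool> Kind(false, false);
  if (TTI.hasScalarizationOverhead(VL, Kind))
    CommonCost -= TTI.getScalarizationOverhead(VL.size(), Kind.first,
                                               Kind.second);
  return CommonCost;
}
```

With the default implementation the adjustment is zero, so targets that do not override the hook see no change in cost. Only a target that opts in makes the vectorized entry look cheaper.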

arsenm

Include target implementation and test?

arsenm · 2024-06-13T08:18:02Z

llvm/include/llvm/Analysis/TargetTransformInfo.h

+  /// Whether or not there is any target-specific condition that imposes an
+  /// overhead for scalarization
+  bool hasScalarizationOverhead(ArrayRef<Value *> VL,
+                                std::pair<bool, bool> &ScalarizationKind) const;


What can this express that getOperandsScalarizationOverhead and getScalarizationOverhead do not?

I'm already confused by every function in TTI having multiple versions of ~everything

What can this express that getOperandsScalarizationOverhead and getScalarizationOverhead do not?

Those functions are used to calculate the cost of the scalarized sequence based on the inserts/extracts needed and the legalization costs based on TLI. For the purpose here, we can use either to calculate the cost of scalarization, but we need a mechanism to control whether or not to include this scalarization overhead for a particular tree entry (which is what this new hook is used for).

Even if we wanted to implement this control in those functions, I think we would still need to query targets as to whether or not there needs to be a cost accounting for the SelectionDAG issue workaround (assuming we could generalize the conditions in which the SelectionDAG issue occurred). Perhaps renaming the hook would help (e.g. hasSelectionDAGScalarizationOverhead)?
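
As a minimal sketch of the kind of target-side decision described above, the following shows a hook that looks at the users of each value in a bundle. The types and the condition are assumptions made up for illustration: `MockValue`, `ConsumesPackedVector`, and `hasScalarizationOverheadSketch` do not exist in LLVM, and a real target would inspect `llvm::Value` users and its own lowering constraints instead.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Stand-in for llvm::Value with just enough state for the sketch.
struct MockValue {
  // Hypothetical property: this value, when acting as a user, wants its
  // operands packed into a single vector register.
  bool ConsumesPackedVector = false;
  std::vector<const MockValue *> Users;
};

// Hypothetical hook body: report overhead only when every value in the
// bundle has users and all of those users consume a packed vector. In that
// case the scalar form would need inserts to build the vector, so the
// Insert flag (first) is set and the Extract flag (second) is left clear.
bool hasScalarizationOverheadSketch(const std::vector<const MockValue *> &VL,
                                    std::pair<bool, bool> &ScalarizationKind) {
  assert(!VL.empty() && "a bundle must contain at least one value");
  for (const MockValue *V : VL) {
    if (V->Users.empty())
      return false;
    for (const MockValue *U : V->Users)
      if (!U->ConsumesPackedVector)
        return false;
  }
  ScalarizationKind = {/*Insert=*/true, /*Extract=*/false};
  return true;
}
```

The out-parameter is written only on the path that returns true, so a caller that initializes the pair to (false, false), as the SLP change does, never sees a partially updated result.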


jrbyrnes · 2024-06-13T19:44:30Z

Added an implementation of hasScalarizationOverhead and a test.

Rebased on top of #91016 to facilitate the test.


jrbyrnes requested review from arsenm and alexey-bataev June 12, 2024 23:04

llvmbot added the vectorizers, llvm:analysis, and llvm:transforms labels Jun 12, 2024

arsenm reviewed Jun 13, 2024


jrbyrnes added 3 commits June 13, 2024 11:43

[AMDGPU] Allow SLP to analyze i8s

69ecffc

Change-Id: Ia995bc646e5f050083bd6277eeabe0b5ab410f47

[SLP] Allow targets to add cost for nonstandard conditions

b245a49

Change-Id: I0de224f42d77bb25fcbae5ccd6ad863560d0bb1d

Review comments + Rebase on top of llvm#91016

0210138

Change-Id: If5ac53d5235ee8c65c53454b209c9f155c17edc4

jrbyrnes force-pushed the SLPCostHookRebase0 branch from dd65a39 to 0210138 Compare June 13, 2024 19:44

llvmbot added the backend:AMDGPU label Jun 13, 2024

jrbyrnes mentioned this pull request Jun 13, 2024

[AMDGPU] Allow SLP to analyze i8s #91016

Open

jrbyrnes added 2 commits June 13, 2024 13:28

Remove unused variables

f085102

Change-Id: Ideeafb60bc63c8bc09faa33f09dfb89f2c379819

Use const

2047613

Change-Id: Iee3e2c5036fc946df05aa45a6122e8913cf9a916

jrbyrnes mentioned this pull request Jun 17, 2024

[AMDGPU] Vectorize i8 Shuffles #95840

Closed

jrbyrnes mentioned this pull request Aug 23, 2024

[AMDGPU] Vectorize i8 Shuffles #105850

Open

jrbyrnes mentioned this pull request Oct 18, 2024

[AMDGPU] Allow SLP to analyze i8s #113002

Open
