[GlobalISel] Add support for most G_VECREDUCE_* operations to moreElementsVector #81830

dc03-work · 2024-02-15T07:37:18Z

The code for getting the "neutral" element is taken almost exactly as it is in
SelectionDAG, with the exception that support for
G_VECREDUCE_{FMAXIMUM,FMINIMUM} was not added.

The code for SelectionDAG is located at
SelectionDAG::getNeutralELement().

Created using spr 1.3.5

github-actions · 2024-02-15T07:37:39Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be
notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write
permissions for the repository. In which case you can instead tag reviewers by
name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review
by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate
is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2024-02-15T07:38:09Z

@llvm/pr-subscribers-llvm-globalisel

Author: Dhruv Chawla (work) (dc03-work)

Changes

The code for getting the "neutral" element is taken almost exactly as it is in
SelectionDAG, with the exception that support for
G_VECREDUCE_{FMAXIMUM,FMINIMUM} was not added.

The code for SelectionDAG is located at
SelectionDAG::getNeutralELement().

Full diff: https://github.com/llvm/llvm-project/pull/81830.diff

2 Files Affected:

(modified) llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h (+4)
(modified) llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp (+65)

diff --git a/llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h b/llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h
index a7ecf0dc1ba216..90487ae3bc2ebd 100644
--- a/llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h
+++ b/llvm/include/llvm/CodeGen/GlobalISel/LegalizerHelper.h
@@ -281,6 +281,10 @@ class LegalizerHelper {
                                          MachineInstr &MI,
                                          LostDebugLocObserver &LocObserver);
 
+  MachineInstrBuilder
+  getNeutralElementForVecReduce(unsigned Opcode, MachineIRBuilder &MIRBuilder,
+                                LLT Ty);
+
 public:
   /// Return the alignment to use for a stack temporary object with the given
   /// type.
diff --git a/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp b/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
index e39fdae1ccbedb..ad1003839e3371 100644
--- a/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
+++ b/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
@@ -5159,6 +5159,42 @@ LegalizerHelper::moreElementsVectorPhi(MachineInstr &MI, unsigned TypeIdx,
   return Legalized;
 }
 
+MachineInstrBuilder LegalizerHelper::getNeutralElementForVecReduce(
+    unsigned Opcode, MachineIRBuilder &MIRBuilder, LLT Ty) {
+  assert(Ty.isScalar() && "Expected scalar type to make neutral element for");
+
+  switch (Opcode) {
+  default:
+    return MIRBuilder.buildUndef(Ty);
+  case TargetOpcode::G_VECREDUCE_ADD:
+  case TargetOpcode::G_VECREDUCE_OR:
+  case TargetOpcode::G_VECREDUCE_XOR:
+  case TargetOpcode::G_VECREDUCE_UMAX:
+    return MIRBuilder.buildConstant(Ty, 0);
+  case TargetOpcode::G_VECREDUCE_MUL:
+    return MIRBuilder.buildConstant(Ty, 1);
+  case TargetOpcode::G_VECREDUCE_AND:
+  case TargetOpcode::G_VECREDUCE_UMIN:
+    return MIRBuilder.buildConstant(
+        Ty, APInt::getAllOnes(Ty.getScalarSizeInBits()));
+  case TargetOpcode::G_VECREDUCE_SMAX:
+    return MIRBuilder.buildConstant(
+        Ty, APInt::getSignedMinValue(Ty.getSizeInBits()));
+  case TargetOpcode::G_VECREDUCE_SMIN:
+    return MIRBuilder.buildConstant(
+        Ty, APInt::getSignedMaxValue(Ty.getSizeInBits()));
+  case TargetOpcode::G_VECREDUCE_FADD:
+    return MIRBuilder.buildFConstant(Ty, -0.0);
+  case TargetOpcode::G_VECREDUCE_FMUL:
+    return MIRBuilder.buildFConstant(Ty, 1.0);
+  case TargetOpcode::G_VECREDUCE_FMINIMUM:
+  case TargetOpcode::G_VECREDUCE_FMAXIMUM:
+    assert(false && "getNeutralElementForVecReduce unimplemented for "
+                    "G_VECREDUCE_FMINIMUM and G_VECREDUCE_FMAXIMUM!");
+  }
+  llvm_unreachable("switch expected to return!");
+}
+
 LegalizerHelper::LegalizeResult
 LegalizerHelper::moreElementsVector(MachineInstr &MI, unsigned TypeIdx,
                                     LLT MoreTy) {
@@ -5341,6 +5377,35 @@ LegalizerHelper::moreElementsVector(MachineInstr &MI, unsigned TypeIdx,
     Observer.changedInstr(MI);
     return Legalized;
   }
+  case TargetOpcode::G_VECREDUCE_FADD:
+  case TargetOpcode::G_VECREDUCE_FMUL:
+  case TargetOpcode::G_VECREDUCE_ADD:
+  case TargetOpcode::G_VECREDUCE_MUL:
+  case TargetOpcode::G_VECREDUCE_AND:
+  case TargetOpcode::G_VECREDUCE_OR:
+  case TargetOpcode::G_VECREDUCE_XOR:
+  case TargetOpcode::G_VECREDUCE_SMAX:
+  case TargetOpcode::G_VECREDUCE_SMIN:
+  case TargetOpcode::G_VECREDUCE_UMAX:
+  case TargetOpcode::G_VECREDUCE_UMIN: {
+    LLT OrigTy = MRI.getType(MI.getOperand(1).getReg());
+    MachineOperand &MO = MI.getOperand(1);
+    auto NewVec = MIRBuilder.buildPadVectorWithUndefElements(MoreTy, MO);
+    auto NeutralElement = getNeutralElementForVecReduce(
+        MI.getOpcode(), MIRBuilder, MoreTy.getElementType());
+    for (size_t i = OrigTy.getNumElements(), e = MoreTy.getNumElements();
+         i != e; i++) {
+      auto Idx = MIRBuilder.buildConstant(LLT::scalar(32), i);
+      NewVec = MIRBuilder.buildInsertVectorElement(MoreTy, NewVec,
+                                                   NeutralElement, Idx);
+    }
+
+    Observer.changingInstr(MI);
+    MO.setReg(NewVec.getReg(0));
+    Observer.changedInstr(MI);
+    return Legalized;
+  }
+
   default:
     return UnableToLegalize;
   }

dc03-work · 2024-02-15T07:41:09Z

Sorry for the long branch names on both of these PRs, I do not know how to change the default branch names with SPR.

davemgreen · 2024-02-15T09:47:29Z

Hi - this sounds really good. It's nice to start getting support for these operations.

We usually prefer that patches are committed alongside tests, so I think it would make sense to combine this in with #81831. That way we can add the AArch64 legalization at the same time and make sure it's properly tested.

dc03-work · 2024-02-19T05:38:31Z

Hi - this sounds really good. It's nice to start getting support for these operations.

We usually prefer that patches are committed alongside tests, so I think it would make sense to combine this in with #81831. That way we can add the AArch64 legalization at the same time and make sure it's properly tested.

The problem with merging with #81831 is that it only focuses on adding support for SMIN and friends, so it would still not end up testing the whole function. TBH I'm not sure how to test these functions, because that would require finding cases for all these operations where we need to add more elements to the vector.

For AArch64, one solution could be to limit this patch to just SMIN and friends for now, but I feel that's not very beneficial to other backends who could use the support for all operations.

madhur13490 · 2024-02-23T05:22:17Z

Hi @dc03-work I am in favour of merging too. It would be easy to get tested and no need to have separate tests for the first patch. Can we please combine them?

dc03-work · 2024-02-23T08:15:16Z

I have merged these two PRs in #82740, so I'll be closing them.

[𝘀𝗽𝗿] initial version

3ab2123

Created using spr 1.3.5

llvmbot added the llvm:globalisel label Feb 15, 2024

dc03-work mentioned this pull request Feb 15, 2024

[AArch64][GlobalISel] Improve codegen for G_VECREDUCE_{SMIN,SMAX,UMIN,UMAX} for odd-sized vectors #81831

Closed

dc03-work requested review from madhur13490 and davemgreen February 15, 2024 07:38

dc03-work closed this Feb 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[GlobalISel] Add support for most G_VECREDUCE_* operations to moreElementsVector #81830

[GlobalISel] Add support for most G_VECREDUCE_* operations to moreElementsVector #81830

Uh oh!

dc03-work commented Feb 15, 2024

Uh oh!

github-actions bot commented Feb 15, 2024

Uh oh!

llvmbot commented Feb 15, 2024

Uh oh!

dc03-work commented Feb 15, 2024

Uh oh!

davemgreen commented Feb 15, 2024

Uh oh!

dc03-work commented Feb 19, 2024

Uh oh!

madhur13490 commented Feb 23, 2024

Uh oh!

dc03-work commented Feb 23, 2024

Uh oh!

Uh oh!

[GlobalISel] Add support for most G_VECREDUCE_* operations to moreElementsVector #81830

[GlobalISel] Add support for most G_VECREDUCE_* operations to moreElementsVector #81830

Uh oh!

Conversation

dc03-work commented Feb 15, 2024

Uh oh!

github-actions bot commented Feb 15, 2024

Uh oh!

llvmbot commented Feb 15, 2024

Uh oh!

dc03-work commented Feb 15, 2024

Uh oh!

davemgreen commented Feb 15, 2024

Uh oh!

dc03-work commented Feb 19, 2024

Uh oh!

madhur13490 commented Feb 23, 2024

Uh oh!

dc03-work commented Feb 23, 2024

Uh oh!

Uh oh!