-
Notifications
You must be signed in to change notification settings - Fork 14.3k
[RISCV] Update V0Defs after moving Src in peepholes #107359
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RISCV] Update V0Defs after moving Src in peepholes #107359
Conversation
If we move a pseudo in tryReduceVL or foldVMV_V_V via ensureDominates, its V0 definition may have changed so we need to update V0Defs. This shouldn't have any functional change today since any pseudo which uses V0 won't be able to move past a new definition. However this will matter if we add a peephole to convert unmasked pseudos to masked pseudos and add a use of V0.
@llvm/pr-subscribers-backend-risc-v Author: Luke Lau (lukel97) ChangesIf we move a pseudo in tryReduceVL or foldVMV_V_V via ensureDominates, This shouldn't have any functional change today since any pseudo which However this will matter if we add a peephole to convert unmasked Full diff: https://github.com/llvm/llvm-project/pull/107359.diff 1 Files Affected:
diff --git a/llvm/lib/Target/RISCV/RISCVVectorPeephole.cpp b/llvm/lib/Target/RISCV/RISCVVectorPeephole.cpp
index 026e0a365b8dcb..e6ea8c41269ca3 100644
--- a/llvm/lib/Target/RISCV/RISCVVectorPeephole.cpp
+++ b/llvm/lib/Target/RISCV/RISCVVectorPeephole.cpp
@@ -61,7 +61,7 @@ class RISCVVectorPeephole : public MachineFunctionPass {
}
private:
- bool tryToReduceVL(MachineInstr &MI) const;
+ bool tryToReduceVL(MachineInstr &MI);
bool convertToVLMAX(MachineInstr &MI) const;
bool convertToWholeRegister(MachineInstr &MI) const;
bool convertToUnmasked(MachineInstr &MI) const;
@@ -71,7 +71,7 @@ class RISCVVectorPeephole : public MachineFunctionPass {
bool isAllOnesMask(const MachineInstr *MaskDef) const;
std::optional<unsigned> getConstant(const MachineOperand &VL) const;
- bool ensureDominates(const MachineOperand &Use, MachineInstr &Src) const;
+ bool ensureDominates(const MachineOperand &Use, MachineInstr &Src);
/// Maps uses of V0 to the corresponding def of V0.
DenseMap<const MachineInstr *, const MachineInstr *> V0Defs;
@@ -107,7 +107,7 @@ static unsigned getSEWLMULRatio(const MachineInstr &MI) {
// Attempt to reduce the VL of an instruction whose sole use is feeding a
// instruction with a narrower VL. This currently works backwards from the
// user instruction (which might have a smaller VL).
-bool RISCVVectorPeephole::tryToReduceVL(MachineInstr &MI) const {
+bool RISCVVectorPeephole::tryToReduceVL(MachineInstr &MI) {
// Note that the goal here is a bit multifaceted.
// 1) For store's reducing the VL of the value being stored may help to
// reduce VL toggles. This is somewhat of an artifact of the fact we
@@ -457,7 +457,7 @@ static bool dominates(MachineBasicBlock::const_iterator A,
/// does. Returns false if doesn't dominate and we can't move. \p MO must be in
/// the same basic block as \Src.
bool RISCVVectorPeephole::ensureDominates(const MachineOperand &MO,
- MachineInstr &Src) const {
+ MachineInstr &Src) {
assert(MO.getParent()->getParent() == Src.getParent());
if (!MO.isReg() || MO.getReg() == RISCV::NoRegister)
return true;
@@ -466,7 +466,7 @@ bool RISCVVectorPeephole::ensureDominates(const MachineOperand &MO,
if (Def->getParent() == Src.getParent() && !dominates(Def, Src)) {
if (!isSafeToMove(Src, *Def->getNextNode()))
return false;
- // FIXME: Update V0Defs
+ V0Defs[&Src] = V0Defs[Def];
Src.moveBefore(Def->getNextNode());
}
|
I just want to point out that I agree with the discussion in https://github.com/llvm/llvm-project/pull/106753/files#r1739182733, it's probably worthwhile looking into a smaller mapping in a future PR, e.g. only store instructions that use V0. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
For anyone curious, I was initially concerned about the case where Def might change V0 itself, but realized that's prevented by the safe to move check.
Argh, now that you mention it this should actually be setting the definition to the one at Def->getNextNode(). |
Yes, but per the reasoning in my comment those must be the same. :) |
This fixes #107950 and adds a test case for it. The issue was due to us incorrectly assuming that we stored a V0Defs entry for every single instruction. We actually only store them for instructions that use V0, so when we updated the V0Def after moving we sometimes ended up copying nullptr over from an instruction that doesn't use V0 and clearing the V0Def entry inadvertently. Because we don't have V0Defs on instructions that don't use V0, the FIXME was never actually needed in the first place since the bookkeeping wasn't out of sync to begin with. That commit also mentioned that a future unmasked to masked pseudo peephole might need unmasked pseudos to have V0Defs entries, but after working on this locally it turns out we don't. This reverts commit ce36480.
If we move a pseudo in tryReduceVL or foldVMV_V_V via ensureDominates, its V0 definition may have changed so we need to update V0Defs.
This shouldn't have any functional change today since any pseudo which uses V0 won't be able to move past a new definition.
However this will matter if we add a peephole to convert unmasked pseudos to masked pseudos and add a use of V0.