[VPlan] Fix VPTypeAnalysis cache clobbering in EVL transform #120252

lukel97 · 2024-12-17T15:54:20Z

When building SPEC CPU 2017 with RISC-V and EVL tail folding, this assertion in VPTypeAnalysis would trigger during the transformation to EVL recipes:

llvm-project/llvm/lib/Transforms/Vectorize/VPlanAnalysis.cpp

Lines 135 to 142 in d8a0709

    
           Type *VPTypeAnalysis::inferScalarTypeForRecipe(const VPWidenSelectRecipe *R) { 
        
             Type *ResTy = inferScalarType(R->getOperand(1)); 
        
             VPValue *OtherV = R->getOperand(2); 
        
             assert(inferScalarType(OtherV) == ResTy && 
        
                    "different types inferred for different operands"); 
        
             CachedTypes[OtherV] = ResTy; 
        
             return ResTy; 
        
           }

It was caused by this recipe:

WIDEN ir<%shr> = vp.or ir<%add33>, ir<0>, vp<%6>

Having its type inferred as i16, when ir<%add33> and ir<0> had inferred types of i32 somehow.

The cause of this turned out to be because the VPTypeAnalysis cache was getting clobbered: In this transform we were erasing recipes but keeping around the same mapping from VPValue* to Type*. In the meantime, new recipes would be created which would have the same address as the old value. They would then incorrectly get the old erased VPValue*'s cached type:

  --- before ---
  0x600001ec5030: WIDEN ir<%mul21.neg> = vp.mul vp<%11>, ir<0>, vp<%6>
  0x600001ec5450: <badref> <- some value that was erased
  --- after ---
  0x600001ec5030: WIDEN ir<%mul21.neg> = vp.mul vp<%11>, ir<0>, vp<%6>
  0x600001ec5450: WIDEN ir<%shr> = vp.or ir<%add33>, ir<0>, vp<%6>  <- a new value that happens to have the same address

This fixes this by deferring the erasing of recipes till after the transformation.

The test case might be a bit flakey since it just happens to have the right conditions to recreate this. I tried to add an assert in inferScalarType that every VPValue in the cache was valid, but couldn't find a way of telling if a VPValue had been erased.

llvmbot · 2024-12-17T15:54:59Z

@llvm/pr-subscribers-llvm-transforms

Author: Luke Lau (lukel97)

Changes

When building SPEC CPU 2017 with RISC-V and EVL tail folding, this assertion in VPTypeAnalysis would trigger during the transformation to EVL recipes:

llvm-project/llvm/lib/Transforms/Vectorize/VPlanAnalysis.cpp

Lines 135 to 142 in d8a0709

    
           Type *VPTypeAnalysis::inferScalarTypeForRecipe(const VPWidenSelectRecipe *R) { 
        
             Type *ResTy = inferScalarType(R->getOperand(1)); 
        
             VPValue *OtherV = R->getOperand(2); 
        
             assert(inferScalarType(OtherV) == ResTy && 
        
                    "different types inferred for different operands"); 
        
             CachedTypes[OtherV] = ResTy; 
        
             return ResTy; 
        
           }

It was caused by this recipe:

WIDEN ir&lt;%shr&gt; = vp.or ir&lt;%add33&gt;, ir&lt;0&gt;, vp&lt;%6&gt;

Having its type inferred as i16, when ir<%add33> and ir<0> had inferred types of i32 somehow.

The cause of this turned out to be because the VPTypeAnalysis cache was getting clobbered: In this transform we were erasing recipes but keeping around the same mapping from VPValue* to Type*. In the meantime, new recipes would be created which would have the same address as the old value. They would then incorrectly get the old erased VPValue*'s cached type:

  --- before ---
  0x600001ec5030: WIDEN ir&lt;%mul21.neg&gt; = vp.mul vp&lt;%11&gt;, ir&lt;0&gt;, vp&lt;%6&gt;
  0x600001ec5450: &lt;badref&gt; &lt;- some value that was erased
  --- after ---
  0x600001ec5030: WIDEN ir&lt;%mul21.neg&gt; = vp.mul vp&lt;%11&gt;, ir&lt;0&gt;, vp&lt;%6&gt;
  0x600001ec5450: WIDEN ir&lt;%shr&gt; = vp.or ir&lt;%add33&gt;, ir&lt;0&gt;, vp&lt;%6&gt;  &lt;- a new value that happens to have the same address

This fixes this by not sharing the VPTypeAnalysis across iterations of the loop, i.e. clearing the cache after anything is erased.

The test case might be a bit flakey since it just happens to have the right conditions to recreate this. I tried to add an assert in inferScalarType that every VPValue in the cache was valid, but couldn't find a way of telling if a VPValue had been erased.

Another alternative is that we could maybe add a method to remove specific VPValues from the cache, but it looks like recursivelyDeleteDeadRecipes might also erase other recipes too.

Full diff: https://github.com/llvm/llvm-project/pull/120252.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp (+4-1)
(added) llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll (+121)

diff --git a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
index 9a3b82fe57c12a..bfff2504c393b4 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
@@ -1446,7 +1446,6 @@ void VPlanTransforms::addActiveLaneMask(
 static void transformRecipestoEVLRecipes(VPlan &Plan, VPValue &EVL) {
   using namespace llvm::VPlanPatternMatch;
   Type *CanonicalIVType = Plan.getCanonicalIV()->getScalarType();
-  VPTypeAnalysis TypeInfo(CanonicalIVType);
   LLVMContext &Ctx = CanonicalIVType->getContext();
   SmallVector<VPValue *> HeaderMasks = collectAllHeaderMasks(Plan);
 
@@ -1463,6 +1462,10 @@ static void transformRecipestoEVLRecipes(VPlan &Plan, VPValue &EVL) {
         return HeaderMask == OrigMask ? nullptr : OrigMask;
       };
 
+      // Don't preseve the type info cache across loop iterations since
+      // CurRecipe will be erased and invalidate it.
+      VPTypeAnalysis TypeInfo(CanonicalIVType);
+
       VPRecipeBase *NewRecipe =
           TypeSwitch<VPRecipeBase *, VPRecipeBase *>(CurRecipe)
               .Case<VPWidenLoadRecipe>([&](VPWidenLoadRecipe *L) {
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll b/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
new file mode 100644
index 00000000000000..4ab129d8b3ac39
--- /dev/null
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
@@ -0,0 +1,121 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S %s | FileCheck %s
+
+; This test tries to recreate the conditions for a crash that occurred when the
+; VPTypeAnalysis cache wasn't cleared after a recipe was erased and clobbered
+; with a new one.
+
+define void @type_info_cache_clobber(ptr %dstv, ptr %src, i64 %wide.trip.count) {
+; CHECK-LABEL: define void @type_info_invalid_cache(
+; CHECK-SAME: ptr [[DSTV:%.*]], ptr [[SRC:%.*]], i64 [[WIDE_TRIP_COUNT:%.*]]) #[[ATTR0:[0-9]+]] {
+; CHECK-NEXT:  [[ENTRY:.*]]:
+; CHECK-NEXT:    [[TMP0:%.*]] = add i64 [[WIDE_TRIP_COUNT]], 1
+; CHECK-NEXT:    [[TMP1:%.*]] = sub i64 -1, [[TMP0]]
+; CHECK-NEXT:    [[TMP2:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP3:%.*]] = mul i64 [[TMP2]], 8
+; CHECK-NEXT:    [[TMP4:%.*]] = icmp ult i64 [[TMP1]], [[TMP3]]
+; CHECK-NEXT:    br i1 [[TMP4]], label %[[SCALAR_PH:.*]], label %[[VECTOR_MEMCHECK:.*]]
+; CHECK:       [[VECTOR_MEMCHECK]]:
+; CHECK-NEXT:    [[SCEVGEP:%.*]] = getelementptr i8, ptr [[DSTV]], i64 1
+; CHECK-NEXT:    [[TMP5:%.*]] = add i64 [[WIDE_TRIP_COUNT]], 1
+; CHECK-NEXT:    [[SCEVGEP1:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[TMP5]]
+; CHECK-NEXT:    [[BOUND0:%.*]] = icmp ult ptr [[DSTV]], [[SCEVGEP1]]
+; CHECK-NEXT:    [[BOUND1:%.*]] = icmp ult ptr [[SRC]], [[SCEVGEP]]
+; CHECK-NEXT:    [[FOUND_CONFLICT:%.*]] = and i1 [[BOUND0]], [[BOUND1]]
+; CHECK-NEXT:    br i1 [[FOUND_CONFLICT]], label %[[SCALAR_PH]], label %[[VECTOR_PH:.*]]
+; CHECK:       [[VECTOR_PH]]:
+; CHECK-NEXT:    [[TMP6:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP7:%.*]] = mul i64 [[TMP6]], 8
+; CHECK-NEXT:    [[TMP8:%.*]] = sub i64 [[TMP7]], 1
+; CHECK-NEXT:    [[N_RND_UP:%.*]] = add i64 [[TMP0]], [[TMP8]]
+; CHECK-NEXT:    [[N_MOD_VF:%.*]] = urem i64 [[N_RND_UP]], [[TMP7]]
+; CHECK-NEXT:    [[N_VEC:%.*]] = sub i64 [[N_RND_UP]], [[N_MOD_VF]]
+; CHECK-NEXT:    [[TMP9:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP10:%.*]] = mul i64 [[TMP9]], 8
+; CHECK-NEXT:    [[BROADCAST_SPLATINSERT:%.*]] = insertelement <vscale x 8 x ptr> poison, ptr [[DSTV]], i64 0
+; CHECK-NEXT:    [[BROADCAST_SPLAT:%.*]] = shufflevector <vscale x 8 x ptr> [[BROADCAST_SPLATINSERT]], <vscale x 8 x ptr> poison, <vscale x 8 x i32> zeroinitializer
+; CHECK-NEXT:    br label %[[VECTOR_BODY:.*]]
+; CHECK:       [[VECTOR_BODY]]:
+; CHECK-NEXT:    [[INDEX:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[EVL_BASED_IV:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[AVL:%.*]] = sub i64 [[TMP0]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[TMP11:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 8, i1 true)
+; CHECK-NEXT:    [[TMP12:%.*]] = add i64 [[EVL_BASED_IV]], 0
+; CHECK-NEXT:    [[TMP13:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[TMP12]]
+; CHECK-NEXT:    [[TMP14:%.*]] = getelementptr i8, ptr [[TMP13]], i32 0
+; CHECK-NEXT:    [[VP_OP_LOAD:%.*]] = call <vscale x 8 x i8> @llvm.vp.load.nxv8i8.p0(ptr align 1 [[TMP14]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]]), !alias.scope [[META0:![0-9]+]]
+; CHECK-NEXT:    [[TMP15:%.*]] = call <vscale x 8 x i32> @llvm.vp.zext.nxv8i32.nxv8i8(<vscale x 8 x i8> [[VP_OP_LOAD]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP:%.*]] = call <vscale x 8 x i32> @llvm.vp.mul.nxv8i32(<vscale x 8 x i32> [[TMP15]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP2:%.*]] = call <vscale x 8 x i32> @llvm.vp.ashr.nxv8i32(<vscale x 8 x i32> [[TMP15]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP3:%.*]] = call <vscale x 8 x i32> @llvm.vp.or.nxv8i32(<vscale x 8 x i32> [[VP_OP2]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP16:%.*]] = icmp ult <vscale x 8 x i32> [[TMP15]], zeroinitializer
+; CHECK-NEXT:    [[TMP17:%.*]] = call <vscale x 8 x i32> @llvm.vp.select.nxv8i32(<vscale x 8 x i1> [[TMP16]], <vscale x 8 x i32> [[VP_OP3]], <vscale x 8 x i32> zeroinitializer, i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP18:%.*]] = call <vscale x 8 x i8> @llvm.vp.trunc.nxv8i8.nxv8i32(<vscale x 8 x i32> [[TMP17]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    call void @llvm.vp.scatter.nxv8i8.nxv8p0(<vscale x 8 x i8> [[TMP18]], <vscale x 8 x ptr> align 1 [[BROADCAST_SPLAT]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]]), !alias.scope [[META3:![0-9]+]], !noalias [[META0]]
+; CHECK-NEXT:    [[TMP19:%.*]] = call <vscale x 8 x i16> @llvm.vp.trunc.nxv8i16.nxv8i32(<vscale x 8 x i32> [[VP_OP]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    call void @llvm.vp.scatter.nxv8i16.nxv8p0(<vscale x 8 x i16> [[TMP19]], <vscale x 8 x ptr> align 2 zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP20:%.*]] = zext i32 [[TMP11]] to i64
+; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP20]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], [[TMP10]]
+; CHECK-NEXT:    [[TMP21:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
+; CHECK-NEXT:    br i1 [[TMP21]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP5:![0-9]+]]
+; CHECK:       [[MIDDLE_BLOCK]]:
+; CHECK-NEXT:    br i1 true, label %[[EXIT:.*]], label %[[SCALAR_PH]]
+; CHECK:       [[SCALAR_PH]]:
+; CHECK-NEXT:    [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
+; CHECK-NEXT:    br label %[[LOOP:.*]]
+; CHECK:       [[LOOP]]:
+; CHECK-NEXT:    [[IV:%.*]] = phi i64 [ [[BC_RESUME_VAL]], %[[SCALAR_PH]] ], [ [[IV_NEXT:%.*]], %[[LOOP]] ]
+; CHECK-NEXT:    [[ARRAYIDX13:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[IV]]
+; CHECK-NEXT:    [[TMP22:%.*]] = load i8, ptr [[ARRAYIDX13]], align 1
+; CHECK-NEXT:    [[CONV14:%.*]] = zext i8 [[TMP22]] to i32
+; CHECK-NEXT:    [[MUL21_NEG:%.*]] = mul i32 [[CONV14]], 0
+; CHECK-NEXT:    [[ADD33:%.*]] = ashr i32 [[CONV14]], 0
+; CHECK-NEXT:    [[SHR:%.*]] = or i32 [[ADD33]], 0
+; CHECK-NEXT:    [[TOBOOL_NOT_I:%.*]] = icmp ult i32 [[CONV14]], 0
+; CHECK-NEXT:    [[COND_I:%.*]] = select i1 [[TOBOOL_NOT_I]], i32 [[SHR]], i32 0
+; CHECK-NEXT:    [[CONV_I:%.*]] = trunc i32 [[COND_I]] to i8
+; CHECK-NEXT:    store i8 [[CONV_I]], ptr [[DSTV]], align 1
+; CHECK-NEXT:    [[CONV36:%.*]] = trunc i32 [[MUL21_NEG]] to i16
+; CHECK-NEXT:    store i16 [[CONV36]], ptr null, align 2
+; CHECK-NEXT:    [[IV_NEXT]] = add i64 [[IV]], 1
+; CHECK-NEXT:    [[EC:%.*]] = icmp eq i64 [[IV]], [[WIDE_TRIP_COUNT]]
+; CHECK-NEXT:    br i1 [[EC]], label %[[EXIT]], label %[[LOOP]], !llvm.loop [[LOOP8:![0-9]+]]
+; CHECK:       [[EXIT]]:
+; CHECK-NEXT:    ret void
+;
+entry:
+  br label %loop
+
+loop:
+  %iv = phi i64 [ 0, %entry ], [ %iv.next, %loop ]
+  %arrayidx13 = getelementptr i8, ptr %src, i64 %iv
+  %0 = load i8, ptr %arrayidx13, align 1
+  %conv14 = zext i8 %0 to i32
+  %mul21.neg = mul i32 %conv14, 0
+  %add33 = ashr i32 %conv14, 0
+  %shr = or i32 %add33, 0
+  %tobool.not.i = icmp ult i32 %conv14, 0
+  %cond.i = select i1 %tobool.not.i, i32 %shr, i32 0
+  %conv.i = trunc i32 %cond.i to i8
+  store i8 %conv.i, ptr %dstv, align 1
+  %conv36 = trunc i32 %mul21.neg to i16
+  store i16 %conv36, ptr null, align 2
+  %iv.next = add i64 %iv, 1
+  %ec = icmp eq i64 %iv, %wide.trip.count
+  br i1 %ec, label %exit, label %loop
+
+exit:
+  ret void
+}
+;.
+; CHECK: [[META0]] = !{[[META1:![0-9]+]]}
+; CHECK: [[META1]] = distinct !{[[META1]], [[META2:![0-9]+]]}
+; CHECK: [[META2]] = distinct !{[[META2]], !"LVerDomain"}
+; CHECK: [[META3]] = !{[[META4:![0-9]+]]}
+; CHECK: [[META4]] = distinct !{[[META4]], [[META2]]}
+; CHECK: [[LOOP5]] = distinct !{[[LOOP5]], [[META6:![0-9]+]], [[META7:![0-9]+]]}
+; CHECK: [[META6]] = !{!"llvm.loop.isvectorized", i32 1}
+; CHECK: [[META7]] = !{!"llvm.loop.unroll.runtime.disable"}
+; CHECK: [[LOOP8]] = distinct !{[[LOOP8]], [[META6]]}
+;.

llvmbot · 2024-12-17T15:55:00Z

@llvm/pr-subscribers-vectorizers

Author: Luke Lau (lukel97)

Changes

When building SPEC CPU 2017 with RISC-V and EVL tail folding, this assertion in VPTypeAnalysis would trigger during the transformation to EVL recipes:

llvm-project/llvm/lib/Transforms/Vectorize/VPlanAnalysis.cpp

Lines 135 to 142 in d8a0709

    
           Type *VPTypeAnalysis::inferScalarTypeForRecipe(const VPWidenSelectRecipe *R) { 
        
             Type *ResTy = inferScalarType(R->getOperand(1)); 
        
             VPValue *OtherV = R->getOperand(2); 
        
             assert(inferScalarType(OtherV) == ResTy && 
        
                    "different types inferred for different operands"); 
        
             CachedTypes[OtherV] = ResTy; 
        
             return ResTy; 
        
           }

It was caused by this recipe:

WIDEN ir&lt;%shr&gt; = vp.or ir&lt;%add33&gt;, ir&lt;0&gt;, vp&lt;%6&gt;

Having its type inferred as i16, when ir<%add33> and ir<0> had inferred types of i32 somehow.

The cause of this turned out to be because the VPTypeAnalysis cache was getting clobbered: In this transform we were erasing recipes but keeping around the same mapping from VPValue* to Type*. In the meantime, new recipes would be created which would have the same address as the old value. They would then incorrectly get the old erased VPValue*'s cached type:

  --- before ---
  0x600001ec5030: WIDEN ir&lt;%mul21.neg&gt; = vp.mul vp&lt;%11&gt;, ir&lt;0&gt;, vp&lt;%6&gt;
  0x600001ec5450: &lt;badref&gt; &lt;- some value that was erased
  --- after ---
  0x600001ec5030: WIDEN ir&lt;%mul21.neg&gt; = vp.mul vp&lt;%11&gt;, ir&lt;0&gt;, vp&lt;%6&gt;
  0x600001ec5450: WIDEN ir&lt;%shr&gt; = vp.or ir&lt;%add33&gt;, ir&lt;0&gt;, vp&lt;%6&gt;  &lt;- a new value that happens to have the same address

This fixes this by not sharing the VPTypeAnalysis across iterations of the loop, i.e. clearing the cache after anything is erased.

The test case might be a bit flakey since it just happens to have the right conditions to recreate this. I tried to add an assert in inferScalarType that every VPValue in the cache was valid, but couldn't find a way of telling if a VPValue had been erased.

Another alternative is that we could maybe add a method to remove specific VPValues from the cache, but it looks like recursivelyDeleteDeadRecipes might also erase other recipes too.

Full diff: https://github.com/llvm/llvm-project/pull/120252.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp (+4-1)
(added) llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll (+121)

diff --git a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
index 9a3b82fe57c12a..bfff2504c393b4 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp
@@ -1446,7 +1446,6 @@ void VPlanTransforms::addActiveLaneMask(
 static void transformRecipestoEVLRecipes(VPlan &Plan, VPValue &EVL) {
   using namespace llvm::VPlanPatternMatch;
   Type *CanonicalIVType = Plan.getCanonicalIV()->getScalarType();
-  VPTypeAnalysis TypeInfo(CanonicalIVType);
   LLVMContext &Ctx = CanonicalIVType->getContext();
   SmallVector<VPValue *> HeaderMasks = collectAllHeaderMasks(Plan);
 
@@ -1463,6 +1462,10 @@ static void transformRecipestoEVLRecipes(VPlan &Plan, VPValue &EVL) {
         return HeaderMask == OrigMask ? nullptr : OrigMask;
       };
 
+      // Don't preseve the type info cache across loop iterations since
+      // CurRecipe will be erased and invalidate it.
+      VPTypeAnalysis TypeInfo(CanonicalIVType);
+
       VPRecipeBase *NewRecipe =
           TypeSwitch<VPRecipeBase *, VPRecipeBase *>(CurRecipe)
               .Case<VPWidenLoadRecipe>([&](VPWidenLoadRecipe *L) {
diff --git a/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll b/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
new file mode 100644
index 00000000000000..4ab129d8b3ac39
--- /dev/null
+++ b/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
@@ -0,0 +1,121 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S %s | FileCheck %s
+
+; This test tries to recreate the conditions for a crash that occurred when the
+; VPTypeAnalysis cache wasn't cleared after a recipe was erased and clobbered
+; with a new one.
+
+define void @type_info_cache_clobber(ptr %dstv, ptr %src, i64 %wide.trip.count) {
+; CHECK-LABEL: define void @type_info_invalid_cache(
+; CHECK-SAME: ptr [[DSTV:%.*]], ptr [[SRC:%.*]], i64 [[WIDE_TRIP_COUNT:%.*]]) #[[ATTR0:[0-9]+]] {
+; CHECK-NEXT:  [[ENTRY:.*]]:
+; CHECK-NEXT:    [[TMP0:%.*]] = add i64 [[WIDE_TRIP_COUNT]], 1
+; CHECK-NEXT:    [[TMP1:%.*]] = sub i64 -1, [[TMP0]]
+; CHECK-NEXT:    [[TMP2:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP3:%.*]] = mul i64 [[TMP2]], 8
+; CHECK-NEXT:    [[TMP4:%.*]] = icmp ult i64 [[TMP1]], [[TMP3]]
+; CHECK-NEXT:    br i1 [[TMP4]], label %[[SCALAR_PH:.*]], label %[[VECTOR_MEMCHECK:.*]]
+; CHECK:       [[VECTOR_MEMCHECK]]:
+; CHECK-NEXT:    [[SCEVGEP:%.*]] = getelementptr i8, ptr [[DSTV]], i64 1
+; CHECK-NEXT:    [[TMP5:%.*]] = add i64 [[WIDE_TRIP_COUNT]], 1
+; CHECK-NEXT:    [[SCEVGEP1:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[TMP5]]
+; CHECK-NEXT:    [[BOUND0:%.*]] = icmp ult ptr [[DSTV]], [[SCEVGEP1]]
+; CHECK-NEXT:    [[BOUND1:%.*]] = icmp ult ptr [[SRC]], [[SCEVGEP]]
+; CHECK-NEXT:    [[FOUND_CONFLICT:%.*]] = and i1 [[BOUND0]], [[BOUND1]]
+; CHECK-NEXT:    br i1 [[FOUND_CONFLICT]], label %[[SCALAR_PH]], label %[[VECTOR_PH:.*]]
+; CHECK:       [[VECTOR_PH]]:
+; CHECK-NEXT:    [[TMP6:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP7:%.*]] = mul i64 [[TMP6]], 8
+; CHECK-NEXT:    [[TMP8:%.*]] = sub i64 [[TMP7]], 1
+; CHECK-NEXT:    [[N_RND_UP:%.*]] = add i64 [[TMP0]], [[TMP8]]
+; CHECK-NEXT:    [[N_MOD_VF:%.*]] = urem i64 [[N_RND_UP]], [[TMP7]]
+; CHECK-NEXT:    [[N_VEC:%.*]] = sub i64 [[N_RND_UP]], [[N_MOD_VF]]
+; CHECK-NEXT:    [[TMP9:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP10:%.*]] = mul i64 [[TMP9]], 8
+; CHECK-NEXT:    [[BROADCAST_SPLATINSERT:%.*]] = insertelement <vscale x 8 x ptr> poison, ptr [[DSTV]], i64 0
+; CHECK-NEXT:    [[BROADCAST_SPLAT:%.*]] = shufflevector <vscale x 8 x ptr> [[BROADCAST_SPLATINSERT]], <vscale x 8 x ptr> poison, <vscale x 8 x i32> zeroinitializer
+; CHECK-NEXT:    br label %[[VECTOR_BODY:.*]]
+; CHECK:       [[VECTOR_BODY]]:
+; CHECK-NEXT:    [[INDEX:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[EVL_BASED_IV:%.*]] = phi i64 [ 0, %[[VECTOR_PH]] ], [ [[INDEX_EVL_NEXT:%.*]], %[[VECTOR_BODY]] ]
+; CHECK-NEXT:    [[AVL:%.*]] = sub i64 [[TMP0]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[TMP11:%.*]] = call i32 @llvm.experimental.get.vector.length.i64(i64 [[AVL]], i32 8, i1 true)
+; CHECK-NEXT:    [[TMP12:%.*]] = add i64 [[EVL_BASED_IV]], 0
+; CHECK-NEXT:    [[TMP13:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[TMP12]]
+; CHECK-NEXT:    [[TMP14:%.*]] = getelementptr i8, ptr [[TMP13]], i32 0
+; CHECK-NEXT:    [[VP_OP_LOAD:%.*]] = call <vscale x 8 x i8> @llvm.vp.load.nxv8i8.p0(ptr align 1 [[TMP14]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]]), !alias.scope [[META0:![0-9]+]]
+; CHECK-NEXT:    [[TMP15:%.*]] = call <vscale x 8 x i32> @llvm.vp.zext.nxv8i32.nxv8i8(<vscale x 8 x i8> [[VP_OP_LOAD]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP:%.*]] = call <vscale x 8 x i32> @llvm.vp.mul.nxv8i32(<vscale x 8 x i32> [[TMP15]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP2:%.*]] = call <vscale x 8 x i32> @llvm.vp.ashr.nxv8i32(<vscale x 8 x i32> [[TMP15]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[VP_OP3:%.*]] = call <vscale x 8 x i32> @llvm.vp.or.nxv8i32(<vscale x 8 x i32> [[VP_OP2]], <vscale x 8 x i32> zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP16:%.*]] = icmp ult <vscale x 8 x i32> [[TMP15]], zeroinitializer
+; CHECK-NEXT:    [[TMP17:%.*]] = call <vscale x 8 x i32> @llvm.vp.select.nxv8i32(<vscale x 8 x i1> [[TMP16]], <vscale x 8 x i32> [[VP_OP3]], <vscale x 8 x i32> zeroinitializer, i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP18:%.*]] = call <vscale x 8 x i8> @llvm.vp.trunc.nxv8i8.nxv8i32(<vscale x 8 x i32> [[TMP17]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    call void @llvm.vp.scatter.nxv8i8.nxv8p0(<vscale x 8 x i8> [[TMP18]], <vscale x 8 x ptr> align 1 [[BROADCAST_SPLAT]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]]), !alias.scope [[META3:![0-9]+]], !noalias [[META0]]
+; CHECK-NEXT:    [[TMP19:%.*]] = call <vscale x 8 x i16> @llvm.vp.trunc.nxv8i16.nxv8i32(<vscale x 8 x i32> [[VP_OP]], <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    call void @llvm.vp.scatter.nxv8i16.nxv8p0(<vscale x 8 x i16> [[TMP19]], <vscale x 8 x ptr> align 2 zeroinitializer, <vscale x 8 x i1> splat (i1 true), i32 [[TMP11]])
+; CHECK-NEXT:    [[TMP20:%.*]] = zext i32 [[TMP11]] to i64
+; CHECK-NEXT:    [[INDEX_EVL_NEXT]] = add i64 [[TMP20]], [[EVL_BASED_IV]]
+; CHECK-NEXT:    [[INDEX_NEXT]] = add i64 [[INDEX]], [[TMP10]]
+; CHECK-NEXT:    [[TMP21:%.*]] = icmp eq i64 [[INDEX_NEXT]], [[N_VEC]]
+; CHECK-NEXT:    br i1 [[TMP21]], label %[[MIDDLE_BLOCK:.*]], label %[[VECTOR_BODY]], !llvm.loop [[LOOP5:![0-9]+]]
+; CHECK:       [[MIDDLE_BLOCK]]:
+; CHECK-NEXT:    br i1 true, label %[[EXIT:.*]], label %[[SCALAR_PH]]
+; CHECK:       [[SCALAR_PH]]:
+; CHECK-NEXT:    [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
+; CHECK-NEXT:    br label %[[LOOP:.*]]
+; CHECK:       [[LOOP]]:
+; CHECK-NEXT:    [[IV:%.*]] = phi i64 [ [[BC_RESUME_VAL]], %[[SCALAR_PH]] ], [ [[IV_NEXT:%.*]], %[[LOOP]] ]
+; CHECK-NEXT:    [[ARRAYIDX13:%.*]] = getelementptr i8, ptr [[SRC]], i64 [[IV]]
+; CHECK-NEXT:    [[TMP22:%.*]] = load i8, ptr [[ARRAYIDX13]], align 1
+; CHECK-NEXT:    [[CONV14:%.*]] = zext i8 [[TMP22]] to i32
+; CHECK-NEXT:    [[MUL21_NEG:%.*]] = mul i32 [[CONV14]], 0
+; CHECK-NEXT:    [[ADD33:%.*]] = ashr i32 [[CONV14]], 0
+; CHECK-NEXT:    [[SHR:%.*]] = or i32 [[ADD33]], 0
+; CHECK-NEXT:    [[TOBOOL_NOT_I:%.*]] = icmp ult i32 [[CONV14]], 0
+; CHECK-NEXT:    [[COND_I:%.*]] = select i1 [[TOBOOL_NOT_I]], i32 [[SHR]], i32 0
+; CHECK-NEXT:    [[CONV_I:%.*]] = trunc i32 [[COND_I]] to i8
+; CHECK-NEXT:    store i8 [[CONV_I]], ptr [[DSTV]], align 1
+; CHECK-NEXT:    [[CONV36:%.*]] = trunc i32 [[MUL21_NEG]] to i16
+; CHECK-NEXT:    store i16 [[CONV36]], ptr null, align 2
+; CHECK-NEXT:    [[IV_NEXT]] = add i64 [[IV]], 1
+; CHECK-NEXT:    [[EC:%.*]] = icmp eq i64 [[IV]], [[WIDE_TRIP_COUNT]]
+; CHECK-NEXT:    br i1 [[EC]], label %[[EXIT]], label %[[LOOP]], !llvm.loop [[LOOP8:![0-9]+]]
+; CHECK:       [[EXIT]]:
+; CHECK-NEXT:    ret void
+;
+entry:
+  br label %loop
+
+loop:
+  %iv = phi i64 [ 0, %entry ], [ %iv.next, %loop ]
+  %arrayidx13 = getelementptr i8, ptr %src, i64 %iv
+  %0 = load i8, ptr %arrayidx13, align 1
+  %conv14 = zext i8 %0 to i32
+  %mul21.neg = mul i32 %conv14, 0
+  %add33 = ashr i32 %conv14, 0
+  %shr = or i32 %add33, 0
+  %tobool.not.i = icmp ult i32 %conv14, 0
+  %cond.i = select i1 %tobool.not.i, i32 %shr, i32 0
+  %conv.i = trunc i32 %cond.i to i8
+  store i8 %conv.i, ptr %dstv, align 1
+  %conv36 = trunc i32 %mul21.neg to i16
+  store i16 %conv36, ptr null, align 2
+  %iv.next = add i64 %iv, 1
+  %ec = icmp eq i64 %iv, %wide.trip.count
+  br i1 %ec, label %exit, label %loop
+
+exit:
+  ret void
+}
+;.
+; CHECK: [[META0]] = !{[[META1:![0-9]+]]}
+; CHECK: [[META1]] = distinct !{[[META1]], [[META2:![0-9]+]]}
+; CHECK: [[META2]] = distinct !{[[META2]], !"LVerDomain"}
+; CHECK: [[META3]] = !{[[META4:![0-9]+]]}
+; CHECK: [[META4]] = distinct !{[[META4]], [[META2]]}
+; CHECK: [[LOOP5]] = distinct !{[[LOOP5]], [[META6:![0-9]+]], [[META7:![0-9]+]]}
+; CHECK: [[META6]] = !{!"llvm.loop.isvectorized", i32 1}
+; CHECK: [[META7]] = !{!"llvm.loop.unroll.runtime.disable"}
+; CHECK: [[LOOP8]] = distinct !{[[LOOP8]], [[META6]]}
+;.

alexey-bataev · 2024-12-17T16:48:57Z

How about alternative approach - actual deletion of the dead recipes after the transformation?

fhahn · 2024-12-17T16:53:18Z

How about alternative approach - actual deletion of the dead recipes after the transformation?

Yes that would probably be good.

lukel97 · 2024-12-17T17:00:02Z

How about alternative approach - actual deletion of the dead recipes after the transformation?

Yeah I had considered that but I thought recursivelyDeleteDeadRecipes might have made it a bit tricky since it erases recipes in flight. But I don't see why we couldn't also just run that once at the end instead of after each header mask. I'll give that a try

fhahn · 2024-12-17T17:01:36Z

How about alternative approach - actual deletion of the dead recipes after the transformation?

Yeah I had considered that but I thought recursivelyDeleteDeadRecipes might have made it a bit tricky since it erases recipes in flight. But I don't see why we couldn't also just run that once at the end instead of after each header mask. I'll give that a try

You probably need to collect the recipes to remove, as some of the replaced recipes won't be considered dead (e.g. store recipes)

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

fhahn · 2024-12-17T17:11:36Z

llvm/lib/Transforms/Vectorize/VPlanTransforms.cpp

    }
-    recursivelyDeleteDeadRecipes(HeaderMask);


Can you add HeaderMask to be erased here?

Is it possible that HeaderMask might still be used by a recipe if it can't be converted to an EVL recipe?

In that case, maybe just recursively delete the recipes to erase, which then will automatically clean up the header mask if it becomes dead?

All recipes you are queuing should be VPSingelDefRecipe*

for (VPRecipeBase *R : ToErase) recursivelyDeleteDeadRecipes(R);

Some of the recipes are stores with zero values, but I was able to copy the trick from optimizeForVFAndUF where you erase the recipe and then call recursivelyDeleteDeadRecipes on its operands

Co-authored-by: Florian Hahn <[email protected]>

fhahn

LGTM, thanks!

llvm-ci · 2024-12-18T03:40:28Z

LLVM Buildbot has detected a new failure on builder ml-opt-dev-x86-64 running on ml-opt-dev-x86-64-b1 while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/137/builds/10510

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /b/ml-opt-dev-x86-64-b1/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /b/ml-opt-dev-x86-64-b1/build/bin/FileCheck /b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/ml-opt-dev-x86-64-b1/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/ml-opt-dev-x86-64-b1/build/bin/FileCheck /b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /b/ml-opt-dev-x86-64-b1/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

llvm-ci · 2024-12-18T04:26:41Z

LLVM Buildbot has detected a new failure on builder llvm-clang-x86_64-expensive-checks-debian running on gribozavr4 while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/16/builds/10886

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bin/FileCheck /b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/llvm-clang-x86_64-expensive-checks-debian/build/bin/FileCheck /b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /b/1/llvm-clang-x86_64-expensive-checks-debian/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

llvm-ci · 2024-12-18T04:33:26Z

LLVM Buildbot has detected a new failure on builder clang-x86_64-debian-fast running on gribozavr4 while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/56/builds/14764

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /b/1/clang-x86_64-debian-fast/llvm.obj/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /b/1/clang-x86_64-debian-fast/llvm.obj/bin/FileCheck /b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/clang-x86_64-debian-fast/llvm.obj/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/clang-x86_64-debian-fast/llvm.obj/bin/FileCheck /b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /b/1/clang-x86_64-debian-fast/llvm.src/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

llvm-ci · 2024-12-18T04:35:46Z

LLVM Buildbot has detected a new failure on builder llvm-x86_64-debian-dylib running on gribozavr4 while building llvm at step 7 "test-build-unified-tree-check-llvm".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/60/builds/15472

Here is the relevant piece of the build log for the reference

Step 7 (test-build-unified-tree-check-llvm) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /b/1/llvm-x86_64-debian-dylib/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /b/1/llvm-x86_64-debian-dylib/build/bin/FileCheck /b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/llvm-x86_64-debian-dylib/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /b/1/llvm-x86_64-debian-dylib/build/bin/FileCheck /b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /b/1/llvm-x86_64-debian-dylib/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

llvm-ci · 2024-12-18T04:37:01Z

LLVM Buildbot has detected a new failure on builder lld-x86_64-ubuntu-fast running on as-builder-4 while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/33/builds/8474

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/build/bin/FileCheck /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/build/bin/FileCheck /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /home/buildbot/worker/as-builder-4/ramdisk/lld-x86_64/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

lukel97 · 2024-12-18T04:47:22Z

Buildbot failures should hopefully be fixed in b1f4a02

llvm-ci · 2024-12-18T05:59:54Z

LLVM Buildbot has detected a new failure on builder premerge-monolithic-linux running on premerge-linux-1 while building llvm at step 7 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/153/builds/17891

Here is the relevant piece of the build log for the reference

Step 7 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 2: /build/buildbot/premerge-monolithic-linux/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll | /build/buildbot/premerge-monolithic-linux/build/bin/FileCheck /build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /build/buildbot/premerge-monolithic-linux/build/bin/opt -passes=loop-vectorize -force-tail-folding-style=data-with-evl -prefer-predicate-over-epilogue=predicate-dont-vectorize -mtriple=riscv64 -mattr=+v -S /build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
+ /build/buildbot/premerge-monolithic-linux/build/bin/FileCheck /build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll
/build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll:65:15: error: CHECK-NEXT: expected string not found in input
; CHECK-NEXT: [[BC_RESUME_VAL:%.*]] = phi i64 [ [[N_VEC]], %[[MIDDLE_BLOCK]] ], [ 0, %[[ENTRY]] ], [ 0, %[[VECTOR_MEMCHECK]] ]
              ^
<stdin>:65:11: note: scanning from here
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "N_VEC" equal to "%n\\.vec"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "MIDDLE_BLOCK" equal to "middle\\.block"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "ENTRY" equal to "entry"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:65:11: note: with "VECTOR_MEMCHECK" equal to "vector\\.memcheck"
scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block
          ^
<stdin>:66:11: note: possible intended match here
 %bc.resume.val = phi i64 [ %n.vec, %middle.block ], [ 0, %vector.memcheck ], [ 0, %entry ]
          ^

Input file: <stdin>
Check file: /build/buildbot/premerge-monolithic-linux/llvm-project/llvm/test/Transforms/LoopVectorize/RISCV/type-info-cache-evl-crash.ll

-dump-input=help explains the following input dump.

Input was:
<<<<<<
           .
           .
           .
          60:  br i1 %21, label %middle.block, label %vector.body, !llvm.loop !5 
          61:  
          62: middle.block: ; preds = %vector.body 
          63:  br i1 true, label %exit, label %scalar.ph 
          64:  
          65: scalar.ph: ; preds = %vector.memcheck, %entry, %middle.block 
next:65'0               X~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match found
next:65'1                                                                   with "N_VEC" equal to "%n\\.vec"
next:65'2                                                                   with "MIDDLE_BLOCK" equal to "middle\\.block"
next:65'3                                                                   with "ENTRY" equal to "entry"
...

Add additional OR simplification to fix a divergence between legacy and VPlan-based cost model. This adds a new m_AllOnes matcher by generalizing specific_intval to int_pred_ty, which takes a predicate to check to support matching both specific APInts and other APInt predices, like isAllOnes. Fixes #131359.

[VPlan] Fix VPTypeAnalysis cache clobbering in EVL transform

ba977f0

lukel97 requested review from fhahn, arcbbb, alexey-bataev, Mel-Chen and LiqinWeng December 17, 2024 15:54

llvmbot added vectorizers llvm:transforms labels Dec 17, 2024

lukel97 mentioned this pull request Dec 17, 2024

[VPlan] Handle VPWidenCastRecipe without underlying value in EVL transform #120194

Merged

lukel97 added 2 commits December 18, 2024 01:07

Fix test name

f152006

Defer deleting recipes till after transformation

e51abd6

fhahn reviewed Dec 17, 2024

View reviewed changes

lukel97 and others added 2 commits December 18, 2024 01:14

Add . to comment

3638810

Co-authored-by: Florian Hahn <[email protected]>

Use recursivelyDeleteDeadRecipes to clear header masks

0e261f3

fhahn approved these changes Dec 17, 2024

View reviewed changes

lukel97 merged commit c2a879e into llvm:main Dec 18, 2024
8 checks passed

lukel97 mentioned this pull request Apr 2, 2025

[VPlan] Process simplifyRecipes via a worklist #133977

Open

	Type VPTypeAnalysis::inferScalarTypeForRecipe(const VPWidenSelectRecipe R) {
	Type *ResTy = inferScalarType(R->getOperand(1));
	VPValue *OtherV = R->getOperand(2);
	assert(inferScalarType(OtherV) == ResTy &&
	"different types inferred for different operands");
	CachedTypes[OtherV] = ResTy;
	return ResTy;
	}

[VPlan] Fix VPTypeAnalysis cache clobbering in EVL transform #120252

[VPlan] Fix VPTypeAnalysis cache clobbering in EVL transform #120252

Uh oh!

Conversation

lukel97 commented Dec 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Dec 17, 2024

Uh oh!

llvmbot commented Dec 17, 2024

Uh oh!

alexey-bataev commented Dec 17, 2024

Uh oh!

fhahn commented Dec 17, 2024

Uh oh!

lukel97 commented Dec 17, 2024

Uh oh!

fhahn commented Dec 17, 2024

Uh oh!

Uh oh!

fhahn Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

lukel97 Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

fhahn Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

lukel97 Dec 17, 2024

Choose a reason for hiding this comment

Uh oh!

fhahn left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

lukel97 commented Dec 18, 2024

Uh oh!

llvm-ci commented Dec 18, 2024

Uh oh!

Uh oh!

lukel97 commented Dec 17, 2024 •

edited

Loading