[VP] Check if VP ops with functional intrinsics are speculatable #69504

lukel97 · 2023-10-18T19:50:20Z

Noticed whilst working on #69494. VP intrinsics whose functional equivalent is
an intrinsic were having their lanes marked as non-speculatable, even if the
underlying intrinsic was speculatable.

This meant that

  %1 = call <4 x i32> @llvm.vp.umax(<4 x i32> %x, <4 x i32> %y, <4 x i1> %mask, i32 %evl)

would be expanded out to

  %.splatinsert = insertelement <4 x i32> poison, i32 %evl, i64 0
  %.splat = shufflevector <4 x i32> %.splatinsert, <4 x i32> poison, <4 x i32> zeroinitializer
  %1 = icmp ult <4 x i32> <i32 0, i32 1, i32 2, i32 3>, %.splat
  %2 = and <4 x i1> %1, %mask
  %3 = call <4 x i32> @llvm.umax.v4i32(<4 x i32> %x, <4 x i32> %y)

instead of

  %1 = call <4 x i32> @llvm.umax.v4i32(<4 x i32> %x, <4 x i32> %y)

The cause was isSafeToSpeculativelyExecuteWithOpcode checking the function
attributes of the VP instruction itself rather than those of the functional
intrinsic. Since isSafeToSpeculativelyExecuteWithOpcode expects an already
materialized instruction, we can't use it directly for the intrinsic case.
This patch fixes it by manually checking the function attributes on the
functional intrinsic.
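
As a rough illustration of the change in logic, here is a toy model in plain C++. It does not use LLVM's API: `IntrinsicInfo`, `IntrinsicTable`, `maySpeculateLanesOld`, `maySpeculateLanesNew` and `makeExampleTable` are hypothetical names invented for this sketch, and the attribute values in the table are assumptions chosen to match the behaviour described above (the VP intrinsic lacks `speculatable`, its functional equivalent has it).

```cpp
#include <cassert>
#include <optional>
#include <string>
#include <unordered_map>

// Toy stand-in for an intrinsic declaration (hypothetical, not an LLVM type).
struct IntrinsicInfo {
  bool speculatable = false;              // models the `speculatable` fn attribute
  std::optional<std::string> functional;  // unpredicated intrinsic equivalent, if any
};

using IntrinsicTable = std::unordered_map<std::string, IntrinsicInfo>;

// Behaviour before the fix, as described above: only the VP intrinsic's
// own attributes are consulted.
bool maySpeculateLanesOld(const IntrinsicTable &table, const std::string &vp) {
  assert(table.count(vp) && "unknown intrinsic");
  return table.at(vp).speculatable;
}

// Behaviour after the fix: if the VP op maps to a functional intrinsic,
// consult that intrinsic's attributes instead.
bool maySpeculateLanesNew(const IntrinsicTable &table, const std::string &vp) {
  assert(table.count(vp) && "unknown intrinsic");
  const IntrinsicInfo &info = table.at(vp);
  if (info.functional) {
    auto it = table.find(*info.functional);
    return it != table.end() && it->second.speculatable;
  }
  return info.speculatable;
}

// Assumed attribute values for the example from the description.
IntrinsicTable makeExampleTable() {
  IntrinsicTable table;
  table["llvm.vp.umax"] = IntrinsicInfo{false, std::string("llvm.umax")};
  table["llvm.umax"] = IntrinsicInfo{true, std::nullopt};
  return table;
}
```

In this model, `maySpeculateLanesOld(makeExampleTable(), "llvm.vp.umax")` is false, which corresponds to the longer expansion with the EVL compare, while `maySpeculateLanesNew` is true, which corresponds to the single `@llvm.umax.v4i32` call.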


RKSimon · 2023-10-19T11:01:04Z

Please can you precommit the test changes so the patch properly shows the codegen diff?

lukel97 · 2023-10-19T13:40:01Z

> Please can you precommit the test changes so the patch properly shows the codegen diff

I wanted to do that originally, but unfortunately expand-vp.ll is a handwritten test. I'm tempted to convert it to use update_test_checks.py. Should I do that in a separate PR?

lukel97 · 2023-10-26T12:29:24Z

Ping. Looks like the test was intentionally handwritten, with the agreement to convert it to UTC once caching had been implemented: https://reviews.llvm.org/D78203#inline-949565

RKSimon

LGTM


lukel97 requested review from frasercrmck, LiqinWeng, RKSimon, simoll, michaelmaitland and topperc October 18, 2023 19:50

RKSimon approved these changes Oct 26, 2023


lukel97 merged commit 2e85123 into llvm:main Oct 26, 2023
