[PowerPC] Optimize BUILD_VECTOR from load and zeros #73609

bzEq · 2023-11-28T03:35:49Z

We have encountered patterns such as BUILD_VECTOR 0, 0, (load), 0 that result in suboptimal codegen. This PR improves the codegen for such patterns.
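For illustration only, the C++ sketch below shows the scalar meaning of the node quoted above: one element comes from memory and every other element is zero. The function name, the float element type, and the choice of lane 2 are assumptions made for this example. It is not code from the patch, and whether a compiler forms exactly this BUILD_VECTOR from it depends on the target and the optimization level.

```cpp
#include <array>

// Hypothetical helper, not taken from the patch: one loaded float is placed
// in lane 2 and the remaining lanes are set to zero, mirroring the shape
// BUILD_VECTOR 0, 0, (load), 0.
std::array<float, 4> load_into_zeroed_lanes(const float *src) {
  return {0.0f, 0.0f, *src, 0.0f};
}
```

Calling `load_into_zeroed_lanes(&x)` yields a four-element value whose only nonzero lane (when `x` is nonzero) is lane 2.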

nemanjai · 2023-11-28T08:16:35Z

llvm/lib/Target/PowerPC/PPCInstrVSX.td

@@ -3437,6 +3437,12 @@ def : Pat<(store (i32 (extractelt v4i32:$A, 1)), ForceXForm:$src),
 def : Pat<(store (f32 (extractelt v4f32:$A, 1)), ForceXForm:$src),
          (STIWX (EXTRACT_SUBREG $A, sub_64), ForceXForm:$src)>;

+// BUILD_VECTOR via loads and zeros.
+def : Pat<(v2f64 (build_vector (f64 (extloadf32 ForceXForm:$src)), (f64 fpimm0))),


Do we not have an opportunity to also improve the following:

- Non-extending load
- Single precision and i32 (other elements are zeros)
- Zero is at a different index
- Little endian

ecnelises · 2023-12-06T02:51:51Z

llvm/lib/Target/PowerPC/PPCInstrVSX.td

+          (v2i64 (COPY_TO_REGCLASS (LXSDX ForceXForm:$src), VSRC))>;
+def : Pat<(v2i64 BVLoadAndZerosLong<0>.DAG),
+          (v2i64 (XXPERMDIs
+                 (COPY_TO_REGCLASS (LXSDX ForceXForm:$src), VSRC), 2))>;


Can we use multiclass / defm to simplify the definitions?

bzEq · Dec 6, 2023

I've tried; however, it does not simplify the definitions much (maybe I was using a different approach from the one you have in mind).

bzEq · 2024-04-13T09:18:12Z

Ping.

kamaub · 2025-02-10T21:22:43Z

I am taking over this patch in #126599 to get it through review and committed, so I am closing this PR.

bzEq self-assigned this Nov 28, 2023

bzEq marked this pull request as draft November 28, 2023 04:30

bzEq changed the title ~~[PowerPC] Optimize BUILD_VECTOR from extload and zeros~~ [PowerPC] Optimize BUILD_VECTOR from load and zeros Nov 28, 2023

nemanjai reviewed Nov 28, 2023

bzEq pushed a commit that referenced this pull request Nov 30, 2023

[PowerPC] Enhance test for PR #73609. NFC.

afd9582

bzEq force-pushed the use-lxsspx branch from 8097621 to 2d04874 Compare November 30, 2023 05:10

bzEq requested review from ecnelises and chenzheng1030 November 30, 2023 05:11

bzEq marked this pull request as ready for review November 30, 2023 05:11

bzEq requested a review from nemanjai November 30, 2023 05:12

Optimize BUILD_VECTOR

fe5c6b5

bzEq force-pushed the use-lxsspx branch from 2d04874 to fe5c6b5 Compare November 30, 2023 05:55

ecnelises reviewed Dec 6, 2023

bzEq requested a review from ecnelises April 13, 2024 09:20

lei137 requested review from stefanp-synopsys, amykhuang, diggerlin and mandlebug September 6, 2024 14:54

kamaub mentioned this pull request Feb 10, 2025

[PowerPC] Optimize BUILD_VECTOR from load and zeros #126599

Open

kamaub closed this Feb 10, 2025

bzEq deleted the use-lxsspx branch February 16, 2025 06:38

bzEq restored the use-lxsspx branch February 16, 2025 06:38
