[X86] Combine `uitofp <v x i32> to <v x half>` #121809

abhishek-kaushik22 · 2025-01-06T17:51:45Z

Closes #121793

llvmbot · 2025-01-06T17:52:19Z

@llvm/pr-subscribers-llvm-selectiondag

@llvm/pr-subscribers-backend-x86

Author: None (abhishek-kaushik22)

Changes

Closes #121793

Full diff: https://github.com/llvm/llvm-project/pull/121809.diff

1 Files Affected:

(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+13-2)

diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 68bdeb1cebeb9c..156ad47efcbf4f 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -56098,8 +56098,19 @@ static SDValue combineUIntToFP(SDNode *N, SelectionDAG &DAG,
   if (InVT.isVector() && VT.getVectorElementType() == MVT::f16) {
     unsigned ScalarSize = InVT.getScalarSizeInBits();
     if ((ScalarSize == 16 && Subtarget.hasFP16()) || ScalarSize == 32 ||
-        ScalarSize >= 64)
-      return SDValue();
+        ScalarSize >= 64) {
+      if (ScalarSize != 32 || VT.getScalarSizeInBits() != 16 ||
+          Subtarget.hasFP16())
+        return SDValue();
+      // UINT_TO_FP(vXi32 to vXf16) -> FP_ROUND(UINT_TO_FP(vXi32 to vXf32), 0)
+      return DAG.getNode(
+          ISD::FP_ROUND, SDLoc(N), VT,
+          DAG.getNode(ISD::UINT_TO_FP, SDLoc(N),
+                      InVT.changeVectorElementType(MVT::f32), Op0),
+          DAG.getTargetConstant(
+              0, SDLoc(N),
+              DAG.getTargetLoweringInfo().getPointerTy(DAG.getDataLayout())));
+    }
     SDLoc dl(N);
     EVT DstVT =
         EVT::getVectorVT(*DAG.getContext(),
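
The nested conditions in this hunk are easy to misread, so here is a minimal standalone sketch of the three outcomes they produce. It models only the element widths and the FP16 feature bit. The `Outcome` enum and the `classify` function are hypothetical names for illustration, not LLVM APIs, and the fall-through case stands for the pre-existing code that continues after the hunk.

```cpp
// Hypothetical model of the control flow in the hunk above; not LLVM code.
enum class Outcome {
  Bail,        // return SDValue(): no combine is performed here
  RoundViaF32, // FP_ROUND(UINT_TO_FP(vXi32 to vXf32), 0)
  FallThrough  // continue into the existing code after the hunk
};

// srcBits: integer element width, as in InVT.getScalarSizeInBits().
// dstBits: FP element width (16 once the enclosing f16 check has passed).
// hasFP16: stands in for Subtarget.hasFP16().
constexpr Outcome classify(unsigned srcBits, unsigned dstBits, bool hasFP16) {
  if ((srcBits == 16 && hasFP16) || srcBits == 32 || srcBits >= 64) {
    // Only i32 -> f16 without native FP16 support takes the new path.
    if (srcBits != 32 || dstBits != 16 || hasFP16)
      return Outcome::Bail;
    return Outcome::RoundViaF32;
  }
  return Outcome::FallThrough;
}
```

Read this way, the new rewrite fires only for 32-bit sources when the subtarget lacks FP16; 64-bit sources and FP16-capable subtargets still bail out as before.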

abhishek-kaushik22 · 2025-01-06T17:52:50Z

@RKSimon @phoebewang @e-kud can you please review?

e-kud · 2025-01-06T18:09:05Z

@abhishek-kaushik22 any tests?

RKSimon

I'd prefer this was handled in VectorLegalizer::ExpandUINT_TO_FLOAT since the issue isn't x86-specific

abhishek-kaushik22 · 2025-01-06T19:12:57Z

> I'd prefer this was handled in VectorLegalizer::ExpandUINT_TO_FLOAT since the issue isn't x86-specific

Can you please give an example of another target where this fails?

RKSimon

test cases?

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

arsenm

Needs tests

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

abhishek-kaushik22 · 2025-01-07T08:46:35Z

> Needs tests

I am not sure what to add to the tests. Should they include CHECK-NOT lines for the inf constant, or are auto-generated assertions enough? @RKSimon @arsenm

Edit: Added tests with auto-generated assertions to get a review.

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

abhishek-kaushik22 · 2025-01-07T10:58:15Z

With `+f16c` I still see `vmulps .LCPI0_0(%rip), %ymm1, %ymm1` where

.LCPI0_0:
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf
	.long	0x7f800000                      # float +Inf

in the assembly. Thanks @phoebewang for the comments.
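
A possible explanation for the `+Inf` constant, offered as an assumption rather than something stated in this thread: if the generic expansion splits each 32-bit lane into 16-bit halves and scales the high half by 2^16 in the destination type, that scale factor overflows in half precision, whose largest finite value is 65504. The sketch below emulates binary16 rounding in plain C++ and compares that assumed expansion with converting to f32 first and rounding once. `roundToHalf`, `expandInHalf`, and `viaF32` are hypothetical helpers, not LLVM code.

```cpp
#include <cassert>
#include <cfenv>
#include <cmath>
#include <cstdint>
#include <limits>

// Round a float to the nearest IEEE binary16 value, returned as a float.
// Relies on the default round-to-nearest-even mode used by std::nearbyint.
inline float roundToHalf(float x) {
  assert(std::fegetround() == FE_TONEAREST);
  if (std::isnan(x) || std::isinf(x) || x == 0.0f)
    return x;
  int exp = 0;
  std::frexp(std::fabs(x), &exp);         // |x| = m * 2^exp with m in [0.5, 1)
  if (exp < -13)
    exp = -13;                            // subnormal range: spacing is fixed
  float ulp = std::ldexp(1.0f, exp - 11); // binary16 has 11 significant bits
  float r = std::nearbyint(std::fabs(x) / ulp) * ulp;
  if (r > 65504.0f)                       // past the largest finite half
    r = std::numeric_limits<float>::infinity();
  return std::copysign(r, x);
}

// Assumed shape of the problematic expansion, with every step rounded to
// half: half(hi) * half(2^16) + half(lo).
inline float expandInHalf(std::uint32_t v) {
  float hi = roundToHalf(static_cast<float>(v >> 16));
  float lo = roundToHalf(static_cast<float>(v & 0xFFFFu));
  float scale = roundToHalf(65536.0f);    // overflows to +Inf
  return roundToHalf(roundToHalf(hi * scale) + lo);
}

// The rewrite named in the diff comment: convert to f32, then round once.
inline float viaF32(std::uint32_t v) {
  return roundToHalf(static_cast<float>(v));
}
```

Under these assumptions `expandInHalf(1000)` computes 0 * Inf and yields NaN, while `viaF32(1000)` returns 1000. Inputs above the half range, such as 70000, still round to +Inf on the f32 path, which is the expected overflow rather than an artifact of the expansion.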

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll

phoebewang

LGTM with one nit.

llvm/test/CodeGen/X86/uint_to_half.ll

RKSimon · 2025-01-08T10:28:00Z

Thanks @abhishek-kaushik22 !

llvmbot added the backend:X86 label Jan 6, 2025

e-kud requested review from RKSimon, phoebewang and e-kud January 6, 2025 18:08

RKSimon reviewed Jan 6, 2025

Update LegalizeVectorOps.cpp

9f62f41

abhishek-kaushik22 force-pushed the uint_to_fp branch from 10fb587 to 9f62f41 Compare January 6, 2025 21:50

llvmbot added the llvm:SelectionDAG (SelectionDAGISel as well) label Jan 6, 2025

abhishek-kaushik22 added 3 commits January 7, 2025 03:31

Merge remote-tracking branch 'upstream/main' into uint_to_fp

14d036a

Update LegalizeVectorOps.cpp

74f5327

Merge branch 'main' into uint_to_fp

1dca40c

RKSimon reviewed Jan 6, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

arsenm reviewed Jan 7, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

abhishek-kaushik22 added 2 commits January 7, 2025 13:38

Update LegalizeVectorOps.cpp

40ca7cf

Merge branch 'main' into uint_to_fp

be80a20

arsenm reviewed Jan 7, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

Update LegalizeVectorOps.cpp

9a519bc

Add test with auto assertions

eb53636

arsenm reviewed Jan 7, 2025

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

phoebewang reviewed Jan 7, 2025

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

abhishek-kaushik22 added 2 commits January 7, 2025 14:38

Update test

cab2613

Fix indentation

96707b1

phoebewang reviewed Jan 7, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp

abhishek-kaushik22 added 2 commits January 7, 2025 16:07

Update test with +f16c

d7809f2

Merge branch 'main' into uint_to_fp

8cf98f3

abhishek-kaushik22 added 2 commits January 7, 2025 16:54

Fix +f16c case

edb53f6

Merge branch 'uint_to_fp' of https://github.com/abhishek-kaushik22/llvm-project into uint_to_fp

4c1abcd

phoebewang reviewed Jan 7, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

phoebewang reviewed Jan 7, 2025

llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (outdated)

RKSimon reviewed Jan 7, 2025

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

RKSimon reviewed Jan 7, 2025

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

llvm/test/CodeGen/X86/test_UINT_TO_FP_no_inf_corei7_avx.ll (outdated)

abhishek-kaushik22 added 2 commits January 7, 2025 19:28

Address review comments

25850f3

Merge branch 'main' into uint_to_fp

f727748

phoebewang approved these changes Jan 7, 2025

llvm/test/CodeGen/X86/uint_to_half.ll (outdated)

abhishek-kaushik22 added 2 commits January 7, 2025 20:51

Update uint_to_half.ll

bf3b192

Merge branch 'uint_to_fp' of https://github.com/abhishek-kaushik22/llvm-project into uint_to_fp

477f75a

tianleliu merged commit 366e62a into llvm:main Jan 8, 2025
6 of 8 checks passed

abhishek-kaushik22 deleted the uint_to_fp branch January 8, 2025 08:57
