[AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow #118854

broxigarchen · 2024-12-05T18:29:28Z

Undo sub x, c -> add x, -c canonicalization in true16 fow.

This duplicating the pattern from fake16 and implemement the same pattern in true16 format

llvmbot · 2024-12-05T19:32:59Z

@llvm/pr-subscribers-backend-amdgpu

Author: Brox Chen (broxigarchen)

Changes

Undo sub x, c -> add x, -c canonicalization in true16 fow.

This duplicating the pattern from fake16 and implemement the same pattern in true16 format

Full diff: https://github.com/llvm/llvm-project/pull/118854.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/VOP3Instructions.td (+5-1)

diff --git a/llvm/lib/Target/AMDGPU/VOP3Instructions.td b/llvm/lib/Target/AMDGPU/VOP3Instructions.td
index 47b60bb0fdab30..ccf6ccf6e5d44a 100644
--- a/llvm/lib/Target/AMDGPU/VOP3Instructions.td
+++ b/llvm/lib/Target/AMDGPU/VOP3Instructions.td
@@ -1271,7 +1271,11 @@ let True16Predicate = NotHasTrue16BitInsts, SubtargetPredicate = isGFX10Plus in
 let True16Predicate = UseRealTrue16Insts in {
   def : OpSelBinOpClampPat<uaddsat, V_ADD_NC_U16_t16_e64>;
   def : OpSelBinOpClampPat<usubsat, V_SUB_NC_U16_t16_e64>;
-} // End OtherPredicates = [UseRealTrue16Insts]
+  def : GCNPat<
+    (add i16:$src0, (i16 NegSubInlineIntConst16:$src1)),
+    (V_SUB_NC_U16_t16_e64 0, VSrc_b16:$src0, 0, NegSubInlineIntConst16:$src1, 0, 0)
+  >;
+} // End True16Predicate = UseRealTrue16Insts
 
 let True16Predicate = UseFakeTrue16Insts in {
    def : OpSelBinOpClampPat<uaddsat, V_ADD_NC_U16_fake16_e64>;

llvm/lib/Target/AMDGPU/VOP3Instructions.td

llvm-ci · 2025-01-14T18:25:57Z

LLVM Buildbot has detected a new failure on builder llvm-clang-x86_64-expensive-checks-debian running on gribozavr4 while building llvm at step 6 "test-build-unified-tree-check-all".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/16/builds/11978

Here is the relevant piece of the build log for the reference

Step 6 (test-build-unified-tree-check-all) failure: test (failure)
******************** TEST 'LLVM :: tools/llvm-gsymutil/ARM_AArch64/macho-merged-funcs-dwarf.yaml' FAILED ********************
Exit Code: 1

Command Output (stdout):
--
Input file: /b/1/llvm-clang-x86_64-expensive-checks-debian/build/test/tools/llvm-gsymutil/ARM_AArch64/Output/macho-merged-funcs-dwarf.yaml.tmp.dSYM
Output file (aarch64): /b/1/llvm-clang-x86_64-expensive-checks-debian/build/test/tools/llvm-gsymutil/ARM_AArch64/Output/macho-merged-funcs-dwarf.yaml.tmp.default.gSYM
Loaded 3 functions from DWARF.
Loaded 3 functions from symbol table.
warning: same address range contains different debug info. Removing:
[0x0000000000000248 - 0x0000000000000270): Name=0x00000001
addr=0x0000000000000248, file=  1, line=  5
addr=0x0000000000000254, file=  1, line=  7
addr=0x0000000000000258, file=  1, line=  9
addr=0x000000000000025c, file=  1, line=  8
addr=0x0000000000000260, file=  1, line= 11
addr=0x0000000000000264, file=  1, line= 10
addr=0x0000000000000268, file=  1, line=  6


In favor of this one:
[0x0000000000000248 - 0x0000000000000270): Name=0x00000047
addr=0x0000000000000248, file=  3, line=  5
addr=0x0000000000000254, file=  3, line=  7
addr=0x0000000000000258, file=  3, line=  9
addr=0x000000000000025c, file=  3, line=  8
addr=0x0000000000000260, file=  3, line= 11
addr=0x0000000000000264, file=  3, line= 10
addr=0x0000000000000268, file=  3, line=  6


warning: same address range contains different debug info. Removing:
[0x0000000000000248 - 0x0000000000000270): Name=0x00000047
addr=0x0000000000000248, file=  3, line=  5
addr=0x0000000000000254, file=  3, line=  7
addr=0x0000000000000258, file=  3, line=  9
addr=0x000000000000025c, file=  3, line=  8
addr=0x0000000000000260, file=  3, line= 11
addr=0x0000000000000264, file=  3, line= 10
addr=0x0000000000000268, file=  3, line=  6


In favor of this one:
[0x0000000000000248 - 0x0000000000000270): Name=0x00000030
addr=0x0000000000000248, file=  2, line=  5
addr=0x0000000000000254, file=  2, line=  7
addr=0x0000000000000258, file=  2, line=  9
addr=0x000000000000025c, file=  2, line=  8
addr=0x0000000000000260, file=  2, line= 11
addr=0x0000000000000264, file=  2, line= 10
...

broxigarchen changed the title ~~[AMDGPU][True16][CodeGen]~~ [AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow Dec 5, 2024

broxigarchen marked this pull request as ready for review December 5, 2024 19:32

broxigarchen requested a review from arsenm December 5, 2024 19:32

llvmbot added the backend:AMDGPU label Dec 5, 2024

arsenm reviewed Dec 5, 2024

View reviewed changes

llvm/lib/Target/AMDGPU/VOP3Instructions.td Show resolved Hide resolved

broxigarchen added 2 commits January 13, 2025 12:40

revert add cannonlization to sub in true16

732e319

add test change

81fe90a

broxigarchen force-pushed the main-merge-true16-codegen-addsub-u16 branch from 6823631 to 81fe90a Compare January 13, 2025 18:16

arsenm approved these changes Jan 14, 2025

View reviewed changes

broxigarchen requested review from Sisyph and kosarev January 14, 2025 15:57

broxigarchen merged commit f1b1c7f into llvm:main Jan 14, 2025
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow #118854

[AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow #118854

Uh oh!

broxigarchen commented Dec 5, 2024 •

edited

Loading

Uh oh!

llvmbot commented Dec 5, 2024

Uh oh!

Uh oh!

Uh oh!

llvm-ci commented Jan 14, 2025

Uh oh!

Uh oh!

[AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow #118854

[AMDGPU][True16][CodeGen] Undo sub(x,c) to add in true16 flow #118854

Uh oh!

Conversation

broxigarchen commented Dec 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Dec 5, 2024

Uh oh!

Uh oh!

Uh oh!

llvm-ci commented Jan 14, 2025

Uh oh!

Uh oh!

broxigarchen commented Dec 5, 2024 •

edited

Loading