[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type to shrink #102942

broxigarchen · 2024-08-12T17:43:21Z

This bug is introduced in #102198

The previous path change to use realTrue16 flag, however, we have some t16 instructions that are implemented with fake16, and has Lo128 registers types. Thus we should still using hasTrue16Bit flag for shrinking check

to shrink

llvmbot · 2024-08-12T17:43:58Z

@llvm/pr-subscribers-backend-amdgpu

Author: Brox Chen (broxigarchen)

Changes

This bug is introduced in #102198

Full diff: https://github.com/llvm/llvm-project/pull/102942.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp (+1-1)

diff --git a/llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp b/llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp
index 155747551471e..5d38cafd73dd9 100644
--- a/llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp
+++ b/llvm/lib/Target/AMDGPU/SIShrinkInstructions.cpp
@@ -1048,7 +1048,7 @@ bool SIShrinkInstructions::runOnMachineFunction(MachineFunction &MF) {
               MachineFunctionProperties::Property::NoVRegs))
         continue;
 
-      if (ST->useRealTrue16Insts() && AMDGPU::isTrue16Inst(MI.getOpcode()) &&
+      if (ST->hasTrue16BitInsts() && AMDGPU::isTrue16Inst(MI.getOpcode()) &&
           !shouldShrinkTrue16(MI))
         continue;

Sisyph

LGTM. We can revert this part of the patch until all instructions are properly updated.

jayfoad · 2024-08-12T17:59:53Z

Can you include a test case?

hanhanW · 2024-08-12T18:28:51Z

Thanks, I verified that it fixes the issue in our project!

arsenm

Needs test

broxigarchen · 2024-08-12T19:45:49Z

Needs test

Added a small test to verify and to prevent furture failure. Should be expanded when more true16 and fake16 instructions are supported

broxigarchen added 3 commits August 12, 2024 13:41

[AMDGPU][CodeGen] support v_mov_b16 and v_swap_b16 in true16 format

b1ee138

added back the missing imm pattern for mov_b16

b01863a

[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type

acfb65a

to shrink

llvmbot added the backend:AMDGPU label Aug 12, 2024

Sisyph approved these changes Aug 12, 2024

View reviewed changes

arsenm requested changes Aug 12, 2024

View reviewed changes

added a mir test for shrinking Lo128 register type

6fb5015

broxigarchen force-pushed the main-merge-true16-swap-mov branch from 109eaf0 to 6fb5015 Compare August 12, 2024 19:47

raikonenfnu self-requested a review August 12, 2024 20:12

arsenm approved these changes Aug 12, 2024

View reviewed changes

Sisyph merged commit 6b7afaa into llvm:main Aug 12, 2024
6 of 7 checks passed

broxigarchen mentioned this pull request Aug 13, 2024

Request Commit Access For broxigarchen #100457

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type to shrink #102942

[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type to shrink #102942

Uh oh!

broxigarchen commented Aug 12, 2024 •

edited

Loading

Uh oh!

llvmbot commented Aug 12, 2024

Uh oh!

Sisyph left a comment

Uh oh!

jayfoad commented Aug 12, 2024

Uh oh!

hanhanW commented Aug 12, 2024

Uh oh!

arsenm left a comment

Uh oh!

broxigarchen commented Aug 12, 2024

Uh oh!

Uh oh!

Uh oh!

[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type to shrink #102942

[AMDGPU][True16] fix a bug in codeGen causing e64 with wrong vgpr type to shrink #102942

Uh oh!

Conversation

broxigarchen commented Aug 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Aug 12, 2024

Uh oh!

Sisyph left a comment

Choose a reason for hiding this comment

Uh oh!

jayfoad commented Aug 12, 2024

Uh oh!

hanhanW commented Aug 12, 2024

Uh oh!

arsenm left a comment

Choose a reason for hiding this comment

Uh oh!

broxigarchen commented Aug 12, 2024

Uh oh!

Uh oh!

Uh oh!

broxigarchen commented Aug 12, 2024 •

edited

Loading