[AMDGPU] Correct bitshift legality transformation for small vectors #140940

zGoldthorpe · 2025-05-21T17:49:44Z

Fix for a bug found by the AMD fuzzing project.

During the legalisation of bitshifts, the legaliser would try to widen a small vector such as `<4 x i1>` to a single `i16`, because the rule was written without consideration for vector operands. This patch adds a guard that prohibits this transformation for vectors, allowing other legalisation transformations to step in.
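As a rough illustration of the guard, the sketch below evaluates the predicate from the diff before and after the change. `MiniLLT`, `widenAmountBefore` and `widenAmountAfter` are hypothetical stand-ins invented for this example; the type only mimics the two `LLT` queries the predicate uses (`isScalar()` and `getSizeInBits()`) and is not the real LLVM class.

```cpp
#include <cassert>

// Hypothetical stand-in for llvm::LLT, modelling only what the predicate needs.
struct MiniLLT {
  bool IsVector;        // true for a fixed vector such as <4 x i1>
  unsigned NumElements; // 1 for scalars
  unsigned ScalarBits;  // bits per element

  bool isScalar() const { return !IsVector; }
  unsigned getSizeInBits() const { return NumElements * ScalarBits; }
};

// Predicate before the patch: only total bit sizes are inspected, so a
// small vector is indistinguishable from a small scalar.
bool widenAmountBefore(MiniLLT ValTy, MiniLLT AmountTy) {
  return ValTy.getSizeInBits() <= 16 && AmountTy.getSizeInBits() < 16;
}

// Predicate after the patch: vectors are rejected up front, leaving them
// to other legalisation rules.
bool widenAmountAfter(MiniLLT ValTy, MiniLLT AmountTy) {
  return ValTy.isScalar() && ValTy.getSizeInBits() <= 16 &&
         AmountTy.getSizeInBits() < 16;
}
```

Under these assumptions, a scalar `i4` shift (`MiniLLT{false, 1, 4}`) matches both predicates, while a `<4 x i1>` shift (`MiniLLT{true, 4, 1}`) matches only the old one, since its total size of 4 bits passes the size checks.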

github-actions · 2025-05-21T17:50:06Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2025-05-21T17:50:39Z

@llvm/pr-subscribers-llvm-globalisel

@llvm/pr-subscribers-backend-amdgpu

Author: None (zGoldthorpe)

Full diff: https://github.com/llvm/llvm-project/pull/140940.diff

2 Files Affected:

(modified) llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp (+1-1)
(added) llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll (+24)

diff --git a/llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp b/llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
index 667c466a998e0..eeb05f0acebed 100644
--- a/llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
+++ b/llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp
@@ -1765,7 +1765,7 @@ AMDGPULegalizerInfo::AMDGPULegalizerInfo(const GCNSubtarget &ST_,
         // 32-bit amount.
         const LLT ValTy = Query.Types[0];
         const LLT AmountTy = Query.Types[1];
-        return ValTy.getSizeInBits() <= 16 &&
+        return ValTy.isScalar() && ValTy.getSizeInBits() <= 16 &&
                AmountTy.getSizeInBits() < 16;
       }, changeTo(1, S16));
     Shifts.maxScalarIf(typeIs(0, S16), 1, S16);
diff --git a/llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll b/llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll
new file mode 100644
index 0000000000000..1d40038abe911
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll
@@ -0,0 +1,24 @@
+; RUN: llc -global-isel -mtriple=amdgcn -mcpu=gfx90a -O0 -print-after=legalizer %s -o /dev/null 2>&1 | FileCheck %s
+
+; CHECK-LABEL: widen_ashr_i4:
+define amdgpu_kernel void @widen_ashr_i4(
+    ptr addrspace(1) %res, i4 %a, i4 %b) {
+; CHECK: G_ASHR %{{[0-9]+}}:_, %{{[0-9]+}}:_(s16)
+entry:
+  %res.val = ashr i4 %a, %b
+  store i4 %res.val, ptr addrspace(1) %res
+  ret void
+}
+
+; CHECK-LABEL: widen_ashr_v4i1:
+define amdgpu_kernel void @widen_ashr_v4i1(
+    ptr addrspace(1) %res, <4 x i1> %a, <4 x i1> %b) {
+; CHECK: G_ASHR %{{[0-9]+}}:_, %{{[0-9]+}}:_(s16)
+; CHECK: G_ASHR %{{[0-9]+}}:_, %{{[0-9]+}}:_(s16)
+; CHECK: G_ASHR %{{[0-9]+}}:_, %{{[0-9]+}}:_(s16)
+; CHECK: G_ASHR %{{[0-9]+}}:_, %{{[0-9]+}}:_(s16)
+entry:
+  %res.val = ashr <4 x i1> %a, %b
+  store <4 x i1> %res.val, ptr addrspace(1) %res
+  ret void
+}

Also added tests for `<4 x i2>` vectors, which are still within the scope of the patch, particularly because an arithmetic right shift on `i1` elements is an identity transformation.
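One reading of the identity remark, assuming it refers to `i1` elements: in LLVM IR a shift amount equal to or larger than the bit width yields poison, so the only defined amount for a 1-bit value is 0, which leaves the value unchanged. The sketch below models a narrow arithmetic right shift to show the difference between 1-bit and 2-bit elements. `ashrNarrow` is a hypothetical helper written for this example, not an LLVM function.

```cpp
#include <cassert>
#include <cstdint>

// Arithmetic right shift of the low `Bits` bits of `Value`, treated as a
// two's-complement integer. `Amount` must be smaller than `Bits`, mirroring
// the LLVM IR rule that larger amounts produce poison.
uint32_t ashrNarrow(uint32_t Value, unsigned Bits, unsigned Amount) {
  assert(Bits >= 1 && Bits <= 16 && Amount < Bits);
  const uint32_t Mask = (1u << Bits) - 1;
  Value &= Mask;
  const uint32_t Sign = (Value >> (Bits - 1)) & 1u;
  uint32_t Shifted = Value >> Amount;
  if (Sign) {
    // Fill the vacated high bits with copies of the sign bit.
    Shifted |= Mask & ~(Mask >> Amount);
  }
  return Shifted & Mask;
}
```

With 1-bit elements the only permitted call shape is `ashrNarrow(v, 1, 0)`, which returns `v`. With 2-bit elements, `ashrNarrow(0b10, 2, 1)` gives `0b11` (that is, -2 shifted to -1), so the `<4 x i2>` tests exercise a shift that actually changes the value.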

github-actions · 2025-05-23T08:56:39Z

@zGoldthorpe Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested by our build bots. If there is a problem with a build, you may receive a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as the builds can include changes from many authors. It is not uncommon for your change to be included in a build that fails due to someone else's changes, or infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself. This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

llvm-ci · 2025-05-23T09:05:06Z

LLVM Buildbot has detected a new failure on builder lldb-x86_64-debian running on lldb-x86_64-debian while building llvm at step 6 "test".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/162/builds/23018

Here is the relevant piece of the build log for the reference

Step 6 (test) failure: build (failure)
...
UNSUPPORTED: lldb-shell :: Process/Windows/exception_access_violation.cpp (2945 of 2956)
UNSUPPORTED: lldb-shell :: ScriptInterpreter/Lua/watchpoint_callback.test (2946 of 2956)
UNSUPPORTED: lldb-shell :: ScriptInterpreter/Python/Crashlog/text.test (2947 of 2956)
UNSUPPORTED: lldb-shell :: ObjectFile/ELF/elf-dynsym.test (2948 of 2956)
UNSUPPORTED: lldb-shell :: ScriptInterpreter/Lua/fail_breakpoint_oneline.test (2949 of 2956)
UNSUPPORTED: lldb-shell :: ScriptInterpreter/Lua/bindings.test (2950 of 2956)
UNSUPPORTED: lldb-shell :: ScriptInterpreter/Lua/persistent_state.test (2951 of 2956)
PASS: lldb-api :: api/multithreaded/TestMultithreaded.py (2952 of 2956)
PASS: lldb-api :: terminal/TestEditlineCompletions.py (2953 of 2956)
UNRESOLVED: lldb-api :: tools/lldb-dap/launch/TestDAP_launch.py (2954 of 2956)
******************** TEST 'lldb-api :: tools/lldb-dap/launch/TestDAP_launch.py' FAILED ********************
Script:
--
/usr/bin/python3 /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/dotest.py -u CXXFLAGS -u CFLAGS --env LLVM_LIBS_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/./lib --env LLVM_INCLUDE_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/include --env LLVM_TOOLS_DIR=/home/worker/2.0.1/lldb-x86_64-debian/build/./bin --arch x86_64 --build-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex --lldb-module-cache-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex/module-cache-lldb/lldb-api --clang-module-cache-dir /home/worker/2.0.1/lldb-x86_64-debian/build/lldb-test-build.noindex/module-cache-clang/lldb-api --executable /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/lldb --compiler /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/clang --dsymutil /home/worker/2.0.1/lldb-x86_64-debian/build/./bin/dsymutil --make /usr/bin/gmake --llvm-tools-dir /home/worker/2.0.1/lldb-x86_64-debian/build/./bin --lldb-obj-root /home/worker/2.0.1/lldb-x86_64-debian/build/tools/lldb --lldb-libs-dir /home/worker/2.0.1/lldb-x86_64-debian/build/./lib -t /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/tools/lldb-dap/launch -p TestDAP_launch.py
--
Exit Code: 1

Command Output (stdout):
--
lldb version 21.0.0git (https://github.com/llvm/llvm-project.git revision bb7e5597407884dbbd1d45570fa73dea168545f5)
  clang revision bb7e5597407884dbbd1d45570fa73dea168545f5
  llvm revision bb7e5597407884dbbd1d45570fa73dea168545f5
Skipping the following test categories: ['libc++', 'dsym', 'gmodules', 'debugserver', 'objc']

--
Command Output (stderr):
--
Change dir to: /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/lldb/test/API/tools/lldb-dap/launch
runCmd: settings clear --all

output: 

runCmd: settings set symbols.enable-external-lookup false

output: 

runCmd: settings set target.inherit-tcc true

output: 

runCmd: settings set target.disable-aslr false

output: 

runCmd: settings set target.detach-on-error false

output: 

runCmd: settings set target.auto-apply-fixits false

…lvm#140940) Fix for a bug found by the AMD fuzzing project. The legaliser would originally try to widen a small vector such as `<4 x i1>` to a single `i16` during the legalisation of bitshifts, as it was not originally written with consideration for vector operands. This patch simply adds a guard to prohibit this transformation and allow other legalisation transformations to step in.

Correct bitshift legalisation

12bdc90

llvmbot added the backend:AMDGPU label May 21, 2025

shiltian reviewed May 21, 2025

llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll (outdated, resolved)

shiltian requested a review from arsenm May 21, 2025 17:57

arsenm added the llvm:globalisel label May 21, 2025

arsenm reviewed May 21, 2025

llvm/test/CodeGen/AMDGPU/widen-vector-shift.ll (three threads, all outdated and resolved)

Integrated and autogenerated test.

d364457

arsenm reviewed May 21, 2025

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ashr.mir (outdated, resolved)

Rewrote tests in LLVM IR.

8740201

shiltian approved these changes May 22, 2025

arsenm approved these changes May 23, 2025

arsenm merged commit bb7e559 into llvm:main May 23, 2025
7 of 11 checks passed

zGoldthorpe deleted the pr/bitshift-legality branch May 23, 2025 12:50
