Skip to content

Commit 3c79a04

Browse files
authored
[AMDGPU] Add amdgpu-sw-lower-lds pass to NPM codegen addIRPasses. (#124102)
This PR adds amdgpu-sw-lower-lds pass to AMDGPUCodeGenPassBuilder::addIRPasses()
1 parent 6330f1e commit 3c79a04

File tree

2 files changed

+4
-0
lines changed

2 files changed

+4
-0
lines changed

llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1950,6 +1950,9 @@ void AMDGPUCodeGenPassBuilder::addIRPasses(AddIRPass &addPass) const {
19501950

19511951
addPass(AMDGPUOpenCLEnqueuedBlockLoweringPass());
19521952

1953+
if (EnableSwLowerLDS)
1954+
addPass(AMDGPUSwLowerLDSPass(TM));
1955+
19531956
// Runs before PromoteAlloca so the latter can account for function uses
19541957
if (EnableLowerModuleLDS)
19551958
addPass(AMDGPULowerModuleLDSPass(TM));

llvm/test/CodeGen/AMDGPU/amdgpu-sw-lower-lds-static-lds-test.ll

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,6 @@
11
; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --check-globals all --version 4
22
; RUN: opt < %s -passes=amdgpu-sw-lower-lds -S -amdgpu-asan-instrument-lds=false -mtriple=amdgcn-amd-amdhsa | FileCheck %s
3+
; RUN: llc < %s -enable-new-pm -stop-after=amdgpu-sw-lower-lds -amdgpu-asan-instrument-lds=false -mtriple=amdgcn-amd-amdhsa | FileCheck %s
34

45
; Test to check if static LDS accesses in kernel are lowered correctly.
56
@lds_1 = internal addrspace(3) global [1 x i8] poison, align 4

0 commit comments

Comments
 (0)