LLVM and SPIRV-LLVM-Translator pulldown (WW20) #9486

jsji · 2023-05-16T18:49:22Z

LLVM: llvm/llvm-project@2051755
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@772c7be

The code is doing the optimization: `((a | c1) << c2)` ==> `(a << c2) + (c1 << c2)` But this is only valid if `a` and `c1` have no common bits being set. Differential Revision: https://reviews.llvm.org/D150246

The revision adds basic timing to the mlir-translate tool. Reviewed By: Dinistro Differential Revision: https://reviews.llvm.org/D150434

This patch consumes the EntryValueObjects in a MachineFunction's table, using them to emit the appropriate debug information for these variables. Depends on D149880 Differential Revision: https://reviews.llvm.org/D149881

Most of the code changed here dates back to 2010, when LLDB was first introduced upstream, as such it benefits from a slight cleanup. The method "dump" is not used anywhere nor is it tested, so this commit removes it. The "findRanges" method returns a boolean which is never checked and indicates whether the method found anything/assigned a range map to the out parameter. This commit folds the out parameter into the return type of the method. A handful of typedefs were also never used and therefore removed. Differential Revision: https://reviews.llvm.org/D150363

The TrackingListener was unnecessarily strict. Existing ops are now allowed when updating payload ops mappings due to `replaceOp` in the TrackingListener. Differential Revision: https://reviews.llvm.org/D150429

…hen call stack frame extension is invoked When the stack frame extension routine is used, the contents of r3 is overwritten. However, if r3 is live in the prologue (ie. one of the function's parameters resides in r3), it needs to be saved. We save r3 in r0 if r0 is available (ie. r0 is not used as temporary storage for r4), and in the corresponding stack slot for the third parameter otherwise. Differential Revision: https://reviews.llvm.org/D150332 Reviewed By: uweigand

The newly added compiler_pop_stack_no_memoperands has no memory operands on the memory instructions but accesses the same locations as compiler_pop_stack. At the moment, accesses to the stack are missed by shrink-wrapping. Test case for the issue pointed out by @jpenix-quic in D149668 post-commit.

This change adds the following three operations and unit tests for them: - conv_3d_ncdhw_fcdhw - depthwise_conv_1d_ncw_cw - depthwise_conv_3d_ncdhw_cdhw Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D150054

- Added missing TensorTransformOps to the Transform doc - Added missing AMDGPUPasses to the Passes doc - Place `async dialect` in alphabetical order in the Passes doc Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D150341

This reverts commit 8d657c4. Reverts it due to the regression reported in D150068.

…X3 to VEX2 1. Share code `optimizeInstFromVEX3ToVEX2` with MCInstLower 2. Move the code of optimization for shift/rotate to a separate file 3. Since the function is shared, a side effect is that more encoding optimizations are done on the Asmparser side. Considering we already use reverse-encoding for optimization in AsmParser before this patch, I believe the change is positive and expected. This is a reland of D150068 with the fix D150440.

As pointed out by @jpenix-quic in D149668 post-commit, machine instructions without memory operands need to be treated conservatively.

…sic memcpy. With this change, more `memref.copy` will be lowered to the efficient `memcpy`. For example, ``` memref.copy %subview, %alloc : memref<1x576xf32, strided<[704, 1]>> to memref<1x576xf32> ``` Differential Revision: https://reviews.llvm.org/D150448

Change-Id: I608f14ac3a504cc668f93f130a17dea3950fa554

Also some simplifications: * `outputBufferOperands` was unused. * The condition that the number of operands equals the number of inputs plus the number of inits seemed vacuously true (?). Differential Revision: https://reviews.llvm.org/D150376

Fixes #62653

The financial cost of the network I/O for the Clang install artifacts is quite significant. afd3478 improved this by creating tarballs. This commit improves the tarball by using xz compression instead of gzip. This option is the slowest, but gives the smallest size. size time time (compression) (decompression) gzip 51 M 7 s 1.2 s bz2 44 M 17 s 5.8 s xz 33 M 76 s 3.1 s Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D150062

These tests should have added -std=c++23 instead of replacing -std=c++2b in D149553. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D150063

The newer formatters for (tuple, vector<bool>::reference) specify the formatter's parse and format member function. This signature is slightly different from the signature for existing formatters. Adapt the existing formatters to the new style. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D150034

Differential Revision: https://reviews.llvm.org/D149986

This commit implements IRTranslator lowering of dbg.declare intrinsics targeting swiftasync Arguments, by putting them in the MachineFunction's table of variables whose location doesn't change throughout the function. Depends on D149881 Differential Revision: https://reviews.llvm.org/D149882

While pointers in address space 7 (128 bit rsrc + 32 bit offset) should be rewritten out of the code before IR translation on AMDGPU, higher-level analyses may still call MVT getPointerTy() and the like on the target machine. Currently, since there is no MVT::i160, this operation ends up causing crashes. The changes to the data layout that caused such crashes were D149776. This patch causes getPointerTy() to return the type MVT::v5i32 and getPointerMemTy() to be MVT::v8i32. These are accurate types, but mean that we can't use vectors of address space 7 pointers during codegen. This is mostly OK, since vectors of buffers aren't supported in LPC anyway, but it's a noticable limitation. Potential alternative solutions include adjusting getPointerTy() to return an EVT or adding MVT::i160 and MVT::i256, both of which are rather disruptive to the rest of the compiler. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D150002

This commit implements SelectionDAG lowering of dbg.declare intrinsics targeting swiftasync Arguments, by putting them in the MachineFunction's table of variables whose location doesn't change throughout the function. Depends on D149882 Differential Revision: https://reviews.llvm.org/D149883

…lh/mul_lohi are not available. Correct the legality of i32 mul_lohi on AArch64. Previously, AArch64 incorrectly reported i32 mul_lohi as Legal. This allowed BuildUDIV/SDIV to use them. A later DAGCombiner would replace them with MULHS/MULHU because only the high half was used. This conversion does not check the legality of MULHS/MULHU under the assumption that LegalizeDAG can turn it back into MUL_LOHI later. After they are converted to MULHS/MULHU, DAGCombine ran and saw that these operations aren't supported but an i64 MUL is. So they get converted to that plus a shift. Without this, LegalizeDAG would convert back MUL_LOHI and isel would fail to find a pattern. This patch teaches BuildUDIV/SDIV to create the wide mul and shift so that we can report the correct operation legality on AArch64. It also enables div by constant folding for more cases on VE. I don't know if VE wants this div by constant optimization or not. If they don't want it, they can use the isIntDivCheap hook to disable it. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D150333

Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D150414

Add llvm-mca tests for RISCV LMUL instruments to show that llvm-mca RISCV LMUL instruments work. Differential Revision: https://reviews.llvm.org/D149496

…epare While the original motivation for this patch (address space 7 on AMDGPU) has been reworked and is not presently planned to reach IR translation, the incorrect (by the spec) handling of index offset width in IR translation and CodeGenPrepare is likely to trip someone - possibly future AMD, since we have a p7:160:256:256:32 now, so we convert to the other API now. Reviewed By: aemerson, arsenm Differential Revision: https://reviews.llvm.org/D143526

This commit passed buildable tests in phabricator, but fails once committed. This reverts commit 1dedc96.

…v_pulldown

Currently, we always convert SPIR-V bultins to globals for forward translation and to functions for reverse translation. I have a use case where I want to keep them as globals for reverse translation, so I added this mode. Implementations for both cases already existed, I just consolidated them and added the option. Signed-off-by: Sarnie, Nick <[email protected]> Original commit: KhronosGroup/SPIRV-LLVM-Translator@730eaf0

This target extension type is created here: https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/lib/GenXIntrinsics/GenXSPIRVWriterAdaptor.cpp#L245 As with other target extension types, reverse translation is not yet supported. Signed-off-by: Sarnie, Nick <[email protected]> Co-authored-by: Victor Mustya <[email protected]> Original commit: KhronosGroup/SPIRV-LLVM-Translator@60746d5

Currently only to DebugInfo/X86 Currently failing tests can be noticed by RUNx line Signed-off-by: Sidorov, Dmitry <[email protected]> Original commit: KhronosGroup/SPIRV-LLVM-Translator@772c7be

kchusha · 2023-05-17T13:14:12Z

/summary:run

sarnex · 2023-05-18T13:54:26Z

@againull @kchusha @jsji

I have a PR to fix the llvm-spirv test failures here, feel free to merge it

the builtin-functions one is a real issue but it was introduced before just exposed by this test, i can reproduce it in current sycl branch HEAD, i will make an internal tracker second one is expected because of entry point wrapper thing reverted here Signed-off-by: Sarnie, Nick <[email protected]>

againull · 2023-05-18T15:47:34Z

@againull @kchusha @jsji

I have a PR to fix the llvm-spirv test failures here, feel free to merge it

@sarnex Thanks a ton for providing the fix!

againull · 2023-05-18T20:48:41Z

/merge

bb-sycl · 2023-05-18T20:49:10Z

Thu 18 May 2023 08:49:09 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2023-05-18T20:54:14Z

Thu 18 May 2023 08:54:14 PM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

ruiling and others added 30 commits May 12, 2023 19:50

AMDGPU: Fix issue in shl(or) combine

60d9010

The code is doing the optimization: `((a | c1) << c2)` ==> `(a << c2) + (c1 << c2)` But this is only valid if `a` and `c1` have no common bits being set. Differential Revision: https://reviews.llvm.org/D150246

[mlir] Add timings to mlir translate.

c3b4e27

The revision adds basic timing to the mlir-translate tool. Reviewed By: Dinistro Differential Revision: https://reviews.llvm.org/D150434

[AsmPrinter] Use EntryValue object info to emit Dwarf

ee75422

This patch consumes the EntryValueObjects in a MachineFunction's table, using them to emit the appropriate debug information for these variables. Depends on D149880 Differential Revision: https://reviews.llvm.org/D149881

[mlir][transform] TrackingListener: Allow existing ops as replacements

7d436d5

The TrackingListener was unnecessarily strict. Existing ops are now allowed when updating payload ops mappings due to `replaceOp` in the TrackingListener. Differential Revision: https://reviews.llvm.org/D150429

Revert "[X86][AsmParser] Refactor code in AsmParser"

f4865c7

This reverts commit 8d657c4. Reverts it due to the regression reported in D150068.

[ShrinkWrap] Conservatively treat MIs without memory operands.

d0718ff

As pointed out by @jpenix-quic in D149668 post-commit, machine instructions without memory operands need to be treated conservatively.

Precommit test for D149873

2c52a18

Change-Id: I608f14ac3a504cc668f93f130a17dea3950fa554

[X86] narrowShuffle - only narrow from legal vector types

c06a61f

Fixes #62653

[clang] Restores some -std=c++2b tests.

fd55636

These tests should have added -std=c++23 instead of replacing -std=c++2b in D149553. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D150063

AMDGPU: Force sc0 and sc1 on stores for gfx940 and gfx941

42bd814

Differential Revision: https://reviews.llvm.org/D149986

[mlir][sparse] minor reorg of sparse tensor tablegen defs

ea7ee9d

Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D150414

[RISCV][llvm-mca] Add mca tests for riscv lmul instruments

1dedc96

Add llvm-mca tests for RISCV LMUL instruments to show that llvm-mca RISCV LMUL instruments work. Differential Revision: https://reviews.llvm.org/D149496

Revert "[RISCV][llvm-mca] Add mca tests for riscv lmul instruments"

ad8765a

This commit passed buildable tests in phabricator, but fails once committed. This reverts commit 1dedc96.

[mlir][Linalg] NFC - Retire dead FusionOnTensors.cpp

26b5b06

[mlir][Linalg] NFC - Retire dead tilePadOp

0047b17

sys-ce-bb and others added 5 commits May 16, 2023 11:48

Merge remote-tracking branch 'origin/sycl-web' into llvmspirv_pulldown

5ce8979

Merge commit '205175578e0d73b4cd63d4d124a900fff10da7f8' into llvmspir…

fcdebe6

…v_pulldown

Add nonsemantic-shader-100/200 to X86 tests (#2005)

40c99ea

Currently only to DebugInfo/X86 Currently failing tests can be noticed by RUNx line Signed-off-by: Sidorov, Dmitry <[email protected]> Original commit: KhronosGroup/SPIRV-LLVM-Translator@772c7be

jsji added the disable-lint Skip linter check step and proceed with build jobs label May 16, 2023

againull closed this May 17, 2023

againull reopened this May 17, 2023

againull temporarily deployed to aws May 17, 2023 17:39 — with GitHub Actions Inactive

jsji temporarily deployed to aws May 18, 2023 16:30 — with GitHub Actions Inactive

jsji temporarily deployed to aws May 18, 2023 17:21 — with GitHub Actions Inactive

againull marked this pull request as ready for review May 18, 2023 20:48

againull requested review from a team and bader as code owners May 18, 2023 20:48

againull requested a review from jchlanda May 18, 2023 20:48

bb-sycl approved these changes May 18, 2023

View reviewed changes

bb-sycl merged commit dbadecb into sycl May 18, 2023

kchusha deleted the llvmspirv_pulldown branch May 19, 2023 12:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW20) #9486

LLVM and SPIRV-LLVM-Translator pulldown (WW20) #9486

Uh oh!

jsji commented May 16, 2023

Uh oh!

kchusha commented May 17, 2023

Uh oh!

sarnex commented May 18, 2023

Uh oh!

againull commented May 18, 2023

Uh oh!

againull commented May 18, 2023

Uh oh!

bb-sycl commented May 18, 2023

Uh oh!

bb-sycl commented May 18, 2023

Uh oh!

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW20) #9486

LLVM and SPIRV-LLVM-Translator pulldown (WW20) #9486

Uh oh!

Conversation

jsji commented May 16, 2023

Uh oh!

kchusha commented May 17, 2023

Uh oh!

sarnex commented May 18, 2023

Uh oh!

againull commented May 18, 2023

Uh oh!

againull commented May 18, 2023

Uh oh!

bb-sycl commented May 18, 2023

Uh oh!

bb-sycl commented May 18, 2023

Uh oh!

Uh oh!