LLVM and SPIRV-LLVM-Translator pulldown (WW26) #3961

vmaksimo · 2021-06-21T10:40:07Z

LLVM: llvm/llvm-project@342bbb7
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@8679b96

…m-readobj. NFC. The --coff-exports option to llvm-readobj prints the exported symbols from a DLL/EXE, it doesn't do anything with regards to an import library. Differential Revision: https://reviews.llvm.org/D104214

The existing tests only check that some options (but not e.g. arm) are accepted; they don't test the functional effect of those options on the generated object files. Differential Revision: https://reviews.llvm.org/D104215

Also use the default LLVM target as default for dlltool. This matches how GNU dlltool behaves; it is compiled with one default target, which is used if no option is provided. Extend the anonymous namespace in the implementation file instead of using static functions. Based on a patch by Mateusz Mikuła. The effect of the default LLVM target, if neither the -m option nor a tool triple prefix is provided, isn't tested, as we can't make assumptions about what it is set to. (We could make the default be forced to one of the four supported architectures if the default triple is another arch, and then just test that llvm-dlltool without an -m option is able to produce an import library, without checking the actual architecture though.) Differential Revision: https://reviews.llvm.org/D104212

After D77330, the comments are inconsistent with the disassembled code. As the value of `far` has been changed, a thunk to reach it is now generated, and target addresses of branch instructions are different from what was initially expected. The patch fixes that and makes the test closer to what it was originally. Differential Revision: https://reviews.llvm.org/D104286

Do not use ultimate symbols in DescriptorInquiry. Using the ultimate symbol may lead to issues later for at least two reasons:
- The original symbols may have volatile/asynchronous attributes that the ultimate symbol may not have. Later phases working on the DescriptorInquiry would then not apply the care these attributes may require.
- HostAssociatedDetails symbols are used by OpenMP for symbols with special OpenMP attributes inside an OpenMP region (e.g. variables with the private attribute), so it is very important to preserve this aspect in the DescriptorInquiry, which would otherwise apply to the symbol outside of the region.

Differential Revision: https://reviews.llvm.org/D104385

As noted in PR45210 (https://bugs.llvm.org/show_bug.cgi?id=45210), the bug is triggered, as Eli says, when sext(idx) * ElementSize overflows.

```
// assume that GV is an array of 4-byte elements
GEP = gep GV, 0, Idx // this is accessing Idx * 4
L = load GEP
ICI = icmp eq L, value
=>
ICI = icmp eq Idx, NewIdx
```

The foldCmpLoadFromIndexedGlobal function simplifies a GEP+load operation to an icmp. The problem is that Idx * ElementSize can overflow. Let's assume that the wanted value is at offset 0. Then there are actually four possible values of Idx that match offset 0: 0x00..00, 0x40..00, 0x80..00, 0xC0..00. We should return true for all of these values, but currently the new icmp only returns true for 0x00..00.

This problem can be solved by masking off (trailing zeros of ElementSize) bits from Idx.

```
...
=>
Idx' = and Idx, 0x3F..FF
ICI = icmp eq Idx', NewIdx
```

Reviewed By: efriedma

Differential Revision: https://reviews.llvm.org/D99481
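The wraparound can be reproduced with plain integer arithmetic. The following is a minimal sketch, assuming a 64-bit index and 4-byte elements; `byte_offset`, `fold_before`, and `fold_after` are hypothetical names that only model the comparison and are not the InstCombine code itself.

```python
BITS = 64                      # assumed index width
WORD = (1 << BITS) - 1         # all-ones value used to wrap to BITS bits


def byte_offset(idx, elem_size=4):
    """Offset the GEP addresses: idx * elem_size, wrapped to BITS bits."""
    return (idx * elem_size) & WORD


def fold_before(idx, new_idx):
    """Model of the old fold: compare the raw index."""
    return idx == new_idx


def fold_after(idx, new_idx, elem_size=4):
    """Model of the new fold: clear as many top bits of idx as
    elem_size has trailing zeros, then compare."""
    trailing_zeros = (elem_size & -elem_size).bit_length() - 1
    return (idx & (WORD >> trailing_zeros)) == new_idx


# The four indices from the description that all address offset 0.
aliases = [k << (BITS - 2) for k in range(4)]
for idx in aliases:
    print(hex(idx), byte_offset(idx), fold_before(idx, 0), fold_after(idx, 0))
```

All four indices wrap to byte offset 0, but only the first one satisfies the unmasked comparison; with the mask `0x3F..FF` applied, all four do.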

Using short granule tags as poison causes a UaF to read the referenced memory to retrieve the tag, and means we do not detect the UaF if the last granule's tag is still around. This only increases the chance of not catching a UaF from 0.39 % (1 / 256) to 0.42 % (1 / (256 - 17)). Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D104304
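The two percentages follow directly from the fractions quoted above. This small sketch only redoes that arithmetic; the variable names are illustrative.

```python
TAG_VALUES = 256   # denominator used in the description
EXCLUDED = 17      # number subtracted in the description

before = 1 / TAG_VALUES
after = 1 / (TAG_VALUES - EXCLUDED)

print(f"before: {before:.2%}")   # before: 0.39%
print(f"after:  {after:.2%}")    # after:  0.42%
```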

…aces This functionality is similar to delayed registration of dialect interfaces. It allows external interface models to be registered before the dialect containing the attribute/operation/type interface is loaded, or even before the context is created. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D104397

The idea is now that AppendError<...> will set eReturnStatusFailed for you so you don't have to call SetStatus again. Previously, if the error message was empty, the status wouldn't be set. I don't think there are any situations where the message is in fact empty, but it potentially could be, depending on where we get the string from. So let's set the status up front, then return early if the message is empty. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D104380
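The ordering described here (set the status first, then bail out on an empty message) can be sketched as follows. `ReturnObject` and `append_error` are hypothetical stand-ins written for illustration; they are not LLDB's actual classes.

```python
class ReturnObject:
    """Hypothetical stand-in for a command result object."""

    def __init__(self):
        self.status = "started"
        self.errors = []

    def append_error(self, message):
        # Mark the failure first so that an empty message still fails.
        self.status = "failed"
        if not message:
            return
        self.errors.append(message)


result = ReturnObject()
result.append_error("")
print(result.status, result.errors)
```

With the status assignment placed after the early return instead, the empty-message call would leave the status untouched, which is the case the change guards against.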

Since https://reviews.llvm.org/D103701 AppendError<...> sets this for you. This change includes all of the non-command uses. Some uses remain where it's either tricky to reason about the logic, or they aren't paired with AppendError calls. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D104379

…a value This patch fixes an issue where builds of programs with multiple dbg.values with DIArgList locations could have non-deterministic output. This issue was caused by ReplaceableMetadataImpl::getAllArgListUsers, which returned DIArgList pointers in a random order; the output of this function would later be used to insert dbg.values, causing the order of insertion to be non-deterministic. This patch changes getAllArgListUsers to return pointers in a fixed order. Differential Revision: https://reviews.llvm.org/D104105
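One common way to get a fixed order out of a pointer-keyed container is to tag each entry with a counter at registration time and sort by that counter. The sketch below illustrates that general technique only; `UseTracker` and its methods are hypothetical, and whether the patch uses exactly this bookkeeping is an assumption.

```python
class UseTracker:
    """Hypothetical model of a value that records its users."""

    def __init__(self):
        self._next_index = 0
        self._users = {}   # user -> index assigned at registration time

    def add_user(self, user):
        if user not in self._users:
            self._users[user] = self._next_index
            self._next_index += 1

    def all_users(self):
        # Sort by registration index rather than by address or hash,
        # so repeated runs return the same order.
        return sorted(self._users, key=self._users.get)


tracker = UseTracker()
for name in ["c", "a", "b", "a"]:
    tracker.add_user(name)
print(tracker.all_users())
```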

We need to dedup archive loads (similar to what we do for dylib loads). I noticed this issue after building some Swift stuff that used `-force_load_swift_libs`, as it caused some Swift archives to be loaded many times. Reviewed By: #lld-macho, thakis, MaskRay Differential Revision: https://reviews.llvm.org/D104353
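Deduplicating loads usually comes down to a cache consulted before the expensive work. This is a minimal sketch, assuming archives are identified by path; `ArchiveLoader` is a hypothetical class and does not reflect lld's internals.

```python
class ArchiveLoader:
    """Hypothetical loader that opens each archive path at most once."""

    def __init__(self):
        self._loaded = {}      # path -> archive object
        self.load_count = 0    # number of real loads performed

    def load(self, path):
        cached = self._loaded.get(path)
        if cached is not None:
            return cached
        self.load_count += 1
        archive = {"path": path}   # placeholder for the parsed archive
        self._loaded[path] = archive
        return archive


loader = ArchiveLoader()
first = loader.load("libFoo.a")
second = loader.load("libFoo.a")
print(first is second, loader.load_count)
```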

Summary: This patch, as a follow-up of D95505, adds support for writing long symbol names by implementing the StringTable. Only XCOFF32 is supported for now. Reviewed By: jhenderson, shchenz Differential Revision: https://reviews.llvm.org/D103455

The main motivation behind pointer replacement of LDS use within non-kernel functions is to *avoid* the subsequent LDS lowering pass directly packing LDS (assume large LDS) into a struct type, which would otherwise allocate a huge amount of memory for a struct instance within every kernel. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D103225

This revision adds a BufferizationAliasInfo which maintains and updates information about which tensors will alias once bufferized, which bufferized tensors are equivalent to others, and how to handle clobbers.

Bufferization greedily tries to bufferize inplace by:
1. first trying to bufferize SubTensorInsertOp inplace, in reverse order (these are deemed the most expensive).
2. then trying to bufferize all non SubTensorOp / SubTensorInsertOp, in reverse order.
3. lastly trying to bufferize all SubTensorOp in reverse order.

Reverse order is a heuristic that seems to work nicely because structured tensor codegen very often proceeds by:
1. take a subset of a tensor
2. compute on that subset
3. insert the result subset into the full tensor and yield a new tensor.

BufferizationAliasInfo + equivalence sets + clobber analysis allows bufferizing nested subtensor/compute/subtensor_insert sequences inplace to a certain extent. To fully realize inplace bufferization, additional container-containee analysis will be necessary and is left for a subsequent commit.

Differential revision: https://reviews.llvm.org/D104110
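The three-phase visiting order can be sketched on a flat list of operations. This only models the ordering heuristic; the `(kind, name)` tuples and `inplace_visit_order` are hypothetical and stand in for real MLIR operations.

```python
def inplace_visit_order(ops):
    """Return ops in the order they would be tried for in-place bufferization.

    `ops` is a list of (kind, name) tuples in program order.
    """
    inserts = [op for op in ops if op[0] == "subtensor_insert"]
    subtensors = [op for op in ops if op[0] == "subtensor"]
    others = [op for op in ops
              if op[0] not in ("subtensor", "subtensor_insert")]
    # Each phase walks its ops in reverse program order.
    return inserts[::-1] + others[::-1] + subtensors[::-1]


program = [
    ("subtensor", "a"),
    ("subtensor", "b"),
    ("compute", "c"),
    ("subtensor_insert", "d"),
    ("subtensor_insert", "e"),
]
print([name for _, name in inplace_visit_order(program)])
```

For this nested take/compute/insert sequence the visiting order is `e, d, c, b, a`: the inserts are considered first, then the computation, and the subtensor extractions last.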

This pass aims to optimize VGPR live-range in a typical divergent if-else control flow. For example:

    def(a)
    if(cond)
      use(a)
      ... // A
    else
      use(a)

As AMDGPU accesses VGPRs with respect to the active mask, we can mark `a` as dead in region A. For details, please refer to the comments in the implementation file. The pass is enabled by default; the frontend can disable it through "-amdgpu-opt-vgpr-liverange=false". Differential Revision: https://reviews.llvm.org/D102212

Modify the `SPIRVShuffleVector` constructor to allow passing a nullptr basic block (as is the case for variable initializers). Modify the `SPIRVShuffleVector` constructor to take `SPIRVId`s instead of `SPIRVValue`s, which better reflects what is stored in the class, and saves us an unnecessary ID-to-Value-to-ID round trip in `createInstFromSpecConstantOp`. Original commit: KhronosGroup/SPIRV-LLVM-Translator@72f99e3

Modify the constructors to allow passing a nullptr basic block (as is the case for variable initializers). Modify the constructors to take `SPIRVId`s instead of `SPIRVValue`s, which better reflects what is stored in the class, and saves us an unnecessary ID-to-Value-to-ID round trip in `createInstFromSpecConstantOp`. Modify test/opundef.spt to return the constructed value, so that it does not get optimized out. Original commit: KhronosGroup/SPIRV-LLVM-Translator@9bc1d5a

When a constant expression is used both as an operand of a regular instruction and as an operand of another constant expression, the current lowering algorithm could produce an incorrect order of instructions, because it is not possible to add an instruction operand to a constant expression. Consider the following pseudo code example:

```
call foo(constexpr2(constexpr1), constexpr1)

// After first loop iteration through operands of the call instruction:
A = constexpr2op(constexpr1)
call foo(A, constexpr1)

// Ok, instruction A is now a user of constexpr1, so, when second
// operand is processed, all uses of constexpr1 are updated, but the
// instruction that represents constexpr1 is inserted before call
// instruction because it is now being processed, so it will look like
// this:
A = constexpr2(B)
B = constexpr1
call foo(A, B)

// So, instruction B needs to be moved before all its users to get a
// valid module:
B = constexpr1
A = constexpr2(B)
call foo(A, B)
```

Original commit: KhronosGroup/SPIRV-LLVM-Translator@390aba9
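The final reordering step in the example can be modelled on a list of instructions. This is a minimal sketch under the assumption that an instruction is a `(result, operands)` pair; `hoist_before_first_user` is a hypothetical helper, not the translator's implementation.

```python
def hoist_before_first_user(block, name):
    """Return a copy of `block` in which the instruction defining `name`
    sits in front of the first instruction that uses it."""
    def_pos = next(i for i, (result, _) in enumerate(block) if result == name)
    first_use = next(
        (i for i, (_, operands) in enumerate(block) if name in operands),
        None,
    )
    reordered = list(block)
    if first_use is None or def_pos < first_use:
        return reordered            # already defined before every use
    definition = reordered.pop(def_pos)
    reordered.insert(first_use, definition)
    return reordered


# The invalid order from the example: A uses B before B is defined.
block = [("A", ["B"]), ("B", []), (None, ["A", "B"])]   # last entry is the call
print(hoist_before_first_user(block, "B"))
```

The result places `B` first, followed by `A` and the call, matching the valid module at the end of the example.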

vmaksimo · 2021-06-21T11:06:29Z

/summary:run

mstorsjo and others added 30 commits June 17, 2021 13:02

[LLD] [COFF] Remove a stray duplicate comment. NFC.

ceee35e

The following class isn't part of the export table; there's a second correctly placed comment about the things that actually belong to the export table.

[NFC] test commit, fix namespace ending comment.

b18f30f

[hwasan] Improve report for addresses within regions.

ccc0f77

Before: ADDR is located -320 bytes to the right of 1072-byte region After: ADDR is located 752 bytes inside 1072-byte region Reviewed By: eugenis, walli99 Differential Revision: https://reviews.llvm.org/D104412
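The relationship between the two messages is simple arithmetic: 752 - 1072 = -320. The sketch below, with a hypothetical `describe` helper, shows one way to pick the wording from an offset and a region size. Only the "inside" and "to the right of" phrasings come from the messages above; the "to the left of" case is an assumption.

```python
def describe(offset, size):
    """Describe an address `offset` bytes from the start of a `size`-byte region."""
    if offset < 0:
        # Assumed wording for addresses before the region.
        return f"{-offset} bytes to the left of {size}-byte region"
    if offset >= size:
        return f"{offset - size} bytes to the right of {size}-byte region"
    return f"{offset} bytes inside {size}-byte region"


print(describe(752, 1072))    # 752 bytes inside 1072-byte region
print(752 - 1072)             # -320, the distance printed before the change
```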

[mlir] define a customized DEBUG_TYPE in InterfaceSupport.h

6b63381

[X86] Add test showing binary differences with -x86-pad-for-align.

0bd5bbb

This patch adds a test case showing how a single extra .loc can cause binary differences when using -x86-pad-for-align=true. The issue has been discussed in D94542, PR42138, PR48742.

[libc] Generate one benchmark per implementation

8d64ed8

We now generate as many benchmarks as there are implementations. Differential Revision: https://reviews.llvm.org/D102156

[X86] Check using default in test added in 0bd5bbb.

aa6e8e9

Make sure llvm-mc is invariant with respect to debug locations in the test (checks update to use the -x86-pad-for-align default value)

[mlir][linalg] Purge linalg.indexed_generic.

5b3cb31

Differential Revision: https://reviews.llvm.org/D104449

[X86] combineSelect - refactor MIN/MAX detection code to make it easi…

cdb4fcf

…er to add additional select(setcc,x,y) folds. NFCI. I need to add some additional handling to address some of the regressions from D101074

[mlir] Split things dependent on LLVM_DEBUG into a .cpp file

c878d03

LLVM_DEBUG in headers is awkward, better avoid it. DEBUG_TYPE in a header results in a lot of macro redefinition warnings.

[FuncSpec] Precommit test: don't specialise funcs with NoDuplicate in…

3f59684

…strs. NFC.

[lldb] Remove redundant calls to set eReturnStatusFailed

eaf60a4

This is part 2, covering the commands source. Some uses remain where it's tricky to see what the logic is or they are not used with AppendError. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D104448

[mlir] Remove linalg.indexed_generic forward decl.

7cddf56

[Sema] Fix for PR50741

fc6ec9b

Fixed crash when doing pointer math on a void pointer. Also, reworked test to use -verify rather than FileCheck. Reviewed By: erichkeane Differential Revision: https://reviews.llvm.org/D104424

Revert "[DebugInfo] Prevent non-determinism when updating DIArgList u…

e8991ca

…sers of a value" Commit caused build errors on buildbots with [-Werror,-Wreturn-std-move] enabled. This reverts commit fa1de88.

[llvm] fix typo in comment

26f1f6d

[FPEnv][InstSimplify] Precommit tests for D103169.

60a8edf

In D103169 I'm adding to InstSimplify support for NaN to constrained intrinsics that have a regular FP IR instruction counterpart. Precommit the tests for clarity when that ticket lands.

[clangd] Explicitly fail if the file passed to --check is not valid.

6765b9c

Differential Revision: https://reviews.llvm.org/D104455

Esme-Yi and others added 17 commits June 21, 2021 05:09

[clangd] Type hints for C++14 return type deduction

e37653d

Differential Revision: https://reviews.llvm.org/D103789

[Test] Add some tests showing room for optimization exploiting undef …

3f2ff7c

…and UB

[AMDGPU][Libomptarget] Remove redundant functions

7a97cd9

There does not seem to be any use of these functions. They just put the value to a local which is never used again. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D104512

[gn build] Port 80fd5fa

b746a8d

[mlir][Linalg] NFC - Drop unused variable definition.

11e9a72

[gn build] Port 208332d

808ac8d

[mlir][linalg] Support low padding in subtensor(pad_tensor) lowering

225b960

Differential Revision: https://reviews.llvm.org/D104591

[FuncSpec] Don't specialise functions with NoDuplicate instructions.

342bbb7

getSpecializationCost was returning INT_MAX for a case when specialisation shouldn't happen, but this wasn't properly checked if specialisation was forced. Differential Revision: https://reviews.llvm.org/D104461
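The shape of the fix can be sketched as a guard that is evaluated before the "forced" shortcut. Everything here is hypothetical: `should_specialise`, the `bonus` parameter, and the comparison are illustrative and not taken from the FuncSpec source.

```python
INT_MAX = 2**31 - 1   # sentinel cost meaning "never specialise"


def should_specialise(cost, bonus, forced):
    # The sentinel must win even when specialisation is forced.
    if cost == INT_MAX:
        return False
    # Hypothetical profitability rule, for illustration only.
    return forced or bonus > cost


print(should_specialise(INT_MAX, 0, forced=True))
print(should_specialise(10, 0, forced=True))
```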

Merge remote-tracking branch 'otcshare_llvm/sycl-web' into llvmspirv_…

7540e25

…pulldown

Merge commit '342bbb7832b69cc2adba9acaac0ed2b9bffbe896' into llvmspir…

39d0d53

…v_pulldown

vmaksimo marked this pull request as ready for review June 22, 2021 10:13

vmaksimo requested review from AaronBallman, AGindinson, AlexeySachkov, AlexeySotkin, bader, DenisBakhvalov, elizabethandrews, kbobrovs, mdtoguchi and premanandrao as code owners June 22, 2021 10:13

vladimirlaz merged commit 0f7d1a6 into intel:sycl Jun 22, 2021
