LLVM and SPIRV-LLVM-Translator pulldown (WW46) #4913

vmaksimo · 2021-11-08T07:45:01Z

LLVM: llvm/llvm-project@0fa45d6d
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@ebd464d

…rvals" This reverts commit ec0e1e8. It was pushed by mistake.

Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D113236

…ze object. NFCI This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in lib/External/isl/include/isl/isl-noxceptions.h and the official isl C++ interface. In the official interface the type `isl::size` cannot be casted to an unsigned without previously having checked if it contains a valid value with the function `isl::size::is_error()`. For this reason two helping functions have been added: - `IslAssert`: assert that no errors are present in debug builds and just disables the mandatory error check in non-debug builds - `unisgnedFromIslSIze`: cast the `isl::size` object to `unsigned` Changes made: - Add the functions `IslAssert` and `unsignedFromIslSize` - Add the utility function `rangeIslSize()` - Retype `MaxDisjunctsInDomain` from `int` to `unsigned` - Retype `RunTimeChecksMaxAccessDisjuncts` from `int` to `unsigned` - Retype `MaxDimensionsInAccessRange` from `int` to `unsigned` - Replaced some usages of `isl_size` to `unsigned` since we aim not to use `isl_size` anymore - `isl-noexceptions.h` has been generated by patacca/isl@e704f73 No functional change intended. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113101

In preparation for D112453.

This patch fleshes out the missing documentation for two of the VP intrinsics introduced in D99355: `llvm.vp.load` and `llvm.vp.store`. It does so mostly by deferring to the `llvm.masked.load` and `llvm.masked.store` intrinsics, respectively. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D112930

Fixes one of the regressions in D113192

…esence of segment attribute The ODS-based Python op bindings generator has been generating incorrect specification of the operand segment in presence if both optional and variadic operand groups: optional groups were treated as variadic whereas they require separate treatement. Make sure it is the case. Also harden the tests around generated op constructors as they could hitherto accept the code for both optional and variadic arguments. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D113259

This patch fleshes out the missing documentation for the final two VP intrinsics introduced in D99355: `llvm.vp.gather` and `llvm.vp.scatter`. It does so mostly by deferring to the `llvm.masked.gather` and `llvm.masked.scatter` intrinsics, respectively. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D112997

…tterns The `fir.select` and `fir.select_rank` are lowered to llvm.switch. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D113089 Co-authored-by: Jean Perier <[email protected]> Co-authored-by: Eric Schweitz <[email protected]>

…gn bits. NFC

…> bitcast(bitlogic(x,y)) To constant fold bitwise logic ops where we've legalized constant build vectors to a different type (e.g. v2i64 -> v4i32), this patch adds a basic ability to peek through the bitcasts and perform the constant fold on the inner operands. The MVE predicate v2i64 regressions will be addressed by future support for basic v2i64 type support. One of the yak shaving fixes for D113192.... Differential Revision: https://reviews.llvm.org/D113202

Differential Revision: https://reviews.llvm.org/D113172

This symbol is defined in libc.so so it is definitely not DSO-Local. Marking it as such causes problems on some platforms (such as PowerPC). Differential revision: https://reviews.llvm.org/D109090

A pattern has selected wrong uaddlv MI. It should be as below. uaddv(uaddlp(v8i8)) ==> uaddlv(v8i8) Differential Revision: https://reviews.llvm.org/D113263

NumOps represents the number of elements for vector constant folding, rename this NumElts so in future we can the consistently use NumOps to represent the number of operands of the opcode. Minor cleanup before trying to begin generalizing FoldConstantArithmetic to support opcodes other than binops.

[NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186

When created a UUNPKLO/HI node with an undef input then the output should also be undef. I've added a target DAG combine function to ensure we avoid creating an unnecessary uunpklo/hi instruction. Differential Revision: https://reviews.llvm.org/D113266

Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D113213

Avid readers of this saga may recall from previous installments, that replication mask replicates (lol) each of the `VF` elements in a vector `ReplicationFactor` times. For example, the mask for `ReplicationFactor=3` and `VF=4` is: `<0,0,0,1,1,1,2,2,2,3,3,3>`. More importantly, replication mask is used by LoopVectorizer when using masked interleaved memory operations. As discussed in previous installments, while it is used by LV, and we **seem** to support masked interleaved memory operations on X86, it's support in cost model leaves a lot to be desired: until basically yesterday even for AVX512 we had no cost model for it. As it has been witnessed in the recent AVX2 `X86TTIImpl::getInterleavedMemoryOpCost()` costmodel patches, while it is hard-enough to query the cost of a particular assembly sequence [from llvm-mca], afterwards the check lines LV costmodel tests must be updated manually. This is, at the very least, boring. Okay, now we have decent costmodel coverage for interleaving shuffles, but now basically the same mind-killing sequence has to be performed for replication mask. I think we can improve at least the second half of the problem, by teaching the `TargetTransformInfoImplCRTPBase::getUserCost()` to recognize `Instruction::ShuffleVector` that are repetition masks, adding exhaustive test coverage using `-cost-model -analyze` + `utils/update_analyze_test_checks.py` This way we can have good exhaustive coverage for cost model, and only basic coverage for the LV costmodel. This patch adds precise undef-aware `isReplicationMask()`, with exhaustive test coverage. * `InstructionsTest.ShuffleMaskIsReplicationMask` shows that it correctly detects all the known masks. * `InstructionsTest.ShuffleMaskIsReplicationMask_undef` shows that replacing some mask elements in a known replication mask still allows us to recognize it as a replication mask. Note, with enough undef elts, we may detect a different tuple. * `InstructionsTest.ShuffleMaskIsReplicationMask_Exhaustive_Correctness` shows that if we detected the replication mask with given params, then if we actually generate a true replication mask with said params, it matches element-wise ignoring undef mask elements. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D113214

(X s< 0) ? Y : 0 --> (X s>> BW-1) & Y We canonicalize to the icmp+select form in IR, and we already have this fold for scalar select in SDAG, so I think it's an oversight that we don't have the fold for vectors. It seems neutral for AArch64 and saves some instructions on x86. Whether we should also have the sibling folds for the inverse condition or all-ones true value may depend on target-specific factors such as whether there's an "and-not" instruction. Differential Revision: https://reviews.llvm.org/D113212

…t i1 Even though AVX512's masked mem ops (unlike AVX1/2) have a mask that is a `VF x i1`, replication of said masks happens after promotion of it to `VF x i8`, so we should use `i8`, not `i1`, when calculating the cost of mask replication.

Another minor step towards merging FoldConstantVectorArithmetic into FoldConstantArithmetic. We don't use SDNodeFlags in any constant folding inside DAG, so passing the Flags argument is a waste of time - an alternative would be to wire up FoldConstantArithmetic to take SDNodeFlags just-in-case we someday start using it, but we don't have any way to test it and I'd prefer to avoid dead code. Differential Revision: https://reviews.llvm.org/D113276

This introduces a new ComputeMinSignedBits method for ValueTracking that returns the BitWidth - SignBits + 1 from ComputeSignBits, and represents the minimum bit size for the value as a signed integer. Similar to the existing APInt::getMinSignedBits method, this can make some of the reasoning around ComputeSignBits more natural. See https://reviews.llvm.org/D112298

Previously, jumps to labels in constructs from exterior statements would elicit only a warning. Upgrade these to errors unless the branch into the construct would enter into only DO, IF, and SELECT CASE constructs, whose interiors don't scope variables or have other set-up/tear-down semantics. Branches into these "safe" constructs are still errors if they're nested in an unsafe construct that doesn't also enclose the exterior branch statement. Differential Revision: https://reviews.llvm.org/D113310

Component::CreatePointerDescriptor unconditionally expects a vector of subscripts to be passed as an argument, but is called from NAMELIST input with a null pointer. Make that argument a nullable pointer, move it to the end of the argument list, and give it a default value of nullptr. Differential Revision: https://reviews.llvm.org/D113312

CONFLICT (content): Merge conflict in llvm/lib/Support/Triple.cpp CONFLICT (content): Merge conflict in clang/lib/CodeGen/TargetInfo.cpp CONFLICT (content): Merge conflict in clang/lib/Basic/Targets/SPIR.h CONFLICT (content): Merge conflict in clang/lib/Basic/Targets.cpp

InstCombine converts range tests of the form (X > C1 && X < C2) or (X < C1 || X > C2) into checks of the form (X + C3 < C4) or (X + C3 > C4). It is possible to express all range tests in either of these forms (with different choices of constants), but currently neither of them is considered canonical. We may have equivalent range tests using either ult or ugt. This proposes to canonicalize all range tests to use ult. An alternative would be to canonicalize to either ult or ugt depending on the specific constants involved -- e.g. in practice we currently generate ult for && style ranges and ugt for || style ranges when going through the insertRangeTest() helper. In fact, the "clamp like" fold was relying on this, which is why I had to tweak it to not assume whether inversion is needed based on just the predicate. Proof: https://alive2.llvm.org/ce/z/_SP_rQ Differential Revision: https://reviews.llvm.org/D113366

This is a potential follow-up suggested in D113212.

This fixes the following clang VFS tests, if `windows_slash` is the default style: Clang :: VFS/implicit-include.c Clang :: VFS/relative-path.c Clang-Unit :: Frontend/./FrontendTests.exe/CompilerInstance.DefaultVFSOverlayFromInvocation Also clarify a couple references to `Style::windows` into `Style::windows_backslash`, to make it clearer that each of them are opinionated in different directions (even if it doesn't matter for calls to e.g. `is_absolute`). Differential Revision: https://reviews.llvm.org/D113272

…yHandleW This is needed for the paths to work when using forward slashes; this fixes the DirectoryWatcherTests unit tests. Also allocate missing space for the null terminator, which seems to have been missing all along (writing the terminator out of bounds). Differential Revision: https://reviews.llvm.org/D113264

Also remove a comment that seems to be left behind. Differential Revision: https://reviews.llvm.org/D113269

This reverts commit c499d69.

This reverts commit db81d8f.

CONFLICT (content): Merge conflict in clang/test/CodeGenSYCL/unique_stable_name.cpp CONFLICT (content): Merge conflict in clang/test/CodeGenSYCL/address-space-mangling.cpp

CONFLICT (content): Merge conflict in clang/lib/CodeGen/CGGPUBuiltin.cpp

…bled noundef analysis by default has been reverted. Once re-committed in the community, this patch must be reverted to restore noundef in LIT tests and fix failures.

…pulldown

vmaksimo · 2021-11-09T16:39:18Z

/summary:run

llvm-project commit (https://reviews.llvm.org/D98895) resulted in an additional diagnostic being generated at template definition as well. Deferred diagnostics should be cleaned up to remove superfluous diagnostics. This will be done in a follow up patch.

vmaksimo · 2021-11-11T18:21:59Z

/summary:run

Original commit: KhronosGroup/SPIRV-LLVM-Translator@a9a5c91

vmaksimo · 2021-11-18T14:23:23Z

/summary:run

vmaksimo · 2021-11-22T09:12:03Z

/summary:run

vmaksimo · 2021-11-24T16:50:08Z

Merged as a part of #4993

jayfoad and others added 30 commits November 5, 2021 09:10

[AMDGPU] NFC formatting fixes in SIMemoryLegalizer

c93bf53

Revert "[TwoAddressInstructionPass] Update existing physreg live inte…

0321bd6

…rvals" This reverts commit ec0e1e8. It was pushed by mistake.

[PowerPC] use correct selection for v16i8/v8i16 splat load

fed2889

Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D113236

[Sema][NFC] Add tests for builtin spaceship operator.

737f540

In preparation for D112453.

[ARM] Extra MVE constant select test. NFC

cb62c37

[X86] Enable v32i16 rotate lowering on non-BWI targets

5e9ac7c

Fixes one of the regressions in D113192

[InstCombine] Add additional tests for converting to sadd.sat with si…

cd8cb53

…gn bits. NFC

Enable -Wformat-pedantic and fix fallout.

13a442c

Differential Revision: https://reviews.llvm.org/D113172

[FreeBSD] Do not mark __stack_chk_guard as dso_local

1cb9f37

This symbol is defined in libc.so so it is definitely not DSO-Local. Marking it as such causes problems on some platforms (such as PowerPC). Differential revision: https://reviews.llvm.org/D109090

[AArch64] Fix a bug from a pattern for uaddv(uaddlp(x)) ==> uaddlv

a7b1872

A pattern has selected wrong uaddlv MI. It should be as below. uaddv(uaddlp(v8i8)) ==> uaddlv(v8i8) Differential Revision: https://reviews.llvm.org/D113263

[gn build] (manually) port df0ba47

a160aba

[NFC] Inclusive language: Remove instances of master in URLs

c71fbdd

[NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186

[NFC] Move CombinationGenerator from Exegesis to ADT

7a98761

Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D113213

[gn build] Port 7a98761

6cd309b

[InstCombine] add signbit tests for icmp with trunc; NFC

05f64b5

[x86] add tests for vector select; NFC

8918814

[AArch64] add tests for vector select; NFC

1e7afa2

ldionne and others added 17 commits November 8, 2021 14:34

[libc++] Trigger a rebuild of the CI Docker images

1837a83

Merge remote-tracking branch 'intel_llvm/sycl' into llvmspirv_pulldown

ef6d352

[AArch][x86] add tests for vselect; NFC

e2b1d32

This is a potential follow-up suggested in D113212.

[clang-move] Fix unit tests with forward slash as separator on windows

98f0bf7

Also remove a comment that seems to be left behind. Differential Revision: https://reviews.llvm.org/D113269

Revert "[openmp] Fix build, test passes on CI unexpectedly"

dc9edc6

This reverts commit c499d69.

Revert "[OpenMP] Lower printf to __llvm_omp_vprintf"

0fa45d6

This reverts commit db81d8f.

Merge from 'main' to 'sycl-web' (1 commits)

bae63e8

Merge from '"main"' to '"sycl-web"' (14 commits)

b4dd18c

CONFLICT (content): Merge conflict in clang/test/CodeGenSYCL/unique_stable_name.cpp CONFLICT (content): Merge conflict in clang/test/CodeGenSYCL/address-space-mangling.cpp

Merge from '"main"' to '"sycl-web"' (12 commits)

4bb9454

CONFLICT (content): Merge conflict in clang/lib/CodeGen/CGGPUBuiltin.cpp

Merge from '"main"' to '"sycl-web"' (17 commits)

61ba0f7

CONFLICT (content): Merge conflict in clang/lib/CodeGen/CGGPUBuiltin.cpp

Revert noundef additions in LIT tests after community patch which ena…

254ab74

…bled noundef analysis by default has been reverted. Once re-committed in the community, this patch must be reverted to restore noundef in LIT tests and fix failures.

vmaksimo added the disable-lint Skip linter check step and proceed with build jobs label Nov 9, 2021

vmaksimo added 2 commits November 9, 2021 12:06

Merge remote-tracking branch 'otcshare_llvm/sycl-web' into llvmspirv_…

1e4329a

…pulldown

Remove unresolved conflict in libclc CMake

9b9f7fd

vmaksimo added 3 commits November 18, 2021 17:13

Fix compilation warning

86e1e62

Support DW_OP_LLVM_arg DWARF operation

072f57d

Original commit: KhronosGroup/SPIRV-LLVM-Translator@a9a5c91

Merge remote-tracking branch 'intel_llvm/sycl' into llvmspirv_pulldown

e683d37

vmaksimo closed this Nov 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW46) #4913

LLVM and SPIRV-LLVM-Translator pulldown (WW46) #4913

Uh oh!

vmaksimo commented Nov 8, 2021 •

edited

Loading

Uh oh!

vmaksimo commented Nov 9, 2021

Uh oh!

vmaksimo commented Nov 11, 2021

Uh oh!

vmaksimo commented Nov 18, 2021

Uh oh!

vmaksimo commented Nov 22, 2021

Uh oh!

vmaksimo commented Nov 24, 2021

Uh oh!

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW46) #4913

LLVM and SPIRV-LLVM-Translator pulldown (WW46) #4913

Uh oh!

Conversation

vmaksimo commented Nov 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vmaksimo commented Nov 9, 2021

Uh oh!

vmaksimo commented Nov 11, 2021

Uh oh!

vmaksimo commented Nov 18, 2021

Uh oh!

vmaksimo commented Nov 22, 2021

Uh oh!

vmaksimo commented Nov 24, 2021

Uh oh!

Uh oh!

vmaksimo commented Nov 8, 2021 •

edited

Loading