[Docs][RISCV] Document RISC-V vector codegen #96740

lukel97 · 2024-06-26T08:01:03Z

This is a revival of https://reviews.llvm.org/D142348, and attempts to document how RVV semantics can be expressed in LLVM IR as well as how codegen works in the backend.

Parts of this are taken from the original RFC https://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html, but I've largely rewritten this from the original differential revision to exclude explaining the specification itself and instead just focus on the LLVM specific bits. (I figured that there's better material available elsewhere for learning about RVV itself)

I've also updated it to include as much as I know about fixed vector codegen as well as the recent changes to vsetvli insertion. Let me know if I'm missing anything else that would be useful to document.

llvmbot · 2024-06-26T08:01:42Z

@llvm/pr-subscribers-backend-risc-v

Author: Luke Lau (lukel97)

Changes

This is a revival of https://reviews.llvm.org/D142348, and attempts to document how RVV semantics can be expressed in LLVM IR as well as how codegen works in the backend.

Parts of this are taken from the original RFC https://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html, but I've largely rewritten this from the original differential revision to exclude explaining the specification itself and instead just focus on the LLVM specific bits. (I figured that there's better material available elsewhere for learning about RVV itself)

I've also updated it to include as much as I know about fixed vector codegen as well as the recent changes to vsetvli insertion. Let me know if I'm missing anything else that would be useful to document.

Full diff: https://github.com/llvm/llvm-project/pull/96740.diff

2 Files Affected:

(added) llvm/docs/RISCV/RISCVVectorExtension.rst (+285)
(modified) llvm/docs/UserGuides.rst (+4)

diff --git a/llvm/docs/RISCV/RISCVVectorExtension.rst b/llvm/docs/RISCV/RISCVVectorExtension.rst
new file mode 100644
index 0000000000000..41436f79dd44c
--- /dev/null
+++ b/llvm/docs/RISCV/RISCVVectorExtension.rst
@@ -0,0 +1,285 @@
+=========================
+ RISC-V Vector Extension
+=========================
+
+.. contents::
+   :local:
+
+The RISC-V target readily supports the 1.0 version of the `RISC-V Vector Extension (RVV) <https://github.com/riscv/riscv-v-spec/blob/v1.0/v-spec.adoc>`_, but requires some tricks to handle its unique design.
+This guide gives an overview of how RVV is modelled in LLVM IR and how the backend generates code for it.
+
+Mapping to LLVM IR types
+========================
+
+RVV adds 32 ``VLEN`` sized registers, where ``VLEN`` is a constant that is unknown to the compiler. To be able to represent ``VLEN`` sized values, the RISC-V backend takes the same approach as AArch64's SVE and uses `scalable vector types <https://lists.llvm.org/pipermail/llvm-dev/2018-July/124396.html>`_.
+
+Scalable vector types are of the form ``<vscale x n x ty>``, which indicates a vector with a multiple of ``n`` elements of type ``ty``. ``n`` and ``ty`` then end up controlling LMUL and SEW respectively.
+
+LLVM supports only ``ELEN=32`` or ``ELEN=64``, so ``vscale`` is defined as ``VLEN/64`` (see ``RISCV::RVVBitsPerBlock``).
+This makes the LLVM IR types stable between the two ``ELEN`` s considered, i.e. every LLVM IR scalable vector type has exactly one corresponding pair of element type and LMUL, and vice-versa.
+
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+|                   | LMUL=⅛        | LMUL=¼         | LMUL=½           | LMUL=1            | LMUL=2            | LMUL=4            | LMUL=8            |
++===================+===============+================+==================+===================+===================+===================+===================+
+| i64 (ELEN=64)     | N/A           | N/A            | N/A              | <v x 1 x i64>     | <v x 2 x i64>     | <v x 4 x i64>     | <v x 8 x i64>     |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| i32               | N/A           | N/A            | <v x 1 x i32>    | <v x 2 x i32>     | <v x 4 x i32>     | <v x 8 x i32>     | <v x 16 x i32>    |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| i16               | N/A           | <v x 1 x i16>  | <v x 2 x i16>    | <v x 4 x i16>     | <v x 8 x i16>     | <v x 16 x i16>    | <v x 32 x i16>    |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| i8                | <v x 1 x i8>  | <v x 2 x i8>   | <v x 4 x i8>     | <v x 8 x i8>      | <v x 16 x i8>     | <v x 32 x i8>     | <v x 64 x i8>     |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| double (ELEN=64)  | N/A           | N/A            | N/A              | <v x 1 x double>  | <v x 2 x double>  | <v x 4 x double>  | <v x 8 x double>  |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| float             | N/A           | N/A            | <v x 1 x float>  | <v x 2 x float>   | <v x 4 x float>   | <v x 8 x float>   | <v x 16 x float>  |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+| half              | N/A           | <v x 1 x half> | <v x 2 x half>   | <v x 4 x half>    | <v x 8 x half>    | <v x 16 x half>   | <v x 32 x half>   |
++-------------------+---------------+----------------+------------------+-------------------+-------------------+-------------------+-------------------+
+
+(Read ``<v x k x ty>`` as ``<vscale x k x ty>``)
+
+
+Mask vector types
+-----------------
+
+As for mask vectors, they are physically represented using a layout of densely packed bits in a vector register.
+They are mapped to the following LLVM IR types:
+
+- <vscale x 1 x i1>
+- <vscale x 2 x i1>
+- <vscale x 4 x i1>
+- <vscale x 8 x i1>
+- <vscale x 16 x i1>
+- <vscale x 32 x i1>
+- <vscale x 64 x i1>
+
+Two types with the same SEW/LMUL ratio will have the same related mask type. For instance, two different comparisons, one under SEW=64, LMUL=2 and the other under SEW=32, LMUL=1, will both generate a ``<vscale x 2 x i1>`` mask.
+
+Representation in LLVM IR
+=========================
+
+Vector instructions can be represented in three main ways in LLVM IR:
+
+1. Regular instructions on both fixed and scalable vector types
+
+   .. code-block:: llvm
+
+       %c = add <vscale x 4 x i32> %a, %b
+
+2. RISC-V vector intrinsics, which mirror the `C intrinsics specification <https://github.com/riscv-non-isa/rvv-intrinsic-doc>`_
+
+   These come in unmasked variants:
+
+   .. code-block:: llvm
+
+       %c = call @llvm.riscv.vadd.nxv4i32.nxv4i32(
+              <vscale x 4 x i32> %passthru,
+              <vscale x 4 x i32> %a,
+              <vscale x 4 x i32> %b,
+              i64 %avl
+            )
+
+   As well as masked variants:
+
+   .. code-block:: llvm
+
+       %c = call @llvm.riscv.vadd.mask.nxv4i32.nxv4i32(
+              <vscale x 4 x i32> %passthru,
+              <vscale x 4 x i32> %a,
+              <vscale x 4 x i32> %b,
+              <vscale x 4 x i1> %mask, i64 %avl, i64 0 ; i64 0 is the policy immediate
+            )
+
+   Both allow setting the AVL as well as controlling the inactive/tail elements via the passthru operand, but the masked variant also provides operands for the mask and ``vta``/``vma`` policy bits.
+
+   The only valid types are scalable vector types.
+
+3. :doc:`Vector predication (VP) intrinsics </Proposals/VectorPredication>`
+
+   .. code-block:: llvm
+
+       %c = call @llvm.vp.add.nxv4i32(
+              <vscale x 4 x i32> %a,
+              <vscale x 4 x i32> %b,
+              <vscale x 4 x i1> %m,
+              i32 %evl
+            )
+
+   Unlike RISC-V intrinsics, VP intrinsics are target agnostic so they can be emitted from other optimisation passes in the middle-end (like the loop vectorizer). They also support fixed length vector types.
+
+SelectionDAG lowering
+=====================
+
+For regular **scalable** vector LLVM IR instructions, their corresponding SelectionDAG nodes are legal on RISC-V and don't require any custom lowering.
+
+.. code-block::
+
+   t5: nxv4i32 = add t2, t4
+
+RISC-V vector intrinsics are also always scalable and so don't need custom lowering:
+
+.. code-block::
+
+   t12: nxv4i32 = llvm.riscv.vadd TargetConstant:i64<10056>, undef:nxv4i32, t2, t4, t6
+
+Fixed length vectors
+--------------------
+
+The only legal vector MVTs on RISC-V are scalable, so operations on fixed length vectors need to be custom lowered and performed in a scalable container type.
+
+1. The fixed length vector operands are inserted into scalable containers via ``insert_subvector``. The container size is chosen to have a minimum size big enough to fit the fixed length vector (see ``getContainerForFixedLengthVector``).
+2. The operation is then performed via a scalable **VL (vector length) node**. These are custom nodes that contain an AVL operand which is set to the size of the fixed length vector, and are defined in RISCVInstrInfoVVLPatterns.td.
+3. The result is put back into a fixed length vector via ``extract_subvector``.
+
+.. code-block::
+
+   t2: nxv2i32,ch = CopyFromReg t0, Register:nxv2i32 %0
+     t4: v4i32 = extract_subvector t2, Constant:i64<0>
+       t6: nxv2i32,ch = CopyFromReg t0, Register:nxv2i32 %1
+     t7: v4i32 = extract_subvector t6, Constant:i64<0>
+   t8: v4i32 = add t4, t7
+
+   // custom lowered to:
+
+       t2: nxv2i32,ch = CopyFromReg t0, Register:nxv2i32 %0
+       t6: nxv2i32,ch = CopyFromReg t0, Register:nxv2i32 %1
+       t15: nxv2i1 = RISCVISD::VMSET_VL Constant:i64<4>
+     t16: nxv2i32 = RISCVISD::ADD_VL t2, t6, undef:nxv2i32, t15, Constant:i64<4>
+   t17: v4i32 = extract_subvector t16, Constant:i64<0>
+
+VL nodes often have a passthru and a mask operand, which are usually set to undef and all ones, respectively, for fixed length vectors.
+
+The ``insert_subvector`` and ``extract_subvector`` nodes responsible for wrapping and unwrapping will get combined away, and eventually we will lower all fixed vector types to scalable. Note that the vectors at the interface of a function are always scalable vectors.
+
+.. note::
+
+   The only ``insert_subvector`` and ``extract_subvector`` nodes that make it through lowering are those that can be performed as an exact subregister insert or extract. This means that any fixed length vector ``insert_subvector`` and ``extract_subvector`` nodes that aren't legalized must lie on a register group boundary, so the exact ``VLEN`` must be known at compile time (i.e. compiled with ``-mrvv-vector-bits=zvl`` or ``-mllvm -riscv-v-vector-bits-max=VLEN``, or have an exact ``vscale_range`` attribute).
+
+Vector predication intrinsics
+-----------------------------
+
+VP intrinsics also get custom lowered via VL nodes in order to set the EVL and mask.
+
+.. code-block::
+
+   t12: nxv2i32 = vp_add t2, t4, t6, Constant:i64<8>
+
+   // custom lowered to:
+
+   t18: nxv2i32 = RISCVISD::ADD_VL t2, t4, undef:nxv2i32, t6, Constant:i64<8>
+
+
+Instruction selection
+=====================
+
+VL and VTYPE need to be configured correctly, so we can't just directly select the underlying vector MachineInstrs. Instead, a layer of pseudo instructions gets selected, which carries the extra information needed to emit the necessary ``vsetvli`` instructions later.
+
+.. code-block::
+
+   %c:vrm2 = PseudoVADD_VV_M2 %passthru:vrm2(tied-def 0), %a:vrm2, %b:vrm2, %vl:gpr, 5
+
+Each vector instruction has multiple pseudo instructions defined in ``RISCVInstrInfoVPseudos.td``.
+
+The pseudos have operands for the AVL and SEW (encoded as a power of 2), as well as potentially the mask, policy or rounding mode if applicable.
+The passthru operand is tied to the destination register to control the inactive/tail elements.
+
+For each possible LMUL there is a variant of the pseudo instruction, as it affects the register class needed for the operands, and similarly there are ``_MASK`` variants that control whether or not the instruction is masked.
+
+For scalable vectors that should use VLMAX, the AVL is set to a sentinel value of -1.
+
+There are patterns for target agnostic SelectionDAG nodes in ``RISCVInstrInfoVSDPatterns.td``, VL nodes in ``RISCVInstrInfoVVLPatterns.td`` and RVV intrinsics in ``RISCVInstrInfoVPseudos.td``.
+
+Mask patterns
+-------------
+
+For the VL patterns we only match to masked pseudos to reduce the size of the match table, even if the node's mask is all ones and could be an unmasked pseudo. ``RISCVDAGToDAGISel::doPeepholeMaskedRVV`` detects that the mask is all ones during post-processing and converts the pseudo into its unmasked form.
+
+.. code-block::
+
+     t15: nxv4i1 = RISCVISD::VMSET_VL Constant:i32<-1>
+   t16: nxv4i32 = PseudoVADD_VV_M2_MASK t0, t2, t4, t15, -1, 5
+
+   // gets optimized to:
+
+   t16: nxv4i32 = PseudoVADD_VV_M2 t0, t2, t4, -1, 5
+
+.. note::
+
+   Any vmset_vl can be treated as an all ones mask since the tail elements past VL are undef and can be replaced with ones.
+
+For masked pseudos the mask operand is copied to the physical ``$v0`` register with a glued ``CopyToReg`` node:
+
+.. code-block::
+
+     t23: ch,glue = CopyToReg t0, Register:nxv4i1 $v0, t6
+   t25: nxv4i32 = PseudoVADD_VV_M2_MASK Register:nxv4i32 $noreg, t2, t4, Register:nxv4i1 $v0, TargetConstant:i64<8>, TargetConstant:i64<5>, TargetConstant:i64<1>, t23:1
+
+Register allocation
+===================
+
+Register allocation is split between vector and scalar registers, with vector allocation running first:
+
+.. code-block::
+
+  $v8m2 = PseudoVADD_VV_M2 $v8m2(tied-def 0), $v8m2, $v10m2, %vl:gpr, 5
+
+.. note::
+
+   We split register allocation between vectors and scalars so that :ref:`RISCVInsertVSETVLI` can run after vector register allocation, but still before scalar register allocation as it may need to create a new virtual register to set the AVL to VLMAX.
+
+   Performing RISCVInsertVSETVLI after vector register allocation imposes fewer constraints on the machine scheduler since it cannot schedule instructions past vsetvlis, and it allows us to emit further vector pseudos during spilling or constant rematerialization.
+
+There are four register classes for vectors:
+
+- ``VR`` for vector registers (``v0``, ``v1``, ..., ``v31``). Used when :math:`\text{LMUL} \leq 1` and for mask registers.
+- ``VRM2`` for vector groups of length 2 i.e. :math:`\text{LMUL}=2` (``v0m2``, ``v2m2``, ..., ``v30m2``)
+- ``VRM4`` for vector groups of length 4 i.e. :math:`\text{LMUL}=4` (``v0m4``, ``v4m4``, ..., ``v28m4``)
+- ``VRM8`` for vector groups of length 8 i.e. :math:`\text{LMUL}=8` (``v0m8``, ``v8m8``, ..., ``v24m8``)
+
+:math:`\text{LMUL} \lt 1` types and mask types do not benefit from having a dedicated class, so ``VR`` is used in their case.
+
+Some instructions have a constraint that a register operand cannot be ``V0`` or overlap with ``V0``, so for these cases we also have ``VRNoV0`` variants.
+
+.. _RISCVInsertVSETVLI:
+
+RISCVInsertVSETVLI
+==================
+
+After vector registers are allocated, the RISCVInsertVSETVLI pass will insert the necessary vsetvlis for the pseudos.
+
+.. code-block::
+
+  dead $x0 = PseudoVSETVLI %vl:gpr, 209, implicit-def $vl, implicit-def $vtype
+  $v8m2 = PseudoVADD_VV_M2 $v8m2(tied-def 0), $v8m2, $v10m2, $noreg, 5, implicit $vl, implicit $vtype
+
+The physical ``$vl`` and ``$vtype`` registers are implicitly defined by the ``PseudoVSETVLI``, and are implicitly used by the ``PseudoVADD``.
+The VTYPE operand (``209`` in this example) is encoded as per the specification via ``RISCVVType::encodeVTYPE``.
+
+RISCVInsertVSETVLI performs dataflow analysis to emit as few vsetvlis as possible. It will also try to minimize the number of vsetvlis that set VL, i.e. it will emit ``vsetvli x0, x0`` if only VTYPE needs to change but VL doesn't.
+
+Pseudo expansion and printing
+=============================
+
+After scalar register allocation, the pass in ``RISCVExpandPseudoInsts.cpp`` expands out the ``PseudoVSETVLI``.
+
+.. code-block::
+
+   dead $x0 = VSETVLI $x1, 209, implicit-def $vtype, implicit-def $vl
+   renamable $v8m2 = PseudoVADD_VV_M2 $v8m2(tied-def 0), $v8m2, $v10m2, $noreg, 5, implicit $vl, implicit $vtype
+
+Note that the vector pseudo remains because it is needed to encode the register class for the LMUL. Its AVL and SEW operands are unused at this point.
+
+``RISCVAsmPrinter`` will then lower the pseudo instructions into real ``MCInsts``.
+
+.. code-block:: nasm
+
+   vsetvli zero, ra, e32, m2, ta, ma
+   vadd.vv v8, v8, v10
+
+
+See also
+========
+
+- `2023 LLVM Dev Mtg - Vector codegen in the RISC-V backend <https://youtu.be/-ox8iJmbp0c?feature=shared>`_
+- `2023 LLVM Dev Mtg - How to add an C intrinsic and code-gen it, using the RISC-V vector C intrinsics <https://youtu.be/t17O_bU1jks?feature=shared>`_
+- `2021 LLVM Dev Mtg “Optimizing code for scalable vector architectures” <https://youtu.be/daWLCyhwrZ8?feature=shared>`_
diff --git a/llvm/docs/UserGuides.rst b/llvm/docs/UserGuides.rst
index 18d273a51daf6..bf7cdda89a009 100644
--- a/llvm/docs/UserGuides.rst
+++ b/llvm/docs/UserGuides.rst
@@ -64,6 +64,7 @@ intermediate LLVM representation.
    Remarks
    RemoveDIsDebugInfo
    RISCVUsage
+   RISCV/RISCVVectorExtension
    SourceLevelDebugging
    SPIRVUsage
    StackSafetyAnalysis
@@ -284,3 +285,6 @@ Additional Topics
 
 :doc:`RISCVUsage`
    This document describes using the RISCV-V target.
+
+:doc:`RISCV/RISCVVectorExtension`
+   This document describes how the RISC-V Vector extension can be expressed in LLVM IR and how code is generated for it in the backend.

wangpc-pp

Great! I don't see any problem here!
I think we need some unfamiliar eyes on this doc, because we may miss some important details since we all are familiar with RVV implementation.

llvm/docs/RISCV/RISCVVectorExtension.rst

wangpc-pp

I have written an internal doc about this before, and this is how I organized the content:

Hopefully this can help us to perfect this doc. :-)

llvm/docs/RISCV/RISCVVectorExtension.rst

tschuett · 2024-06-26T11:06:43Z

Thanks! It is inconsistent whether whiches deserve a comma. A glossary at the bottom would be nice. What is AVL?

llvm/docs/RISCV/RISCVVectorExtension.rst

lukel97 · 2024-06-26T13:01:39Z

> I have written an internal doc about this before, and this is how I organized the content: Hopefully this can help us to perfect this doc. :-)

Nice! Good idea covering early-clobber and TIED pseudos, they probably warrant some documentation too.

lukel97 · 2024-06-26T13:05:20Z

> Thanks! It is inconsistent whether whiches deserve a comma. A glossary at the bottom would be nice. What is AVL?

The AVL is the application vector length, i.e. the operand to vsetvli. I think it's quite easily confused with VL, which is the actual register. Those _VL nodes should probably be called _AVL nodes, and likewise everywhere where we have a "VL" value. The VL register isn't guaranteed to end up as that value, it's just what RISCVInsertVSETVLI will use as the vsetvli's AVL.
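
To make the distinction concrete, here is a minimal Python sketch of how VL can be derived from a requested AVL. The function names are hypothetical, and choosing `min(AVL, VLMAX)` is an assumption: it is one behaviour the RVV 1.0 specification permits, but an implementation may pick a smaller VL when AVL lies strictly between VLMAX and 2*VLMAX.

```python
def vlmax(vlen: int, sew: int, lmul: float) -> int:
    """VLMAX = LMUL * VLEN / SEW, as defined by the RVV 1.0 specification."""
    return int(lmul * vlen) // sew


def vl_from_avl(avl: int, vlen: int, sew: int, lmul: float) -> int:
    """Return one legal VL for the requested AVL (hypothetical helper).

    Assumption: the hardware always picks min(AVL, VLMAX). The specification
    also allows any VL with ceil(AVL / 2) <= VL <= VLMAX when
    VLMAX < AVL < 2 * VLMAX, so real hardware may return something else there.
    """
    return min(avl, vlmax(vlen, sew, lmul))


# With VLEN=128, SEW=32 and LMUL=2 there are 8 elements per register group.
small = vl_from_avl(4, vlen=128, sew=32, lmul=2)    # AVL fits, so VL == AVL
large = vl_from_avl(100, vlen=128, sew=32, lmul=2)  # AVL is clamped to VLMAX
```

So the value carried by a `_VL` node or a pseudo's "VL" operand is only a request. The VL register ends up equal to it only when the request does not exceed VLMAX.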

llvm/docs/RISCV/RISCVVectorExtension.rst

- Fix use of which
- Clarify what VP operands are used for in VL nodes
- Clarify why RVV intrinsics are legal
- Be more precise on usage of "legal" fixed vector types. They are legal, there's just no patterns for them
- Clarify how fixed length vector arguments are passed in

Still some comments to address!

llvm/docs/RISCV/RISCVVectorExtension.rst

frasercrmck

In general this looks great, thank you!

I'm not arguing particularly strongly in favour of this suggestion -- and despite me being a native speaker I'm still not 100% on this -- but when the phrase "fixed length" is used as an adjective, as in "fixed-length vector", then it's technically best to use the hyphenated form. As in, "a fixed-length vector has a fixed length". Same with certain uses of "target-agnostic" vs "target agnostic". Again, I don't want to be too prescriptive on this matter and it's not a blocker for me.

llvm/docs/RISCV/RISCVVectorExtension.rst

frasercrmck · 2024-06-26T16:18:40Z

llvm/docs/RISCV/RISCVVectorExtension.rst

+RISCVInsertVSETVLI
+==================
+
+After vector registers are allocated, the RISCVInsertVSETVLI pass will insert the necessary vsetvlis for the pseudos.


I think in general, when referencing code concepts like pass names RISCVInsertVSETVLI or specific ISA instructions (vsetvli), back-ticking/escaping them is best.

Agreed: I noticed that the specification doesn't back-tick the design-time constants like EVL/AVL though, so I'll try and unback-tick those.

llvm/docs/RISCV/RISCVVectorExtension.rst

lukel97 · 2024-06-27T07:48:38Z

> I'm not arguing particularly strongly in favour of this suggestion -- and despite me being a native speaker I'm still not 100% on this -- but when the phrase "fixed length" is used as an adjective, as in "fixed-length vector", then it's technically best to use the hyphenated form. As in, "a fixed-length vector has a fixed length". Same with certain uses of "target-agnostic" vs "target agnostic". Again, I don't want to be too prescriptive on this matter and it's not a blocker for me.

Thanks for the review! Agreed, that seems to be what LangRef.rst uses. I think I was too busy trying to not write "fixed vector"

…g supported

- Put more code terms inside backticks
- Update VP link
- fixed length -> fixed-length
- Add table comparing operands and properties of different LLVM IR constructs
- Add missing mask intrinsic operands
- Add table listing different pseudo permutations
- Add missing policy operand to PseudoVADD examples
- Use MIR in masked pattern examples
- i.e. -> i.e.,
- Move vp vmerge passthru emulation from SelectionDAG section to LLVM IR section
- Fix typos
- Update SelectionDAG indentation

…-> AVL

lukel97 · 2024-07-03T00:30:43Z

Ping

topperc

LGTM

wangpc-pp

LGTM! Let's take the first step!

llvm-ci · 2024-07-03T04:58:01Z

LLVM Buildbot has detected a new failure on builder sanitizer-x86_64-linux-android running on sanitizer-buildbot-android while building llvm at step 2 "annotate".

Full details are available at: https://lab.llvm.org/buildbot/#/builders/186/builds/250

Here is the relevant piece of the build log for the reference:

Step 2 (annotate) failure: 'python ../sanitizer_buildbot/sanitizers/zorg/buildbot/builders/sanitizers/buildbot_selector.py' (failure)
...
PASS: AddressSanitizer-aarch64-android :: TestCases/poison_partial.cpp (407 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/printf-1.c (408 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/max_redzone.cpp (409 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/replaceable_new_delete_static.cpp (410 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/printf-m.c (411 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/pr33372.cpp (412 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/scariness_score_test.cpp (413 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/heavy_uar_test.cpp (414 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/printf-4.c (415 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/printf-2.c (416 of 1678)
FAIL: AddressSanitizer-aarch64-android :: TestCases/printf-5.c (417 of 1678)
******************** TEST 'AddressSanitizer-aarch64-android :: TestCases/printf-5.c' FAILED ********************
Exit Code: 1

Command Output (stderr):
--
RUN: at line 1: /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/sanitizer_common/android_commands/android_compile.py  /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm_build64/bin/clang  -fsanitize=address -mno-omit-leaf-frame-pointer -fno-omit-frame-pointer -fno-optimize-sibling-calls -gline-tables-only  --target=aarch64-linux-android24 --sysroot=/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64/sysroot --gcc-toolchain=/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64  -B/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64 -Wthread-safety -Wthread-safety-reference -Wthread-safety-beta  -fuse-ld=lld  -shared-libasan -Wno-excess-initializers -O2 /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c -o /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp
+ /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/sanitizer_common/android_commands/android_compile.py /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm_build64/bin/clang -fsanitize=address -mno-omit-leaf-frame-pointer -fno-omit-frame-pointer -fno-optimize-sibling-calls -gline-tables-only --target=aarch64-linux-android24 --sysroot=/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64/sysroot --gcc-toolchain=/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64 -B/var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/android_ndk/toolchains/llvm/prebuilt/linux-x86_64 -Wthread-safety -Wthread-safety-reference -Wthread-safety-beta -fuse-ld=lld -shared-libasan -Wno-excess-initializers -O2 /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c -o /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp
RUN: at line 3: env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=1 not  /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp 2>&1 | FileCheck --check-prefix=CHECK-ON /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c
+ env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=1 not /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp
+ FileCheck --check-prefix=CHECK-ON /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c
RUN: at line 4: env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=0  /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp 2>&1 | FileCheck --check-prefix=CHECK-OFF /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c
+ env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=0 /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp
+ FileCheck --check-prefix=CHECK-OFF /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c

--

********************
PASS: AddressSanitizer-aarch64-android :: TestCases/report_error_summary.cpp (418 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/replaceable_new_delete_shared.cpp (419 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/realloc.cpp (420 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/partial_right.cpp (421 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/sanity_check_pure_c.c (422 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/load_and_store_n.cpp (423 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-frame-demangle.cpp (424 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr-1.c (425 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strcat-overlap.cpp (426 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr-2.c (427 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-oob-frames.cpp (428 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/set_shadow_test.c (429 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/speculative_load.cpp (430 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strcpy-overlap.cpp (431 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/speculative_load2.cpp (432 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr_strict.c (433 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/small_memcpy_test.cpp (434 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strdup_oob_test.cpp (435 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-buffer-overflow.cpp (436 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcspn-2.c (437 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strncat-overlap.cpp (438 of 1678)
Step 21 (run lit tests [aarch64/aosp_coral-userdebug/AOSP.MASTER]) failure: run lit tests [aarch64/aosp_coral-userdebug/AOSP.MASTER] (failure)
...
RUN: at line 4: env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=0  /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp 2>&1 | FileCheck --check-prefix=CHECK-OFF /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c
+ env ASAN_OPTIONS=abort_on_error=0:replace_intrin=0:check_printf=0 /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/compiler_rt_build_android_aarch64/test/asan/AARCH64AndroidConfig/TestCases/Output/printf-5.c.tmp
+ FileCheck --check-prefix=CHECK-OFF /var/lib/buildbot/sanitizer-buildbot6/sanitizer-x86_64-linux-android/build/llvm-project/compiler-rt/test/asan/TestCases/printf-5.c

--

********************
PASS: AddressSanitizer-aarch64-android :: TestCases/report_error_summary.cpp (418 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/replaceable_new_delete_shared.cpp (419 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/realloc.cpp (420 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/partial_right.cpp (421 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/sanity_check_pure_c.c (422 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/load_and_store_n.cpp (423 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-frame-demangle.cpp (424 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr-1.c (425 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strcat-overlap.cpp (426 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr-2.c (427 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-oob-frames.cpp (428 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/set_shadow_test.c (429 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/speculative_load.cpp (430 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strcpy-overlap.cpp (431 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/speculative_load2.cpp (432 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcasestr_strict.c (433 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/small_memcpy_test.cpp (434 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strdup_oob_test.cpp (435 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/stack-buffer-overflow.cpp (436 of 1678)
PASS: AddressSanitizer-aarch64-android :: TestCases/strcspn-2.c (437 of 1678)
UNSUPPORTED: AddressSanitizer-aarch64-android :: TestCases/strncat-overlap.cpp (438 of 1678)

lukel97 requested review from BeMg, frasercrmck, preames, rofirrim, s-barannikov, topperc, wangpc-pp and yetingk June 26, 2024 08:01

llvmbot added the backend:RISC-V label Jun 26, 2024

lukel97 requested a review from dtcxzyw June 26, 2024 08:02

wangpc-pp reviewed Jun 26, 2024

llvm/docs/RISCV/RISCVVectorExtension.rst (outdated, resolved)

Add a missing word, reference langref for scalable vector types

51afa69

lukel97 added the documentation label Jun 26, 2024

lukel97 requested a review from asb June 26, 2024 10:10

wangpc-pp reviewed Jun 26, 2024

tschuett reviewed Jun 26, 2024

tschuett reviewed Jun 26, 2024

tschuett reviewed Jun 26, 2024

dtcxzyw reviewed Jun 26, 2024

rofirrim reviewed Jun 26, 2024

4vtomat reviewed Jun 26, 2024

frasercrmck reviewed Jun 26, 2024

topperc reviewed Jun 26, 2024

Fix masked intrinsic example

704954b

lukel97 added 5 commits June 27, 2024 16:46

Add original RFC to see also

ebbc927

Clarify that defining vscale = VLEN / 64 prevents VLEN = 32 from being supported

ed6f34c

indicate->indicates, reword how n and ty control LMUL and SEW

2e877eb

Don't backtick design time constants, backtick registers. Correct VL -> AVL

0023d00

topperc approved these changes Jul 3, 2024

wangpc-pp approved these changes Jul 3, 2024

Wording tweaks, VTYPE -> vtype, remove redundant phrasing

55e4a0c

lukel97 merged commit e860c16 into llvm:main Jul 3, 2024
5 of 7 checks passed
