[NVPTX] Add support for atomic add for f16 type #84295

akuegel · 2024-03-07T10:02:33Z

atom.add.noftz.f16 is supported since SM 7.0

Artem-B

LGTM in general, modulo missing constrain on PTX version.

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp

llvm/lib/Target/NVPTX/NVPTXIntrinsics.td

llvm/test/CodeGen/NVPTX/atomics-sm70.ll

Artem-B · 2024-03-11T17:35:55Z

llvm/test/CodeGen/NVPTX/atomics-sm70.ll

@@ -0,0 +1,28 @@
+; RUN: llc < %s -march=nvptx -mcpu=sm_70 -mattr=+ptx63 | FileCheck %s


For this test it might be convenient to autogenerate the checks with llvm/utils/update_test_checks.py

Thanks for the suggestion. Done

The ordering of the ld.param.*s relative to the atom.* instructions isn't relevant to this test, correct? If so, we may not want to include those CHECK-NEXT:s in the test.

This reverts commit 8e0f4b9.

Reverts #84295 due to breakages.

akuegel · 2024-03-13T15:07:06Z

By now I have created a reproducer for what caused the revert. If I adjust the test slightly, and change the line with %r1:

%r1 = atomicrmw fadd ptr %dp0, half 1.0 seq_cst, align 2

The codegen for this becomes:

ld.param.u64 %rd1, [test_param_0]; 
atom.add.noftz.f16 %rs1, [%rd1], 0x3C00;

And this fails verification with this error:

Arguments mismatch for instruction 'atom'

I lack the knowledge about PTX to know what is wrong with that. Will try to figure it out.

akuegel · 2024-03-13T15:44:18Z

Ok, seems that ptx makes a difference between floating point constants and integer constants. And we are generating a integer constant, I guess due to using Int16Register. @Artem-B in case you can give me a pointer where I can make sure that we are using a floating point constant here, that would be great :)

akuegel · 2024-03-13T15:57:55Z

Ok, I found something in NVPTXISelDagToDag.cpp, seems like there is no hex representation for f16 constants, and it is replaced by loading from a f16 register. Is this the right place where to look further?

Artem-B · 2024-03-13T18:14:36Z

Huh. It appears that the instruction does not accept immetiate arguments for f16 variants, though it does accept them for f32. This looks like a bug in ptxas.
https://godbolt.org/z/9Gv1McM4M

Normally f16 instruction variants accept plain hex immediate values.

Looks like we'll need to disable insttruction variant with an immediate argument and force passing it via a register.

akuegel · 2024-03-14T08:30:28Z

Tried to disable the instruction variant with an immediate argument, and that seems to work:

#85197

akuegel added 2 commits March 7, 2024 09:56

[NVPTX] Add support for atomic add for f16 type.

7bac5a8

atom.add.noftz.f16 is supported since SM 7.0

[NVPTX] Add atomic add test.

9aefbbe

akuegel requested a review from Artem-B March 7, 2024 10:02

Artem-B reviewed Mar 7, 2024

View reviewed changes

llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp Show resolved Hide resolved

llvm/lib/Target/NVPTX/NVPTXIntrinsics.td Outdated Show resolved Hide resolved

Also check PTX version.

dbd3f9a

Artem-B reviewed Mar 8, 2024

View reviewed changes

llvm/test/CodeGen/NVPTX/atomics-sm70.ll Outdated Show resolved Hide resolved

akuegel added 2 commits March 11, 2024 07:05

Update tests and add more tests

9765a9a

Remove dump-input=always left over from debugging.

3bd7aba

Artem-B reviewed Mar 11, 2024

View reviewed changes

Artem-B approved these changes Mar 11, 2024

View reviewed changes

akuegel added 2 commits March 12, 2024 07:37

Merge branch 'main' into atomic_add_f16

0e4b917

Autogenerate tests.

34ce80d

akuegel merged commit 8e0f4b9 into llvm:main Mar 12, 2024

akuegel deleted the atomic_add_f16 branch March 12, 2024 08:18

dklimkin added a commit that referenced this pull request Mar 12, 2024

Revert "[NVPTX] Add support for atomic add for f16 type (#84295)"

8a7f465

This reverts commit 8e0f4b9.

dklimkin mentioned this pull request Mar 12, 2024

Revert "[NVPTX] Add support for atomic add for f16 type" #84918

Merged

dklimkin added a commit that referenced this pull request Mar 12, 2024

Revert "[NVPTX] Add support for atomic add for f16 type" (#84918)

afd4758

Reverts #84295 due to breakages.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NVPTX] Add support for atomic add for f16 type #84295

[NVPTX] Add support for atomic add for f16 type #84295

Uh oh!

akuegel commented Mar 7, 2024

Uh oh!

Artem-B left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Artem-B Mar 11, 2024

Uh oh!

akuegel Mar 12, 2024

Uh oh!

justinfargnoli Mar 12, 2024

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

Artem-B commented Mar 13, 2024

Uh oh!

akuegel commented Mar 14, 2024

Uh oh!

Uh oh!

		@@ -0,0 +1,28 @@
		; RUN: llc < %s -march=nvptx -mcpu=sm_70 -mattr=+ptx63 \| FileCheck %s

[NVPTX] Add support for atomic add for f16 type #84295

[NVPTX] Add support for atomic add for f16 type #84295

Uh oh!

Conversation

akuegel commented Mar 7, 2024

Uh oh!

Artem-B left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Artem-B Mar 11, 2024

Choose a reason for hiding this comment

Uh oh!

akuegel Mar 12, 2024

Choose a reason for hiding this comment

Uh oh!

justinfargnoli Mar 12, 2024

Choose a reason for hiding this comment

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

akuegel commented Mar 13, 2024

Uh oh!

Artem-B commented Mar 13, 2024

Uh oh!

akuegel commented Mar 14, 2024

Uh oh!

Uh oh!