[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering #130212

AlexMaclean · 2025-03-07T00:52:07Z

This change moves GlobalUniqueCallSite into NVPTXISelLowering. In processes where multiple compilations occur, this makes call site enumeration local to individual compilation, which ensures that call site numbers are consistently sequential within each compilation and is independent of other compilations happening in parallel.

llvmbot · 2025-03-07T00:52:46Z

@llvm/pr-subscribers-backend-nvptx

Author: Alex MacLean (AlexMaclean)

Changes

Moving GlobalUniqueCallSite into NVPTXISelLowering ensures that in processes where multiple compilations occur, race conditions do not impact the generated PTX.

Full diff: https://github.com/llvm/llvm-project/pull/130212.diff

2 Files Affected:

(modified) llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp (+2-4)
(modified) llvm/lib/Target/NVPTX/NVPTXISelLowering.h (+2)

diff --git a/llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp b/llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
index 3e755c25fd91a..b62c15ddb97d3 100644
--- a/llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
+++ b/llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
@@ -73,8 +73,6 @@
 
 using namespace llvm;
 
-static std::atomic<unsigned> GlobalUniqueCallSite;
-
 static cl::opt<bool> sched4reg(
     "nvptx-sched4reg",
     cl::desc("NVPTX Specific: schedule for register pressue"), cl::init(false));
@@ -500,7 +498,7 @@ static SDValue MaybeBitcast(SelectionDAG &DAG, SDLoc DL, EVT VT,
 // NVPTXTargetLowering Constructor.
 NVPTXTargetLowering::NVPTXTargetLowering(const NVPTXTargetMachine &TM,
                                          const NVPTXSubtarget &STI)
-    : TargetLowering(TM), nvTM(&TM), STI(STI) {
+    : TargetLowering(TM), nvTM(&TM), STI(STI), GlobalUniqueCallSite(0) {
   // always lower memset, memcpy, and memmove intrinsics to load/store
   // instructions, rather
   // then generating calls to memset, mempcy or memmove.
@@ -1474,7 +1472,7 @@ SDValue NVPTXTargetLowering::LowerCall(TargetLowering::CallLoweringInfo &CLI,
   unsigned FirstVAArg = CLI.NumFixedArgs; // position of the first variadic
   unsigned VAOffset = 0;                  // current offset in the param array
 
-  unsigned UniqueCallSite = GlobalUniqueCallSite.fetch_add(1);
+  const unsigned UniqueCallSite = GlobalUniqueCallSite++;
   SDValue TempChain = Chain;
   Chain = DAG.getCALLSEQ_START(Chain, UniqueCallSite, 0, dl);
   SDValue InGlue = Chain.getValue(1);
diff --git a/llvm/lib/Target/NVPTX/NVPTXISelLowering.h b/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
index f41c569a65544..ff0241886223b 100644
--- a/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
+++ b/llvm/lib/Target/NVPTX/NVPTXISelLowering.h
@@ -273,6 +273,8 @@ class NVPTXTargetLowering : public TargetLowering {
 
 private:
   const NVPTXSubtarget &STI; // cache the subtarget here
+  mutable unsigned GlobalUniqueCallSite;
+
   SDValue getParamSymbol(SelectionDAG &DAG, int idx, EVT) const;
 
   SDValue LowerADDRSPACECAST(SDValue Op, SelectionDAG &DAG) const;

Artem-B · 2025-03-07T18:31:12Z

Can you elaborate on the problem you're trying to solve? What kind of race conditions do you see while updating an atomic variable?

Moving the counter into NVPTXTargetLowering, instead of relying on a global counter, is fine in principle, I just want to understand what's going on.

AlexMaclean · 2025-03-07T18:54:54Z

Can you elaborate on the problem you're trying to solve? What kind of race conditions do you see while updating an atomic variable?

Moving the counter into NVPTXTargetLowering, instead of relying on a global counter, is fine in principle, I just want to understand what's going on.

The problem is that if a process is doing multiple distinct compilations of different modules they will share GlobalUniqueCallSite. As a result, depending on the order in which these compilations occur, the emitted PTX will be different. With this change the emitted PTX of one compilation will not be changed by whatever compilations were run before. Of course this won't be an issue for llc but we have encountered this internally.

Even setting that aside, I think it is probably a little bit cleaner/preferable to use a class member over a global variable.

Artem-B · 2025-03-07T19:54:08Z

OK, so it's not exactly a race condition, but rather compilation stability in multi-threaded compilation.

I'd rephrase the patch description along the lines of "make call site enumeration local to individual compilation, which ensures that call site numbers are consistently sequential within each compilation and is independent of other compilations happening in parallel".

Artem-B

LGTM, modulo patch description.

…#130212) This change moves GlobalUniqueCallSite into NVPTXISelLowering. In processes where multiple compilations occur, this makes call site enumeration local to individual compilation, which ensures that call site numbers are consistently sequential within each compilation and is independent of other compilations happening in parallel.

[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering

fa21654

AlexMaclean requested review from Artem-B and kalxr March 7, 2025 00:52

AlexMaclean self-assigned this Mar 7, 2025

llvmbot added the backend:NVPTX label Mar 7, 2025

justinfargnoli approved these changes Mar 7, 2025

View reviewed changes

Artem-B approved these changes Mar 7, 2025

View reviewed changes

AlexMaclean merged commit 1b01f05 into llvm:main Mar 7, 2025
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering #130212

[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering #130212

Uh oh!

AlexMaclean commented Mar 7, 2025 •

edited

Loading

Uh oh!

llvmbot commented Mar 7, 2025

Uh oh!

Artem-B commented Mar 7, 2025 •

edited

Loading

Uh oh!

AlexMaclean commented Mar 7, 2025

Uh oh!

Artem-B commented Mar 7, 2025

Uh oh!

Artem-B left a comment

Uh oh!

Uh oh!

Uh oh!

[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering #130212

[NVPTX] Make GlobalUniqueCallSite a member of NVPTXISelLowering #130212

Uh oh!

Conversation

AlexMaclean commented Mar 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Mar 7, 2025

Uh oh!

Artem-B commented Mar 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexMaclean commented Mar 7, 2025

Uh oh!

Artem-B commented Mar 7, 2025

Uh oh!

Artem-B left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

AlexMaclean commented Mar 7, 2025 •

edited

Loading

Artem-B commented Mar 7, 2025 •

edited

Loading