[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU #135979

sarnex · 2025-04-16T15:44:15Z

The MSVC STL includes specializations of _Is_memfunptr for every function pointer type, including every calling convention.

The problem is the AMDGPU target doesn't support the x86 vectorcall calling convention so clang sets it to the default CC. This ends up clashing with the already-existing overload for the default CC, so we get a duplicate definition error when including type_traits (which we heavily use in the SYCL STL) and compiling for AMDGPU on Windows.

This doesn't happen for pure AMDGPU non-SYCL because it doesn't include the C++ STL, and it doesn't happen for CUDA/HIP because a similar workaround was done here.

I am not an expert in Sema, so I did a kinda of hardcoded fix, please let me know if there is a better way to fix this.

As far as I can tell we can't do exactly the same fix that was done for CUDA because we can't differentiate between device and host code so easily.

github-actions · 2025-04-16T15:46:45Z

✅ With the latest revision this PR passed the C/C++ code formatter.

Signed-off-by: Sarnie, Nick <[email protected]>

llvmbot · 2025-04-17T16:34:50Z

@llvm/pr-subscribers-clang

Author: Nick Sarnie (sarnex)

Changes

The MSVC STL includes specializations of _Is_memfunptr for every function pointer type, including every calling convention.

The problem is the AMDGPU target doesn't support the x86 vectorcall calling convention so clang sets it to the default CC. This ends up clashing with the already-existing overload for the default CC, so we get a duplicate definition error when including type_traits (which we heavily use in the SYCL STL) and compiling for AMDGPU on Windows.

This doesn't happen for pure AMDGPU non-SYCL because it doesn't include the C++ STL, and it doesn't happen for CUDA/HIP because a similar workaround was done here.

I am not an expert in Sema, so I did a kinda of hardcoded fix, please let me know if there is a better way to fix this.

As far as I can tell we can't do exactly the same fix that was done for CUDA because we can't differentiate between device and host code so easily.

Full diff: https://github.com/llvm/llvm-project/pull/135979.diff

3 Files Affected:

(modified) clang/lib/Sema/SemaDeclAttr.cpp (+6-1)
(added) clang/test/SemaSYCL/Inputs/vectorcall.hpp (+18)
(added) clang/test/SemaSYCL/sycl-cconv-win.cpp (+5)

diff --git a/clang/lib/Sema/SemaDeclAttr.cpp b/clang/lib/Sema/SemaDeclAttr.cpp
index bc891fb009410..3594111d86ccc 100644
--- a/clang/lib/Sema/SemaDeclAttr.cpp
+++ b/clang/lib/Sema/SemaDeclAttr.cpp
@@ -5492,12 +5492,12 @@ bool Sema::CheckCallingConvAttr(const ParsedAttr &Attrs, CallingConv &CC,
 
   TargetInfo::CallingConvCheckResult A = TargetInfo::CCCR_OK;
   const TargetInfo &TI = Context.getTargetInfo();
+  auto *Aux = Context.getAuxTargetInfo();
   // CUDA functions may have host and/or device attributes which indicate
   // their targeted execution environment, therefore the calling convention
   // of functions in CUDA should be checked against the target deduced based
   // on their host/device attributes.
   if (LangOpts.CUDA) {
-    auto *Aux = Context.getAuxTargetInfo();
     assert(FD || CFT != CUDAFunctionTarget::InvalidTarget);
     auto CudaTarget = FD ? CUDA().IdentifyTarget(FD) : CFT;
     bool CheckHost = false, CheckDevice = false;
@@ -5522,6 +5522,11 @@ bool Sema::CheckCallingConvAttr(const ParsedAttr &Attrs, CallingConv &CC,
       A = HostTI->checkCallingConvention(CC);
     if (A == TargetInfo::CCCR_OK && CheckDevice && DeviceTI)
       A = DeviceTI->checkCallingConvention(CC);
+  } else if (LangOpts.SYCLIsDevice && TI.getTriple().isAMDGPU() &&
+             CC == CC_X86VectorCall) {
+    A = TI.checkCallingConvention(CC);
+    if (Aux && A != TargetInfo::CCCR_OK)
+      A = Aux->checkCallingConvention(CC);
   } else {
     A = TI.checkCallingConvention(CC);
   }
diff --git a/clang/test/SemaSYCL/Inputs/vectorcall.hpp b/clang/test/SemaSYCL/Inputs/vectorcall.hpp
new file mode 100644
index 0000000000000..566a48dd24c30
--- /dev/null
+++ b/clang/test/SemaSYCL/Inputs/vectorcall.hpp
@@ -0,0 +1,18 @@
+
+template <typename F> struct A{};
+
+template <typename Ret, typename C, typename... Args> struct A<Ret (             C::*)(Args...) noexcept> { static constexpr int value = 0; };
+template <typename Ret, typename C, typename... Args> struct A<Ret (__vectorcall C::*)(Args...) noexcept> { static constexpr int value = 1; };
+
+template <typename F> constexpr int A_v = A<F>::value;
+
+struct B
+{
+    void f() noexcept {}
+    void __vectorcall g() noexcept {}
+};
+
+int main()
+{
+    return A_v<decltype(&B::f)> + A_v<decltype(&B::g)>;
+}
diff --git a/clang/test/SemaSYCL/sycl-cconv-win.cpp b/clang/test/SemaSYCL/sycl-cconv-win.cpp
new file mode 100644
index 0000000000000..f0c22a2ed3921
--- /dev/null
+++ b/clang/test/SemaSYCL/sycl-cconv-win.cpp
@@ -0,0 +1,5 @@
+// RUN: %clang_cc1 -isystem %S/Inputs/ -fsycl-is-device -triple amdgcn-amd-hsa -aux-triple x86_64-pc-windows-msvc -fsyntax-only -verify %s
+
+// expected-no-diagnostics
+
+#include <vectorcall.hpp>

clang/lib/Sema/SemaDeclAttr.cpp

Signed-off-by: Sarnie, Nick <[email protected]>

The MSVC STL includes specializations of `_Is_memfunptr` for every function pointer type, including every calling convention. The problem is the AMDGPU target doesn't support the x86 `vectorcall` calling convention so clang sets it to the default CC. This ends up clashing with the already-existing overload for the default CC, so we get a duplicate definition error when including `type_traits` (which we heavily use in the SYCL STL) and compiling for AMDGPU on Windows. This doesn't happen for pure AMDGPU non-SYCL because it doesn't include the C++ STL, and it doesn't happen for CUDA/HIP because a similar workaround was done [here](llvm@fa49c3a). I am not an expert in Sema, so I did a kinda of hardcoded fix, please let me know if there is a better way to fix this. As far as I can tell we can't do exactly the same fix that was done for CUDA because we can't differentiate between device and host code so easily. --------- Signed-off-by: Sarnie, Nick <[email protected]>

sarnex force-pushed the amdgpu branch 2 times, most recently from 2dfcad2 to 6ed0ba1 Compare April 16, 2025 18:50

[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU

ca1fa92

Signed-off-by: Sarnie, Nick <[email protected]>

sarnex force-pushed the amdgpu branch from 6ed0ba1 to ca1fa92 Compare April 17, 2025 14:36

sarnex marked this pull request as ready for review April 17, 2025 16:34

llvmbot added clang Clang issues not falling into any other category clang:frontend Language frontend issues, e.g. anything involving "Sema" labels Apr 17, 2025

sarnex requested a review from Fznamznon April 17, 2025 16:34

sarnex requested a review from elizabethandrews April 17, 2025 16:35

sarnex mentioned this pull request Apr 17, 2025

[clang][Sema][SYCL] Fix SYCL AMDGPU compilation on Windows intel/llvm#17983

Closed

sarnex requested a review from yxsamliu April 17, 2025 16:35

yxsamliu reviewed Apr 17, 2025

View reviewed changes

clang/lib/Sema/SemaDeclAttr.cpp Show resolved Hide resolved

yxsamliu approved these changes Apr 17, 2025

View reviewed changes

add comment, address feedback

8b757b1

Signed-off-by: Sarnie, Nick <[email protected]>

sarnex merged commit 257b727 into llvm:main Apr 18, 2025
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU #135979

[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU #135979

Uh oh!

sarnex commented Apr 16, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Apr 16, 2025 •

edited

Loading

Uh oh!

llvmbot commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU #135979

[clang][Sema][SYCL] Fix MSVC STL usage on AMDGPU #135979

Uh oh!

Conversation

sarnex commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sarnex commented Apr 16, 2025 •

edited

Loading

github-actions bot commented Apr 16, 2025 •

edited

Loading