
[mlir] Don't rely on values of MLIR_(CUDA|ROCM)_CONVERSIONS_ENABLED. #82988


Closed

Conversation

ingomueller-net
Contributor

This PR changes the `#if XXX == 1` sections for these macros to `#ifdef` sections so that the actual values the macros are defined with do not matter. This scheme is used for most (but not all) such macros in the MLIR code base, and the fact that these two macros did not follow it has led to bugs downstream (which assumed that `#ifdef` was being used and defined them to a value != 1).

@llvmbot
Member

llvmbot commented Feb 26, 2024

@llvm/pr-subscribers-mlir
@llvm/pr-subscribers-mlir-llvm

@llvm/pr-subscribers-mlir-gpu

Author: Ingo Müller (ingomueller-net)

Changes

This PR changes the `#if XXX == 1` sections for these macros to `#ifdef` sections so that the actual values the macros are defined with do not matter. This scheme is used for most (but not all) such macros in the MLIR code base, and the fact that these two macros did not follow it has led to bugs downstream (which assumed that `#ifdef` was being used and defined them to a value != 1).


Full diff: https://github.com/llvm/llvm-project/pull/82988.diff

8 Files Affected:

  • (modified) mlir/CMakeLists.txt (+3-3)
  • (modified) mlir/include/mlir/InitAllPasses.h (+1-1)
  • (modified) mlir/lib/Dialect/GPU/Pipelines/GPUToNVVMPipeline.cpp (+1-1)
  • (modified) mlir/lib/Dialect/GPU/Transforms/ModuleToBinary.cpp (+2-2)
  • (modified) mlir/lib/Target/LLVM/NVVM/Target.cpp (+5-5)
  • (modified) mlir/lib/Target/LLVM/ROCDL/Target.cpp (+4-4)
  • (modified) mlir/unittests/Target/LLVM/SerializeNVVMTarget.cpp (+1-1)
  • (modified) mlir/unittests/Target/LLVM/SerializeROCDLTarget.cpp (+1-1)
diff --git a/mlir/CMakeLists.txt b/mlir/CMakeLists.txt
index 16c898bdeb6e00..d2a1d6c1340c0f 100644
--- a/mlir/CMakeLists.txt
+++ b/mlir/CMakeLists.txt
@@ -108,20 +108,20 @@ endif()
 # is available
 if ("NVPTX" IN_LIST LLVM_TARGETS_TO_BUILD)
   set(MLIR_ENABLE_CUDA_CONVERSIONS 1)
+  # TODO: we should use a config.h file like LLVM does
+  add_definitions(-DMLIR_CUDA_CONVERSIONS_ENABLED)
 else()
   set(MLIR_ENABLE_CUDA_CONVERSIONS 0)
 endif()
-# TODO: we should use a config.h file like LLVM does
-add_definitions(-DMLIR_CUDA_CONVERSIONS_ENABLED=${MLIR_ENABLE_CUDA_CONVERSIONS})
 
 # Build the ROCm conversions and run according tests if the AMDGPU backend
 # is available.
 if ("AMDGPU" IN_LIST LLVM_TARGETS_TO_BUILD)
   set(MLIR_ENABLE_ROCM_CONVERSIONS 1)
+  add_definitions(-DMLIR_ROCM_CONVERSIONS_ENABLED)
 else()
   set(MLIR_ENABLE_ROCM_CONVERSIONS 0)
 endif()
-add_definitions(-DMLIR_ROCM_CONVERSIONS_ENABLED=${MLIR_ENABLE_ROCM_CONVERSIONS})
 
 set(MLIR_ENABLE_CUDA_RUNNER 0 CACHE BOOL "Enable building the mlir CUDA runner")
 set(MLIR_ENABLE_ROCM_RUNNER 0 CACHE BOOL "Enable building the mlir ROCm runner")
diff --git a/mlir/include/mlir/InitAllPasses.h b/mlir/include/mlir/InitAllPasses.h
index e28921619fe582..7f14577d98a561 100644
--- a/mlir/include/mlir/InitAllPasses.h
+++ b/mlir/include/mlir/InitAllPasses.h
@@ -96,7 +96,7 @@ inline void registerAllPasses() {
   bufferization::registerBufferizationPipelines();
   sparse_tensor::registerSparseTensorPipelines();
   tosa::registerTosaToLinalgPipelines();
-#if MLIR_CUDA_CONVERSIONS_ENABLED
+#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
   gpu::registerGPUToNVVMPipeline();
 #endif
 }
diff --git a/mlir/lib/Dialect/GPU/Pipelines/GPUToNVVMPipeline.cpp b/mlir/lib/Dialect/GPU/Pipelines/GPUToNVVMPipeline.cpp
index 935f0deaf9c8a6..31699ee44b6671 100644
--- a/mlir/lib/Dialect/GPU/Pipelines/GPUToNVVMPipeline.cpp
+++ b/mlir/lib/Dialect/GPU/Pipelines/GPUToNVVMPipeline.cpp
@@ -38,7 +38,7 @@
 
 using namespace mlir;
 
-#if MLIR_CUDA_CONVERSIONS_ENABLED
+#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
 namespace {
 
 //===----------------------------------------------------------------------===//
diff --git a/mlir/lib/Dialect/GPU/Transforms/ModuleToBinary.cpp b/mlir/lib/Dialect/GPU/Transforms/ModuleToBinary.cpp
index 0527073da85b69..9fe7b03a62a6ba 100644
--- a/mlir/lib/Dialect/GPU/Transforms/ModuleToBinary.cpp
+++ b/mlir/lib/Dialect/GPU/Transforms/ModuleToBinary.cpp
@@ -48,10 +48,10 @@ void GpuModuleToBinaryPass::getDependentDialects(
   // Register all GPU related translations.
   registry.insert<gpu::GPUDialect>();
   registry.insert<LLVM::LLVMDialect>();
-#if MLIR_CUDA_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
   registry.insert<NVVM::NVVMDialect>();
 #endif
-#if MLIR_ROCM_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_ROCM_CONVERSIONS_ENABLED
   registry.insert<ROCDL::ROCDLDialect>();
 #endif
   registry.insert<spirv::SPIRVDialect>();
diff --git a/mlir/lib/Target/LLVM/NVVM/Target.cpp b/mlir/lib/Target/LLVM/NVVM/Target.cpp
index 71b15a92782ed7..66efc9f416f21f 100644
--- a/mlir/lib/Target/LLVM/NVVM/Target.cpp
+++ b/mlir/lib/Target/LLVM/NVVM/Target.cpp
@@ -156,7 +156,7 @@ SerializeGPUModuleBase::loadBitcodeFiles(llvm::Module &module) {
   return std::move(bcFiles);
 }
 
-#if MLIR_CUDA_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
 namespace {
 class NVPTXSerializer : public SerializeGPUModuleBase {
 public:
@@ -555,14 +555,14 @@ NVPTXSerializer::moduleToObject(llvm::Module &llvmModule) {
     return SmallVector<char, 0>(bin.begin(), bin.end());
   }
 
-    // Compile to binary.
+  // Compile to binary.
 #if MLIR_NVPTXCOMPILER_ENABLED == 1
   return compileToBinaryNVPTX(*serializedISA);
 #else
   return compileToBinary(*serializedISA);
 #endif // MLIR_NVPTXCOMPILER_ENABLED == 1
 }
-#endif // MLIR_CUDA_CONVERSIONS_ENABLED == 1
+#endif // MLIR_CUDA_CONVERSIONS_ENABLED
 
 std::optional<SmallVector<char, 0>>
 NVVMTargetAttrImpl::serializeToObject(Attribute attribute, Operation *module,
@@ -574,7 +574,7 @@ NVVMTargetAttrImpl::serializeToObject(Attribute attribute, Operation *module,
     module->emitError("Module must be a GPU module.");
     return std::nullopt;
   }
-#if MLIR_CUDA_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
   NVPTXSerializer serializer(*module, cast<NVVMTargetAttr>(attribute), options);
   serializer.init();
   return serializer.run();
@@ -582,7 +582,7 @@ NVVMTargetAttrImpl::serializeToObject(Attribute attribute, Operation *module,
   module->emitError(
       "The `NVPTX` target was not built. Please enable it when building LLVM.");
   return std::nullopt;
-#endif // MLIR_CUDA_CONVERSIONS_ENABLED == 1
+#endif // MLIR_CUDA_CONVERSIONS_ENABLED
 }
 
 Attribute
diff --git a/mlir/lib/Target/LLVM/ROCDL/Target.cpp b/mlir/lib/Target/LLVM/ROCDL/Target.cpp
index cdcef1d6b459c6..9e608843998103 100644
--- a/mlir/lib/Target/LLVM/ROCDL/Target.cpp
+++ b/mlir/lib/Target/LLVM/ROCDL/Target.cpp
@@ -120,7 +120,7 @@ void SerializeGPUModuleBase::init() {
   static llvm::once_flag initializeBackendOnce;
   llvm::call_once(initializeBackendOnce, []() {
   // If the `AMDGPU` LLVM target was built, initialize it.
-#if MLIR_ROCM_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_ROCM_CONVERSIONS_ENABLED
     LLVMInitializeAMDGPUTarget();
     LLVMInitializeAMDGPUTargetInfo();
     LLVMInitializeAMDGPUTargetMC();
@@ -318,7 +318,7 @@ SerializeGPUModuleBase::assembleIsa(StringRef isa) {
   return result;
 }
 
-#if MLIR_ROCM_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_ROCM_CONVERSIONS_ENABLED
 namespace {
 class AMDGPUSerializer : public SerializeGPUModuleBase {
 public:
@@ -462,7 +462,7 @@ std::optional<SmallVector<char, 0>> ROCDLTargetAttrImpl::serializeToObject(
     module->emitError("Module must be a GPU module.");
     return std::nullopt;
   }
-#if MLIR_ROCM_CONVERSIONS_ENABLED == 1
+#ifdef MLIR_ROCM_CONVERSIONS_ENABLED
   AMDGPUSerializer serializer(*module, cast<ROCDLTargetAttr>(attribute),
                               options);
   serializer.init();
@@ -471,7 +471,7 @@ std::optional<SmallVector<char, 0>> ROCDLTargetAttrImpl::serializeToObject(
   module->emitError("The `AMDGPU` target was not built. Please enable it when "
                     "building LLVM.");
   return std::nullopt;
-#endif // MLIR_ROCM_CONVERSIONS_ENABLED == 1
+#endif // MLIR_ROCM_CONVERSIONS_ENABLED
 }
 
 Attribute
diff --git a/mlir/unittests/Target/LLVM/SerializeNVVMTarget.cpp b/mlir/unittests/Target/LLVM/SerializeNVVMTarget.cpp
index 26bfbd5c11e81e..0cbc938e189b68 100644
--- a/mlir/unittests/Target/LLVM/SerializeNVVMTarget.cpp
+++ b/mlir/unittests/Target/LLVM/SerializeNVVMTarget.cpp
@@ -29,7 +29,7 @@
 using namespace mlir;
 
 // Skip the test if the NVPTX target was not built.
-#if MLIR_CUDA_CONVERSIONS_ENABLED == 0
+#ifndef MLIR_CUDA_CONVERSIONS_ENABLED
 #define SKIP_WITHOUT_NVPTX(x) DISABLED_##x
 #else
 #define SKIP_WITHOUT_NVPTX(x) x
diff --git a/mlir/unittests/Target/LLVM/SerializeROCDLTarget.cpp b/mlir/unittests/Target/LLVM/SerializeROCDLTarget.cpp
index e91e69b102043a..5b3fe4979a2d01 100644
--- a/mlir/unittests/Target/LLVM/SerializeROCDLTarget.cpp
+++ b/mlir/unittests/Target/LLVM/SerializeROCDLTarget.cpp
@@ -31,7 +31,7 @@
 using namespace mlir;
 
 // Skip the test if the AMDGPU target was not built.
-#if MLIR_ROCM_CONVERSIONS_ENABLED == 0
+#ifndef MLIR_ROCM_CONVERSIONS_ENABLED
 #define SKIP_WITHOUT_AMDGPU(x) DISABLED_##x
 #else
 #define SKIP_WITHOUT_AMDGPU(x) x

@ingomueller-net
Contributor Author

As a comparison, these are the macros where `#if` is being used. (I manually removed some tests like `__clang_major__ <= 5`.) Note that the majority of these also don't rely on the values of the macros.

> git grep -E "^#if " -- mlir | cut -f2 -d: | sort -u
...
#if LLVM_ENABLE_ABI_BREAKING_CHECKS
#if LLVM_ENABLE_STATS
#if LLVM_ENABLE_THREADS != 0
#if LLVM_HAS_NVPTX_TARGET
#if LLVM_NATIVE_TARGET_TEST_ENABLED == 0
#if MLIR_CAPI_BUILDING_LIBRARY
#if MLIR_DEPRECATED_GPU_SERIALIZATION_ENABLE == 1
#if MLIR_ENABLE_EXPENSIVE_PATTERN_API_CHECKS
#if MLIR_ENABLE_PDL_IN_PATTERNMATCH
#if MLIR_GPU_TO_HSACO_PASS_ENABLE
#if MLIR_NVPTXCOMPILER_ENABLED == 1

And this is where `#ifdef` is being used:

> git grep -E "^#ifdef" -- mlir | cut -f2 -d: | sort -u
#ifdef __cplusplus
#ifdef CUSPARSE_COO_AOS
#ifdef CUSPARSE_COO_AOS // deprecated in cuSPARSE 11.2
#ifdef ERROR1
#ifdef ERROR10
#ifdef ERROR11
#ifdef ERROR12
#ifdef ERROR13
#ifdef ERROR2
#ifdef ERROR3
#ifdef ERROR4
#ifdef ERROR5
#ifdef ERROR6
#ifdef ERROR7
#ifdef ERROR8
#ifdef ERROR9
#ifdef EXPENSIVE_CHECKS
#ifdef GEN_PASS_DECL_MYPASS
#ifdef __has_attribute
#ifdef __linux__
#ifdef LLVM_ENABLE_ABI_BREAKING_CHECKS
#ifdef mlir_arm_sme_abi_stubs_EXPORTS
#ifdef mlir_async_runtime_EXPORTS
#ifdef MLIR_CRUNNERUTILS_DEFINE_FUNCTIONS
#ifdef mlir_c_runner_utils_EXPORTS
#ifdef MLIR_CUDA_CONVERSIONS_ENABLED
#ifdef MLIR_ENABLE_CUDA_CUSPARSE
#ifdef MLIR_ENABLE_CUDA_CUSPARSELT
#ifdef MLIR_FLOAT16_DEFINE_FUNCTIONS // We are building this library
#ifdef mlir_float16_utils_EXPORTS // We are building this library
#ifdef MLIR_GREEDY_REWRITE_RANDOMIZER_SEED
#ifdef MLIR_INCLUDE_TESTS
#ifdef MLIR_ROCM_CONVERSIONS_ENABLED
#ifdef mlir_runner_utils_EXPORTS
#ifdef NDEBUG
#ifdef __sparc__
#ifdef _WIN32

@fabianmcg
Contributor

Did you run `check-mlir` on both AMD and NVIDIA GPUs and verify that the integration tests pass?
I never liked the `#if X == 1` pattern; however, I don't remember whether it was set up like that because of the tests.

@ingomueller-net
Contributor Author

I have just run all Bazel test targets under mlir/test and they passed. However, I am not 100% sure whether that actually runs on the two GPU architectures. Doesn't the CI do that eventually?

@fabianmcg
Contributor

However, I am not 100% sure whether that actually runs on the two GPU architectures. Doesn't the CI do that eventually?

We don't test GPU integration tests until the commit lands on main, hence we wouldn't know until a builder fails and we have to revert.

If you build with -DLLVM_LIT_ARGS="-v --xunit-xml-output test-results.xml" then the file ${BUILD_DIR}/tools/mlir/test/test-results.xml should show you whether the test was skipped, failed or passed.

Otherwise, I can test this later on the day or tomorrow morning.

@ingomueller-net
Contributor Author

Yeah, I see the problem. The trouble with testing via CMake is that I don't have a machine with a GPU to test things on. I can only run those GPU tests that Bazel runs in our internal build/test cloud... If you could run the tests, that'd be great -- thanks!

@joker-eph
Collaborator

I have concerns with this direction which seems the opposite of what we should do.

I would want to never use `#ifdef` so that we can turn on the `-Wundef` warning and catch the usual misconfiguration problems: spelling issues, a macro that is renamed without its usages being updated, etc.

@joker-eph
Collaborator

As an example, the form `#if LLVM_ENABLE_ABI_BREAKING_CHECKS` would be caught by `-Wundef` if the Config.h header isn't included, but would be silently misconfigured with `#ifdef`.

@ingomueller-net
Contributor Author

OK, that's a good argument. That might have caught #83004 -- I'll try running with that warning flag and see if I find similar problems.

One clarification, though: even if we use `#if` instead of `#ifdef`, we don't have to rely on a particular value, right? That would mean preferring `#if X` over `#if X == 1`. Is that so?

FWIW, I think our downstream problems were actually unrelated to the macro value.

@ingomueller-net
Contributor Author

Closing in favor of #83004. That PR only changes the CUDA-related variable, but I'll probably submit further PRs for the other ones once we have agreed on the approach.
