cmake: fix clang build when CUDA is enabled (#4208) #4323

jonppe · 2023-12-04T13:38:27Z

Don't use the cxx_flags for .cu files so that -Wunreachable-code-break and -Wunreachable-code-return would not be sent to GCC which doesn't understand them.
By default, nvcc uses gcc as the preprocessor even when Clang is used for .c/.cpp.

This should fix the CUDA build with clang 14 (when gcc is used with nvcc). With more complex configuration, it probably would be possible to use clang itself (instead of nvcc) to compile CUDA code or use clang as the preprocessor for nvcc. But these are probably to be quite rare use cases. It seems that Clang 16 would require even more changes.

Don't use the cxx_flags for .cu files so that -Wunreachable-code-break and -Wunreachable-code-return would not be sent to to GCC which doesn't understand them. By default, nvcc uses gcc as the preprocessor even when Clang is used for .c/.cpp. This should fix the CUDA build with clang 14 (when gcc is used with nvcc). With more complex configuration, it probably would be possible to use clang itself (instead of nvcc) to compile CUDA code or use clang as the preprocessor for nvcc. But these are probably to be quite rare use cases. It seems that Clang 16 would require even more changes.

cebtenzzre · 2023-12-05T21:38:00Z

CMakeLists.txt

-            set(warning_flags ${warning_flags} -Wunreachable-code-break -Wunreachable-code-return)
+            set(c_flags ${c_flags} -Wunreachable-code-break -Wunreachable-code-return)
+            # cxx_flags are used for C++ files but not for CUDA
+            set(cxx_flags -Wunreachable-code-break -Wunreachable-code-return)


Hm, I think we should be passing cxx_flags to nvcc as in the Makefile - that's how I intended it. We probably shouldn't pass host_cxx_flags to nvcc at all, since we can't assume that it uses the same compiler as the host. We could possibly use -Xcompiler for cxx_flags, not sure if there's any benefit.

So I think warning_flags should be split into c_flags and host_cxx_flags.

I think it's a valid point that it would be nice if the Makefile and CMakeLists.txt would have somewhat similar patterns and variable names. Anyway, now that I checked, it it seems that also the Makefile fails in building CUDA enabled version with clang.
E.g.:

export CXX=/usr/bin/clang++ export CC=/usr/bin/clang make LLAMA_CUBLAS=1 ... /usr/bin/clang++ -I. -Icommon -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -DNDEBUG -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include -std=c++11 -fPIC -O3 -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wmissing-declarations -Wmissing-noreturn -pthread -Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -march=native -mtune=native -c common/build-info.cpp -o build-info.o nvcc --forward-unknown-to-host-compiler -use_fast_math -arch=native -DGGML_CUDA_DMMV_X=32 -DGGML_CUDA_MMV_Y=1 -DK_QUANTS_PER_ITERATION=2 -DGGML_CUDA_PEER_MAX_BATCH_SIZE=128 -I. -Icommon -D_XOPEN_SOURCE=600 -D_GNU_SOURCE -DNDEBUG -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include -std=c++11 -fPIC -O3 -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wmissing-declarations -Wmissing-noreturn -pthread -Wno-pedantic -Xcompiler "-Wunreachable-code-break -Wunreachable-code-return -Wmissing-prototypes -Wextra-semi -march=native -mtune=native " -c ggml-cuda.cu -o ggml-cuda.o gcc: error: unrecognized command-line option '-Wunreachable-code-break'; did you mean '-Wunreachable-code'? gcc: error: unrecognized command-line option '-Wunreachable-code-return'; did you mean '-Wunreachable-code'? make: *** [Makefile:451: ggml-cuda.o] Error 1

To sum up, I think the parameters get set correctly using this PR. But perhaps the variable names could be clarified, and possible there could be some changes in the way they get set.
Then the Makefile should probably have similar changes.

cebtenzzre · 2023-12-11T20:14:52Z

@jonppe I implemented #4414 for the Makefile. Does that seem like what you need? If it works for you, I'll port it to CMake.

jonppe · 2023-12-11T21:04:24Z

@jonppe I implemented #4414 for the Makefile. Does that seem like what you need? If it works for you, I'll port it to CMake.

Yes, look quite nice and clean to me and avoids the nasty repetition in compiler checks.

jonppe mentioned this pull request Dec 4, 2023

Compilation error #4208

Closed

4 tasks

cebtenzzre reviewed Dec 5, 2023

View reviewed changes

cebtenzzre mentioned this pull request Dec 11, 2023

build : detect host compiler and cuda compiler separately #4414

Merged

cebtenzzre closed this Dec 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

cmake: fix clang build when CUDA is enabled (#4208) #4323

cmake: fix clang build when CUDA is enabled (#4208) #4323

Uh oh!

jonppe commented Dec 4, 2023

Uh oh!

cebtenzzre Dec 5, 2023

Uh oh!

jonppe Dec 7, 2023

Uh oh!

cebtenzzre commented Dec 11, 2023

Uh oh!

jonppe commented Dec 11, 2023

Uh oh!

Uh oh!

cmake: fix clang build when CUDA is enabled (#4208) #4323

cmake: fix clang build when CUDA is enabled (#4208) #4323

Uh oh!

Conversation

jonppe commented Dec 4, 2023

Uh oh!

cebtenzzre Dec 5, 2023

Choose a reason for hiding this comment

Uh oh!

jonppe Dec 7, 2023

Choose a reason for hiding this comment

Uh oh!

cebtenzzre commented Dec 11, 2023

Uh oh!

jonppe commented Dec 11, 2023

Uh oh!

Uh oh!