CUDA: fixed LLAMA_FAST compilation option #2473

JohannesGaessler · 2023-07-31T18:19:41Z

The LLAMA_FAST compilation option which replaces -O3 with -Ofast currently does not work in combination with CUDA because nvcc does not have that compilation option. This PR makes it so that -O3 is always used for nvcc and -Ofast otherwise.

slaren

Looks good, but I wonder if LLAMA_FAST should toggle -use_fast_math for nvcc instead.

JohannesGaessler · 2023-07-31T18:39:34Z

All llama.cpp CUDA calculations at some point use half precision or worse so I don't think there's a point in disabling --use_fast_math. I don't know if there are side effects for using -Ofast for llama.cpp/ggml in general so I'm being conservative in not making it the default.

ggerganov · 2023-08-01T07:40:19Z

All llama.cpp CUDA calculations at some point use half precision or worse so I don't think there's a point in disabling --use_fast_math. I don't know if there are side effects for using -Ofast for llama.cpp/ggml in general so I'm being conservative in not making it the default.

At some point there was discussion about this in whisper.cpp and I decided to keep O3 as default:

ggml-org/whisper.cpp#252 (comment)

CUDA: fixed LLAMA_FAST compilation option

5be88d1

slaren approved these changes Jul 31, 2023

View reviewed changes

JohannesGaessler merged commit 49e7cb5 into ggml-org:master Jul 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA: fixed LLAMA_FAST compilation option #2473

CUDA: fixed LLAMA_FAST compilation option #2473

Uh oh!

JohannesGaessler commented Jul 31, 2023

Uh oh!

slaren left a comment

Uh oh!

JohannesGaessler commented Jul 31, 2023

Uh oh!

ggerganov commented Aug 1, 2023

Uh oh!

Uh oh!

CUDA: fixed LLAMA_FAST compilation option #2473

CUDA: fixed LLAMA_FAST compilation option #2473

Uh oh!

Conversation

JohannesGaessler commented Jul 31, 2023

Uh oh!

slaren left a comment

Choose a reason for hiding this comment

Uh oh!

JohannesGaessler commented Jul 31, 2023

Uh oh!

ggerganov commented Aug 1, 2023

Uh oh!

Uh oh!