-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Renames NVIDIA GPU-architecture flags to avoid name clashes with WinAPI #10736
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Renames NVIDIA GPU-architecture flags to avoid name clashes with WinAPI #10736
Conversation
…PI. (e.g. CC_PASCAL, GPU architecture or WinAPI pascal compiler flag?)
ggml/src/ggml-cuda/common.cuh
Outdated
#define CC_TURING 750 | ||
#define CC_AMPERE 800 | ||
#define GGML_CUDA_CC_PASCAL 600 | ||
#define GGML_CUDA_MIN_CC_DP4A 610 // minimum compute capability for __dp4a, an intrinsic for byte-wise dot products |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since you're already renaming the macros, can you change this to GGML_CUDA_CC_DP4A
? CUDA code is forwards-compatible so it should be implicitly clear that this is the minimum compute capability.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will do.
ggml/src/ggml-sycl/vecdotq.hpp
Outdated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is not a CUDA file but a SYCL file. It should not be changed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You are right, fixed it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please also change the compute capability macros for the non-NVIDIA architectures in order to be consistent.
Just so there is no confusion: I don't mean for you to touch any of e.g. the SYCL code, I'm only asking you to rename the macros like |
Do you have a suggestion for the prefix? Is the |
My opinion is that |
@JohannesGaessler feel free to look over it again, I implemented your suggestions. |
* Renames NVIDIA GPU-architecture flags to avoid name clashes with WinAPI. (e.g. CC_PASCAL, GPU architecture or WinAPI pascal compiler flag?) * Reverts erroneous rename in SYCL-code. * Renames GGML_CUDA_MIN_CC_DP4A to GGML_CUDA_CC_DP4A. * Renames the rest of the compute capability macros for consistency.
* Renames NVIDIA GPU-architecture flags to avoid name clashes with WinAPI. (e.g. CC_PASCAL, GPU architecture or WinAPI pascal compiler flag?) * Reverts erroneous rename in SYCL-code. * Renames GGML_CUDA_MIN_CC_DP4A to GGML_CUDA_CC_DP4A. * Renames the rest of the compute capability macros for consistency.
For example,
CC_PASCAL
is a conflict, as it is both a GPU architecture and a WinAPI pascal compiler flag.