HIP: add GGML_CUDA_CC_IS_* macros for testing for AMD architecture families #11601

Merged: 1 commit, Feb 2, 2025

@IMbackK (Collaborator) commented Feb 2, 2025

For AMD devices, architectures with increasing cc values are not supersets of each other, which makes checks against cc bug-prone.

This fixes a bug where RDNA1 GPUs other than gfx1010 were not handled correctly.

The GGML_CUDA_CC_IS_* macros are much less bug-prone, as they mirror the #if defined(CDNA) etc. checks we do in device code.
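To illustrate the point above, here is a minimal sketch of range-based family checks. It is not the code from this PR: the `EX_*` names, the constant values, and the assumed encoding (an AMD offset plus the gfx number read as hex) are hypothetical and chosen only for illustration.

```cpp
// Hypothetical constants: an AMD offset plus the gfx number read as hex.
// These values are assumptions for this sketch, not taken from the PR.
#define EX_CC_OFFSET_AMD 0x1000000
#define EX_CC_CDNA  (EX_CC_OFFSET_AMD + 0x908)
#define EX_CC_RDNA1 (EX_CC_OFFSET_AMD + 0x1010)
#define EX_CC_RDNA2 (EX_CC_OFFSET_AMD + 0x1030)

// Family tests as half-open ranges, so every member of a family matches
// and members of later, non-superset families do not.
#define EX_CC_IS_CDNA(cc)  ((cc) >= EX_CC_CDNA  && (cc) < EX_CC_RDNA1)
#define EX_CC_IS_RDNA1(cc) ((cc) >= EX_CC_RDNA1 && (cc) < EX_CC_RDNA2)

// Fragile check 1: equality only matches the first chip of the family.
static bool ex_is_rdna1_fragile(int cc) { return cc == EX_CC_RDNA1; }

// Fragile check 2: ">=" treats every later architecture as a superset.
static bool ex_is_cdna_fragile(int cc) { return cc >= EX_CC_CDNA; }

// Range-based versions built on the macros above.
static bool ex_is_rdna1(int cc) { return EX_CC_IS_RDNA1(cc); }
static bool ex_is_cdna(int cc)  { return EX_CC_IS_CDNA(cc); }

// A second RDNA1 chip (gfx1012 under the assumed encoding).
static const int ex_gfx1012 = EX_CC_OFFSET_AMD + 0x1012;

// The macros are constant expressions, so they can be checked at compile time.
static_assert(EX_CC_IS_RDNA1(EX_CC_OFFSET_AMD + 0x1012), "gfx1012 is in the RDNA1 range");
static_assert(!EX_CC_IS_CDNA(EX_CC_RDNA1), "RDNA1 is not in the CDNA range");
```

Under these assumptions, `ex_is_rdna1_fragile(ex_gfx1012)` is false while `ex_is_rdna1(ex_gfx1012)` is true, and `ex_is_cdna_fragile(EX_CC_RDNA1)` is true even though `ex_is_cdna(EX_CC_RDNA1)` is false.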

…res for AMD GPUs are not supersets of each other

This fixes a bug where RDNA1 GPUs other than gfx1010 were not handled correctly
@github-actions bot added the labels "Nvidia GPU" (Issues specific to Nvidia GPUs) and "ggml" (changes relating to the ggml tensor library for machine learning) on Feb 2, 2025
@JohannesGaessler (Collaborator) left a comment

If these functions are intended for host code and are not needed in preprocessor macros, you may consider using constexpr functions instead, but either way is fine, I think.
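A minimal sketch of what the constexpr alternative suggested here could look like. The `ex_constexpr` namespace, the function name, and the constant values are hypothetical and are not taken from the PR. One trade-off to keep in mind: a constexpr function cannot be evaluated inside an `#if` preprocessor condition, which is the case the comment sets aside.

```cpp
namespace ex_constexpr {

// Hypothetical constants mirroring the macro-style definitions
// (assumed encoding: an AMD offset plus the gfx number read as hex).
constexpr int CC_OFFSET_AMD = 0x1000000;
constexpr int CC_RDNA1      = CC_OFFSET_AMD + 0x1010;
constexpr int CC_RDNA2      = CC_OFFSET_AMD + 0x1030;

// Type-checked family test for host code. The argument is evaluated
// exactly once, unlike a function-like macro that repeats it.
constexpr bool cc_is_rdna1(int cc) {
    return cc >= CC_RDNA1 && cc < CC_RDNA2;
}

// Still usable in constant expressions such as static_assert.
static_assert(cc_is_rdna1(CC_OFFSET_AMD + 0x1012), "gfx1012 is in the RDNA1 range");
static_assert(!cc_is_rdna1(CC_RDNA2), "RDNA2 is outside the RDNA1 range");

} // namespace ex_constexpr
```

At a call site this reads as `if (ex_constexpr::cc_is_rdna1(cc)) { ... }`, with `cc` being whatever integer the host code already uses to identify the device.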

@IMbackK IMbackK merged commit 4d0598e into ggml-org:master Feb 2, 2025
46 checks passed
@IMbackK IMbackK deleted the ccFix branch February 2, 2025 21:08
tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
…res for AMD GPUs are not supersets of each other (ggml-org#11601)

This fixes a bug where RDNA1 GPUs other than gfx1010 were not handled correctly
orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request Feb 26, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025