CUDA: fix FP16 cuBLAS GEMM #11396

JohannesGaessler · 2025-01-24T19:37:37Z

#11356 introduced a bug that caused FP16 cuBLAS GEMM to be incorrect because the wrong pointer was being used. This PR fixes it.

IMbackK · 2025-01-24T19:49:58Z

Upps sorry about that, copy paste mistake.

This reverts commit c5d9eff.

This reverts commit f0e6b2a.

CUDA: fix FP16 cuBLAS GEMM

8aa0338

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jan 24, 2025

JohannesGaessler mentioned this pull request Jan 24, 2025

Avoid fp32->fp16->fp32 conversion on cdna in ggml_cuda_op_mul_mat_cublas #11356

Merged

slaren approved these changes Jan 24, 2025

View reviewed changes

JohannesGaessler merged commit c5d9eff into ggml-org:master Jan 24, 2025
44 checks passed

anagri pushed a commit to BodhiSearch/llama.cpp that referenced this pull request Jan 26, 2025

CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)

46a6df7

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Jan 26, 2025

Revert "CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)"

f0e6b2a

This reverts commit c5d9eff.

Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Jan 28, 2025

Reapply "CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)"

aefe880

This reverts commit f0e6b2a.

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025

CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)

21fd5e6

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)

95413b2

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

CUDA: fix FP16 cuBLAS GEMM (ggml-org#11396)

2f3a224

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA: fix FP16 cuBLAS GEMM #11396

CUDA: fix FP16 cuBLAS GEMM #11396

Uh oh!

JohannesGaessler commented Jan 24, 2025

Uh oh!

IMbackK commented Jan 24, 2025

Uh oh!

Uh oh!

Uh oh!

CUDA: fix FP16 cuBLAS GEMM #11396

CUDA: fix FP16 cuBLAS GEMM #11396

Uh oh!

Conversation

JohannesGaessler commented Jan 24, 2025

Uh oh!

IMbackK commented Jan 24, 2025

Uh oh!

Uh oh!

Uh oh!