
vulkan: fix diag_mask_inf #11323


Merged
merged 1 commit on Jan 23, 2025
Conversation

jeffbolznv
Collaborator

With robustBufferAccess disabled, this shader was showing OOB stores. There is a bounds check in the code, but the workgroup dimensions were reversed relative to the CUDA kernel and it was running the wrong number of threads. So fix the workgroup dimensions and disable robustness for this pipeline.
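
For context, here is a simplified sketch of the kernel pattern involved. This is not the actual llama.cpp shader: the push-constant names, bindings, and the 512-wide column axis are assumptions used only to illustrate how reversed workgroup dimensions defeat the bounds check.

```glsl
#version 450

// Hypothetical sketch of a diag_mask_inf compute shader: one invocation per
// (row, col) element, writing -inf above the shifted diagonal.
layout(local_size_x = 1, local_size_y = 512, local_size_z = 1) in;

layout(push_constant) uniform Params {
    uint ncols;             // columns per row
    uint rows_per_channel;  // rows belonging to one channel/head
    int  n_past;            // columns left unmasked
} p;

layout(std430, binding = 0) readonly  buffer Src { float src[]; };
layout(std430, binding = 1) writeonly buffer Dst { float dst[]; };

void main() {
    const uint col = gl_GlobalInvocationID.y;  // the bounds-checked axis
    const uint row = gl_GlobalInvocationID.x;

    // The guard only limits the column axis. If the workgroup dimensions are
    // set up reversed relative to what this indexing expects, the wrong
    // number of invocations run along each axis: the check still passes for
    // all of them while some index past the end of the buffers, so the store
    // below lands out of bounds.
    if (col >= p.ncols) {
        return;
    }

    const uint i = row * p.ncols + col;
    const bool masked = int(col) > p.n_past + int(row % p.rows_per_channel);
    dst[i] = masked ? uintBitsToFloat(0xFF800000u) : src[i];
}
```

With robustBufferAccess enabled, such out-of-bounds stores are discarded or clamped to the buffer, which is presumably why the bug went unnoticed on most devices until robustness was turned off.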

@jeffbolznv jeffbolznv requested a review from 0cc4m January 21, 2025 04:20
@github-actions github-actions bot added the Vulkan (Issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels Jan 21, 2025
Collaborator

@0cc4m 0cc4m left a comment


Interesting bug I introduced when I ported this shader from CUDA, and funny that it worked anyway on most devices.

@0cc4m 0cc4m merged commit 5245729 into ggml-org:master Jan 23, 2025
45 checks passed
anagri pushed a commit to BodhiSearch/llama.cpp that referenced this pull request Jan 26, 2025
tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
Labels
ggml (changes relating to the ggml tensor library for machine learning), Vulkan (Issues specific to the Vulkan backend)
2 participants