Vulkan-run-test: fix mmq_wg_denoms #11343

AMD-dwang · 2025-01-22T06:24:51Z

There should be a copy-and-paste error here.
*mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

0cc4m · 2025-01-22T08:54:11Z

Yes, I copied this wrong. It didn't make a difference because wg_denoms == mmq_wg_denoms for non-coopmat2, but it should be fixed. I'll run a test later, but it should be fine.

AMD-dwang · 2025-01-22T09:38:35Z

Yes, I copied this wrong. It didn't make a difference because wg_denoms == mmq_wg_denoms for non-coopmat2, but it should be fixed. I'll run a test later, but it should be fine.

l_wg_denoms or l_mmq_wg_denoms may be modified. Please see https://github.com/AMD-dwang/llama.cpp/blob/vulkan-compMat/ggml/src/ggml-vulkan/ggml-vulkan.cpp#L1476-L1499

l_wg_denoms and l_warptile_mmq will mismatch once it is modified. I just hit this error, then the number of work group is insufficient.

0cc4m · 2025-01-22T10:10:40Z

You're right, I forgot about that logic. Not sure it's needed anymore, but the fix is definitely necessary.

0cc4m

Thank you for the fix, looks good.

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

Vulkan-run-test: fix mmq_wg_denoms

c31a340

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

github-actions bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Jan 22, 2025

0cc4m self-requested a review January 22, 2025 08:54

jeffbolznv approved these changes Jan 22, 2025

View reviewed changes

0cc4m approved these changes Jan 23, 2025

View reviewed changes

0cc4m merged commit 955a6c2 into ggml-org:master Jan 23, 2025
45 checks passed

0cc4m mentioned this pull request Jan 23, 2025

vulkan: implement initial support for IQ2 and IQ3 quantizations #11360

Merged

anagri pushed a commit to BodhiSearch/llama.cpp that referenced this pull request Jan 26, 2025

Vulkan-run-test: fix mmq_wg_denoms (ggml-org#11343)

13298ab

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025

Vulkan-run-test: fix mmq_wg_denoms (ggml-org#11343)

1283989

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

Vulkan-run-test: fix mmq_wg_denoms (ggml-org#11343)

d83ccfe

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

Vulkan-run-test: fix mmq_wg_denoms (ggml-org#11343)

8a748ef

There should be a copy-and-paste error here. *mmq_wg_denoms should be used together with *warptile_mmq, instead of wg_denoms.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vulkan-run-test: fix mmq_wg_denoms #11343

Vulkan-run-test: fix mmq_wg_denoms #11343

Uh oh!

AMD-dwang commented Jan 22, 2025

Uh oh!

0cc4m commented Jan 22, 2025

Uh oh!

AMD-dwang commented Jan 22, 2025

Uh oh!

0cc4m commented Jan 22, 2025

Uh oh!

0cc4m left a comment

Uh oh!

Uh oh!

Uh oh!

Vulkan-run-test: fix mmq_wg_denoms #11343

Vulkan-run-test: fix mmq_wg_denoms #11343

Uh oh!

Conversation

AMD-dwang commented Jan 22, 2025

Uh oh!

0cc4m commented Jan 22, 2025

Uh oh!

AMD-dwang commented Jan 22, 2025

Uh oh!

0cc4m commented Jan 22, 2025

Uh oh!

0cc4m left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!