
metal: Copy kernels for quant to F32 conversions (#10976). #12017


Merged: 1 commit merged into ggml-org:master on Feb 25, 2025

Conversation

gcp (Contributor) commented on Feb 22, 2025:

Modeled after the CUDA implementations.

Because of the use of type4x4, I couldn't see how to reuse the existing dequantize functions, so they are repeated here in float form.

Fixes issue #10976.
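The kernels added here expand quantized blocks into F32 on copy. A CPU-side C++ sketch of the idea, using a simplified, hypothetical Q8_0-style block (one float scale plus 32 int8 quants; the real ggml struct stores the scale in half precision and this is not the PR's actual Metal code):

```cpp
#include <vector>

// Hypothetical, simplified Q8_0-like block: 32 int8 quants sharing one scale.
struct block_q8 {
    float d;             // per-block scale
    signed char qs[32];  // quantized values
};

// CPU-side analog of a quant -> F32 copy: walk the source blocks and
// dequantize each element into the float destination buffer.
std::vector<float> copy_q8_to_f32(const std::vector<block_q8> &src) {
    std::vector<float> dst;
    dst.reserve(src.size() * 32);
    for (const auto &b : src) {
        for (int i = 0; i < 32; ++i) {
            dst.push_back(b.d * b.qs[i]); // dequantize one element
        }
    }
    return dst;
}
```

On the GPU the same expansion is done per threadgroup rather than in a sequential loop, but the per-element arithmetic is the same.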

github-actions bot added the labels ggml (changes relating to the ggml tensor library for machine learning) and Apple Metal (https://en.wikipedia.org/wiki/Metal_(API)) on Feb 22, 2025
gcp force-pushed the cpy_metal_quants branch from a43f2fb to 9d00bc2 on February 22, 2025 00:08
ggerganov force-pushed the cpy_metal_quants branch 2 times, most recently from c642a56 to be1542e, on February 22, 2025 09:51
ggerganov (Member) commented:
The dequantize functions return a group of 16 elements from a given block of quants. The short il argument specifies the index of the group, i.e. il == 0 returns the first 16 elements, il == 1 returns the second 16, and so on. For quantizations with a block size of 32, il = [0..1], while for quantizations with a block size of 256, il = [0..15].
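The group indexing can be illustrated with a small CPU-side C++ sketch. The block layout below is a simplified, hypothetical Q8_0-style block (float scale plus 32 int8 quants), not the exact ggml struct or the Metal template:

```cpp
#include <array>

// Hypothetical, simplified Q8_0-like block: 32 int8 quants sharing one
// float scale (the real ggml block stores the scale in half precision).
struct block_q8 {
    float d;                         // per-block scale
    std::array<signed char, 32> qs;  // quantized values
};

// Return the il-th group of 16 dequantized elements from a block.
// With a block size of 32, il is 0 or 1; a 256-element block would
// allow il = 0..15, matching the indexing described above.
inline void dequantize_group(const block_q8 &b, int il, float out[16]) {
    for (int i = 0; i < 16; ++i) {
        out[i] = b.d * b.qs[il * 16 + i];
    }
}
```

A copy kernel built on such a function just calls it once per group until the whole block has been written to the destination.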

I pushed an implementation that uses the dequantize functions and also supports copying to F16, although the latter is not yet implemented on the CPU, so it is currently untested.

metal: use dequantize_q templates

---------

Co-authored-by: Georgi Gerganov <[email protected]>
gcp force-pushed the cpy_metal_quants branch from be1542e to bfc305a on February 23, 2025 18:03
gcp (Contributor, Author) commented on Feb 23, 2025:

All OK from my side.

@ggerganov ggerganov merged commit 58d07a8 into ggml-org:master Feb 25, 2025
47 checks passed
orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request on Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request on Mar 8, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request on Mar 19, 2025
mostlyuseful pushed a commit to mostlyuseful/llama.cpp that referenced this pull request on May 12, 2025