Commit 84c4ee1

metascroy authored and malfet committed
new gguf parsing for Q40 that conforms with pytorch's quantization stack (#150)
* new gguf parsing for Q40 that conforms with pytorch's quantization stack
* updates
* add q6_k and clean up q40
* fixes to unpack_q40
1 parent e49d36e commit 84c4ee1
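
For readers unfamiliar with the Q4_0 format named in the commit title, here is a minimal sketch of how a single GGML Q4_0 block is commonly decoded. It is not the code from this commit (the diff body is not shown here); the function name `dequantize_q4_0_block` and the constants are hypothetical, and the layout is assumed to be the standard GGML one: 32 weights per block, stored as one little-endian fp16 scale followed by 16 bytes of packed 4-bit values, with low nibbles holding elements 0 to 15 and high nibbles holding elements 16 to 31.

```python
import struct

# Assumed GGML Q4_0 layout: 32 weights per block,
# 2 bytes of fp16 scale + 16 bytes of packed 4-bit values.
Q4_0_BLOCK_SIZE = 32
Q4_0_BLOCK_BYTES = 2 + Q4_0_BLOCK_SIZE // 2  # 18 bytes


def dequantize_q4_0_block(block: bytes) -> list[float]:
    """Decode one Q4_0 block into 32 floats (hypothetical helper)."""
    if len(block) != Q4_0_BLOCK_BYTES:
        raise ValueError(
            f"expected {Q4_0_BLOCK_BYTES} bytes, got {len(block)}"
        )

    # "<e" is a little-endian IEEE 754 half-precision float.
    (scale,) = struct.unpack("<e", block[:2])
    packed = block[2:]

    # Each byte holds two unsigned 4-bit values; subtracting 8
    # recentres them into the signed range [-8, 7].
    low = [(b & 0x0F) - 8 for b in packed]   # elements 0..15
    high = [(b >> 4) - 8 for b in packed]    # elements 16..31

    return [scale * q for q in low + high]


# Usage: scale 0.5, every byte 0xF0 (low nibble 0, high nibble 15).
example = struct.pack("<e", 0.5) + bytes([0xF0] * 16)
values = dequantize_q4_0_block(example)
```

A tensor stored as Q4_0 would be a sequence of such 18-byte blocks; how this commit regroups them to fit PyTorch's quantization stack is not visible from this page.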

4 files changed, with 222 additions and 360 deletions

gguf_util/ggml_quantization_type/Q4_0.py

Lines changed: 0 additions & 252 deletions
This file was deleted.