SpQR Quantization? ... for Near-Lossless LLM Weight Compression #2061
ianscrivener started this conversation in Ideas
Replies: 1 comment 3 replies
-
#1602 (comment)
This almost sounds too good to be true... but the technique makes sense even to a layman:
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
https://arxiv.org/abs/2306.03078
https://github.com/Vahe1994/SpQR
via: https://www.superdatascience.com/podcast/near-lossless-llm-quantization