Replies: 1 comment
-
Sorry it is already there in quantize help, found it |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I could not find any documentation on how to do quantisation of llama model with new k-quant methods: q2_K, q3_K_S, q3_K_M, q3_K_L, q4_K_S, q4_K_M, q5_K_S, q6_K.
It will be very helpful if someone could share the steps/code to run that.
Beta Was this translation helpful? Give feedback.
All reactions