You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Rename int4 to 8da4w in llama2 quantization (#2573)
Summary:
Pull Request resolved: #2573
int4 has been confused with "int4 weight only" before, when
in reality it is "int4 weights + int8 dynamic activations".
Renaming it to "8da4w" will reduce confusion and make it
more consistent with "8da4w-gptq".
#accept2land
Reviewed By: jerryzh168
Differential Revision: D55215146
fbshipit-source-id: 435c9b3e70e2546c8e0afc2df848546d7eb2d208
0 commit comments