You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Defer resolution of the default value of arguments used by quantize (#2738)
Summary:
Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing.
* For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully.
This diff defers default value resolution to quantize() since that is the direct call that uses them.
Differential Revision: D55458866
0 commit comments