-
Notifications
You must be signed in to change notification settings - Fork 608
Defer resolution of the default value of arguments used by quantize #2738
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2738
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 0bdf70f with merge base 45c2557 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D55458866 |
"-G", | ||
"--group_size", | ||
type=int, | ||
default=256, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since group_size doesn't make sense for other settings like fp, maybe default to None as well
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should probably do the same for the GPTQ specific args then (calibration_*)
I'll change this PR to update all of them and the lower one to just plumbing and keeping None
This pull request was exported from Phabricator. Differential Revision: D55458866 |
…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866
4bf35d0
to
a2d0e22
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
a2d0e22
to
d514ae3
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866
d514ae3
to
4d096db
Compare
…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866
4d096db
to
81e9225
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866
81e9225
to
fde8e4a
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866
fde8e4a
to
2a3e6ab
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
Summary: Previously group size wasn't being passed properly to 4b quant. This just passes it through Reviewed By: mergennachin Differential Revision: D55458352
…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866
2a3e6ab
to
0bdf70f
Compare
This pull request was exported from Phabricator. Differential Revision: D55458866 |
This pull request has been merged in 57e3449. |
…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866 fbshipit-source-id: 1afc5a62d214409f31c43c07be66bfd69712bb74
Summary:
Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing.
This diff defers default value resolution to quantize() since that is the direct call that uses them.
Differential Revision: D55458866