Defer resolution of the default value of arguments used by quantize #2738

Jack-Khuu · 2024-03-28T02:16:18Z

Summary:
Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing.

For example, previously the default value of calibration tasks was [], which is not something Int8DynActInt4WeightGPTQQuantizer handles gracefully.

This diff defers default value resolution to quantize() since that is the direct call that uses them.

Differential Revision: D55458866

pytorch-bot · 2024-03-28T02:16:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2738

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 0bdf70f with merge base 45c2557 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-03-28T02:16:26Z

This pull request was exported from Phabricator. Differential Revision: D55458866

jerryzh168 · 2024-03-28T03:10:18Z

examples/models/llama2/export_llama_lib.py

+        "-G",
+        "--group_size",
+        type=int,
+        default=256,


since group_size doesn't make sense for other settings like fp, maybe default to None as well

We should probably do the same for the GPTQ specific args then (calibration_*)

I'll change this PR to update all of them and the lower one to just plumbing and keeping None

facebook-github-bot · 2024-03-28T04:47:36Z

This pull request was exported from Phabricator. Differential Revision: D55458866

…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866

facebook-github-bot · 2024-03-28T04:54:16Z

This pull request was exported from Phabricator. Differential Revision: D55458866

facebook-github-bot · 2024-03-28T05:00:47Z

This pull request was exported from Phabricator. Differential Revision: D55458866

…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866

…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866

facebook-github-bot · 2024-03-28T17:16:09Z

This pull request was exported from Phabricator. Differential Revision: D55458866

…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Differential Revision: D55458866

facebook-github-bot · 2024-03-28T17:17:05Z

This pull request was exported from Phabricator. Differential Revision: D55458866

…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866

facebook-github-bot · 2024-03-28T22:50:14Z

This pull request was exported from Phabricator. Differential Revision: D55458866

Summary: Previously group size wasn't being passed properly to 4b quant. This just passes it through Reviewed By: mergennachin Differential Revision: D55458352

…ytorch#2738) Summary: Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866

facebook-github-bot · 2024-03-28T22:51:54Z

This pull request was exported from Phabricator. Differential Revision: D55458866

facebook-github-bot · 2024-03-29T00:55:50Z

This pull request has been merged in 57e3449.

…ytorch#2738) Summary: Pull Request resolved: pytorch#2738 Quantize() (specifically GPTQ) is the sole user of the many params, but default values are introduced early and in multiple places. This is bug prone and confusing. * For example, previously the default value of calibration tasks was [], which is not something `Int8DynActInt4WeightGPTQQuantizer` handles gracefully. This diff defers default value resolution to quantize() since that is the direct call that uses them. Reviewed By: jerryzh168 Differential Revision: D55458866 fbshipit-source-id: 1afc5a62d214409f31c43c07be66bfd69712bb74

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 28, 2024

facebook-github-bot added the fb-exported label Mar 28, 2024

jerryzh168 approved these changes Mar 28, 2024

View reviewed changes

jerryzh168 reviewed Mar 28, 2024

View reviewed changes

Jack-Khuu changed the title ~~Change the default argument for Calibration Tasks for GPTQ~~ Defer resolution of the default value of arguments used by quantize Mar 28, 2024

Jack-Khuu force-pushed the export-D55458866 branch from 4bf35d0 to a2d0e22 Compare March 28, 2024 04:47

Jack-Khuu force-pushed the export-D55458866 branch from a2d0e22 to d514ae3 Compare March 28, 2024 04:54

Jack-Khuu force-pushed the export-D55458866 branch from d514ae3 to 4d096db Compare March 28, 2024 05:01

Jack-Khuu force-pushed the export-D55458866 branch from 4d096db to 81e9225 Compare March 28, 2024 17:15

Jack-Khuu force-pushed the export-D55458866 branch from 81e9225 to fde8e4a Compare March 28, 2024 17:16

Jack-Khuu force-pushed the export-D55458866 branch from fde8e4a to 2a3e6ab Compare March 28, 2024 22:50

Jack-Khuu added 2 commits March 28, 2024 15:51

Plumb group_size to 4b quant (pytorch#2734)

d91861e

Summary: Previously group size wasn't being passed properly to 4b quant. This just passes it through Reviewed By: mergennachin Differential Revision: D55458352

Jack-Khuu force-pushed the export-D55458866 branch from 2a3e6ab to 0bdf70f Compare March 28, 2024 22:51

facebook-github-bot closed this in 57e3449 Mar 29, 2024

facebook-github-bot added the Merged label Mar 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Defer resolution of the default value of arguments used by quantize #2738

Defer resolution of the default value of arguments used by quantize #2738

Uh oh!

Jack-Khuu commented Mar 28, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 28, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

jerryzh168 Mar 28, 2024

Uh oh!

Jack-Khuu Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 29, 2024

Uh oh!

Uh oh!

Defer resolution of the default value of arguments used by quantize #2738

Defer resolution of the default value of arguments used by quantize #2738

Uh oh!

Conversation

Jack-Khuu commented Mar 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2738

✅ No Failures

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

jerryzh168 Mar 28, 2024

Choose a reason for hiding this comment

Uh oh!

Jack-Khuu Mar 28, 2024

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 28, 2024

Uh oh!

facebook-github-bot commented Mar 29, 2024

Uh oh!

Uh oh!

Jack-Khuu commented Mar 28, 2024 •

edited

Loading

pytorch-bot bot commented Mar 28, 2024 •

edited

Loading