Update quantize.py to use AO's int4 quantizer #919

jackzhxng · 2024-07-18T00:54:29Z

Description

As a follow up from #882, remove duplicate code for WeightOnlyInt4QuantHandler and use TorchAO's respective API instead.

Test plan

Past existing tests.

pytorch-bot · 2024-07-18T00:54:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/919

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 706a447 with merge base ab85b2a ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

Jack-Khuu · 2024-07-19T17:49:08Z

Looks great, thanks for working on this
Let's wait until #927 to land first

Use ao's int4 quantizer

bab2d99

jackzhxng requested a review from larryliu0820 July 18, 2024 00:54

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jul 18, 2024

jackzhxng and others added 2 commits July 17, 2024 20:10

Point AO to commit hash of Jerry's fix

cf12fae

When device is cuda, only run for dtype==bfloat16

b127063

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

larryliu0820 requested review from jerryzh168 and Jack-Khuu July 18, 2024 18:56

larryliu0820 added 10 commits July 18, 2024 12:02

Typo

3f86188

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags:

Use tensor subclass for int4 weight only quant

cd1661a

Fix bug

5fecac9

Fix

83818a1

Use both quantizer and subclass API

3136315

Bug

1766fcc

unwrap tensor subclass for aoti

2b5e1a2

Add import

60b4de7

Eval fix

e37c6da

Evaluate AOTI

706a447

larryliu0820 approved these changes Jul 19, 2024

View reviewed changes

Jack-Khuu approved these changes Jul 19, 2024

View reviewed changes

larryliu0820 merged commit 87798fd into main Jul 19, 2024
51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update quantize.py to use AO's int4 quantizer #919

Update quantize.py to use AO's int4 quantizer #919

Uh oh!

jackzhxng commented Jul 18, 2024

Uh oh!

pytorch-bot bot commented Jul 18, 2024 •

edited

Loading

Uh oh!

Jack-Khuu commented Jul 19, 2024

Uh oh!

Uh oh!

Uh oh!

Update quantize.py to use AO's int4 quantizer #919

Update quantize.py to use AO's int4 quantizer #919

Uh oh!

Conversation

jackzhxng commented Jul 18, 2024

Description

Test plan

Uh oh!

pytorch-bot bot commented Jul 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/919

✅ No Failures

Uh oh!

Jack-Khuu commented Jul 19, 2024

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 18, 2024 •

edited

Loading