Update README.md #479

Merged
1 commit merged into main on Apr 25, 2024

Conversation

mikekgfb (Contributor)

readme update
@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Apr 25, 2024
@mikekgfb (Contributor, Author)

README-only update. Please ignore pending tests.

@mikekgfb merged commit a024fb2 into main on Apr 25, 2024
@mikekgfb deleted the mikekgfb-patch-16 branch on Apr 25, 2024 at 07:51
larryliu0820 added a commit that referenced this pull request Jul 5, 2024
malfet pushed 6 commits that referenced this pull request Jul 17, 2024
larryliu0820 added 2 commits that referenced this pull request Jul 17, 2024
* Update quantize.py to use torchao Quantizers

Summary:

Remove the duplicated code for Int4WeightOnlyQuantizer and
Int8DynActInt4WeightQuantizer and use the torchao API instead.

Test Plan:

```
python torchchat.py generate llama2 --quantize '{"linear:int4": {"groupsize": 256}, "precision": {"dtype":"float16"}, "executor":{"accelerator":"cpu"}}' --prompt "Once upon a time," --max-new-tokens 256
python torchchat.py generate llama2 --quantize '{"linear:a8w4dq": {"groupsize": 256}, "precision": {"dtype":"float16"}, "executor":{"accelerator":"cpu"}}' --prompt "Once upon a time," --max-new-tokens 256
```
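For illustration, the `--quantize` flag in the test plan above takes a JSON config whose top-level keys name quantization steps. The sketch below shows one plausible way such a config could be parsed and dispatched; the handler functions and the `apply_quantization` helper are hypothetical and do not reflect torchchat's actual implementation.

```python
import json

# Hypothetical handlers -- stand-ins for real quantizer invocations.
def handle_linear_int4(model, groupsize):
    # In torchchat this step would delegate to a torchao int4
    # weight-only quantizer; here we only record the parameters.
    print(f"int4 weight-only quantization, groupsize={groupsize}")
    return model

def handle_precision(model, dtype):
    print(f"casting model to {dtype}")
    return model

# Dispatch table keyed by the JSON config's top-level names.
HANDLERS = {
    "linear:int4": lambda m, opts: handle_linear_int4(m, opts["groupsize"]),
    "precision": lambda m, opts: handle_precision(m, opts["dtype"]),
}

def apply_quantization(model, quantize_json):
    """Apply each recognized quantization step from a --quantize JSON string."""
    config = json.loads(quantize_json)
    for key, opts in config.items():
        handler = HANDLERS.get(key)
        if handler is None:
            continue  # keys like "executor" are consumed elsewhere
        model = handler(model, opts)
    return model

model = apply_quantization(
    object(),
    '{"linear:int4": {"groupsize": 256}, "precision": {"dtype": "float16"}}',
)
```

Unrecognized keys are skipped rather than rejected here, mirroring how a CLI might route options such as `"executor"` to a different subsystem.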

* Fix import

* Install torchao from gh

* Explain import

* Fix dependencies

* Test ao PR #479

* Update torchao hash

* Update torchao pin

* Fix scheduler bf16/fp16 mix error

* Incorporate torchao changes

* update hash

* Fix GPU CI job

* More fix

* Fix executorch CI job

* Use quant api for int4 weight only quantization

* Fix

* Fix again

* Fix 3

* Fix 4

* Try something

* debug

* Only migrate 8a4w

---------

Co-authored-by: Jack Zhang <[email protected]>