-
Notifications
You must be signed in to change notification settings - Fork 607
[quant] Use Int8DynActInt4WeightQuantizer
in torchao
#2551
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2551
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 1 Unrelated FailureAs of commit c061495 with merge base 6bef9e7 ( NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
…` in torchao" Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
@jerryzh168 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
…` in torchao" Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
3c40571
to
aaee570
Compare
…` in torchao" Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
Summary: att Test Plan: python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
aaee570
to
f401505
Compare
@jerryzh168 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@jerryzh168 merged this pull request in a6aefc0. |
Stack from ghstack (oldest at bottom):
Int8DynActInt4WeightQuantizer
in torchao #2551Summary:
att
Test Plan:
python3 -m examples.models.llama2.export_llama -c stories110M.pt -p params.json -qmode 8da4w -X -d fp32
Reviewers:
Subscribers:
Tasks:
Tags: