[xnnpack] Reexport after quantize in aot_compiler #7714

Merged: 1 commit merged into main on Jan 17, 2025

Conversation

digantdesai
Contributor

@digantdesai digantdesai commented Jan 16, 2025

Summary

The aot_compiler was not using the quantized model even when the quantize flag was set; this fixes that by re-exporting after quantization.

Test plan

Local tests and CI.

  • Before
-rw-r--r--  1 digantdesai  staff  13974408 Jan 16 15:44 mv2_xnnpack_fp32.pte
-rw-r--r--  1 digantdesai  staff  13974408 Jan 16 15:45 mv2_xnnpack_q8.pte
  • After
-rw-r--r--  1 digantdesai  staff  13974408 Jan 16 15:44 mv2_xnnpack_fp32.pte
-rw-r--r--  1 digantdesai  staff   3572400 Jan 16 16:05 mv2_xnnpack_q8.pte
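The before/after sizes above show the symptom: the "quantized" `.pte` was byte-identical to the fp32 one before the fix, and roughly 4x smaller after. A minimal sketch of this kind of bug, with hypothetical stand-in functions (not the actual aot_compiler API): if the program is exported before quantization and the quantization result is never rebound or re-exported, the original fp32 graph is what gets serialized.

```python
# Hypothetical sketch of the bug pattern this PR fixes. quantize() and
# export_program() are illustrative stand-ins, not real ExecuTorch APIs.

def quantize(model):
    # Stand-in for a quantization pass: returns a new, smaller model.
    return {**model, "quantized": True, "size": model["size"] // 4}

def export_program(model):
    # Stand-in for exporting the (possibly transformed) model to a program.
    return dict(model)

def compile_buggy(model, do_quantize):
    program = export_program(model)   # exported too early
    if do_quantize:
        quantize(model)               # BUG: result discarded; fp32 program kept
    return program

def compile_fixed(model, do_quantize):
    if do_quantize:
        model = quantize(model)       # rebind to the quantized model
    return export_program(model)      # re-export after quantization

if __name__ == "__main__":
    fp32 = {"name": "mv2", "quantized": False, "size": 13974408}
    print(compile_buggy(fp32, True)["size"])   # unchanged fp32 size (the bug)
    print(compile_fixed(fp32, True)["size"])   # roughly a quarter of the size
```

The essence of the fix is the rebinding plus re-export in `compile_fixed`: the export must happen on the quantized graph, not before the transform.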

cc @mcr229

@digantdesai digantdesai added labels on Jan 16, 2025: bug; module: xnnpack (Issues related to xnnpack delegation and the code under backends/xnnpack/); release notes: examples (Changes to any of our example LLMs integrations, such as Llama3 and Llava)
@digantdesai digantdesai requested a review from mcr229 January 16, 2025 22:11

pytorch-bot bot commented Jan 16, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7714

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit bf5367a with merge base a18f6e8 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jan 16, 2025
@facebook-github-bot
Contributor

@digantdesai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@digantdesai digantdesai merged commit 2f0518d into main Jan 17, 2025
10 of 11 checks passed
@digantdesai digantdesai deleted the reexport_after_quant branch January 17, 2025 00:03
shoumikhin added a commit that referenced this pull request Jan 22, 2025
huydhn added a commit that referenced this pull request Jan 23, 2025
@huydhn
Contributor

huydhn commented Jan 24, 2025

@digantdesai @shoumikhin I believe this is the PR that is causing the trunk failure with emformer_transcribe. I attempted to revert the change in #7879 and the signal recovered. I see that @shoumikhin is attempting the same revert here, so maybe he could also confirm this.

GH job link · HUD commit link
