Add filter function to XNNPack Quantizer #10626

pssrawat · 2025-05-01T21:06:49Z

Summary:
Like HTP quantizer, add support so that user can specify a filter function to xnnpack quantizer. If specified, we only quantize nodes that return True for the filter function as well. This allows a much finer control on how we quantize a graph.

For multichannel ASR, we don't want to quantize certain nodes in certain layers of the encoder. These nodes don't have a proper module_name, so having a proper controlled suppression of quantization for such nodes is not feasible without a filter function.

Differential Revision: D73677442

pytorch-bot · 2025-05-01T21:06:52Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10626

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 251698a with merge base e912c65 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-05-01T21:07:04Z

This pull request was exported from Phabricator. Differential Revision: D73677442

Summary: Like HTP quantizer, add support so that user can specify a filter function to xnnpack quantizer. If specified, we only quantize nodes that return True for the filter function as well. This allows a much finer control on how we quantize a graph. For multichannel ASR, we don't want to quantize certain nodes in certain layers of the encoder. These nodes don't have a proper module_name, so having a proper controlled suppression of quantization for such nodes is not feasible without a filter function. Reviewed By: mcr229 Differential Revision: D73677442

Summary: Pull Request resolved: pytorch#10626 Like HTP quantizer, add support so that user can specify a filter function to xnnpack quantizer. If specified, we only quantize nodes that return True for the filter function as well. This allows a much finer control on how we quantize a graph. For multichannel ASR, we don't want to quantize certain nodes in certain layers of the encoder. These nodes don't have a proper module_name, so having a proper controlled suppression of quantization for such nodes is not feasible without a filter function. Reviewed By: mcr229 Differential Revision: D73677442

facebook-github-bot · 2025-05-01T23:59:44Z

This pull request was exported from Phabricator. Differential Revision: D73677442

Differential Revision: D73677442 Pull Request resolved: pytorch#10626

pssrawat requested review from digantdesai and mcr229 as code owners May 1, 2025 21:06

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 1, 2025

facebook-github-bot added the fb-exported label May 1, 2025

pssrawat added the topic: not user facing label May 1, 2025

mcr229 approved these changes May 1, 2025

View reviewed changes

pssrawat force-pushed the export-D73677442 branch from dad79c2 to e9cdd19 Compare May 1, 2025 23:56

pssrawat force-pushed the export-D73677442 branch from e9cdd19 to 251698a Compare May 1, 2025 23:59

facebook-github-bot merged commit d7030aa into pytorch:main May 2, 2025
85 of 87 checks passed

jhelsby pushed a commit to jhelsby/executorch that referenced this pull request May 9, 2025

Add filter function to XNNPack Quantizer

83139e4

Differential Revision: D73677442 Pull Request resolved: pytorch#10626

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add filter function to XNNPack Quantizer #10626

Add filter function to XNNPack Quantizer #10626

Uh oh!

pssrawat commented May 1, 2025

Uh oh!

pytorch-bot bot commented May 1, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented May 1, 2025

Uh oh!

facebook-github-bot commented May 1, 2025

Uh oh!

Uh oh!

Uh oh!

Add filter function to XNNPack Quantizer #10626

Add filter function to XNNPack Quantizer #10626

Uh oh!

Conversation

pssrawat commented May 1, 2025

Uh oh!

pytorch-bot bot commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10626

✅ No Failures

Uh oh!

facebook-github-bot commented May 1, 2025

Uh oh!

facebook-github-bot commented May 1, 2025

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented May 1, 2025 •

edited

Loading