
Qualcomm AI Engine Direct - Add rewrite function of observer #10093


Merged
2 commits merged into pytorch:main on Jun 2, 2025

Conversation

chunit-quic
Collaborator

@chunit-quic chunit-quic commented Apr 11, 2025

  • Add function to rewrite observer after prepare_pt2e
  • Add corresponding test case

@chunit-quic chunit-quic requested a review from cccclai as a code owner April 11, 2025 00:25

pytorch-bot bot commented Apr 11, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10093

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 3caeaab with merge base 95a1db5:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 11, 2025
@chunit-quic
Collaborator Author

Hi @cccclai,

This PR adds the requested functionality to overwrite quantization parameters after calibration. It can handle shared observers too. Thank you!

@cccclai
Contributor

cccclai commented Apr 15, 2025

@chunit-quic thanks for putting up the pr. cc: @sxu @billmguo

@cccclai
Contributor

cccclai commented Apr 15, 2025

Mind sharing a bit more detail about the request? I probably missed it.

@chunit-quic
Collaborator Author

chunit-quic commented Apr 16, 2025

Mind sharing a bit more detail about the request? I probably missed it.

No problem. :)
We received some feature requests in a mail thread regarding quantization requirements. This particular request is for the following purpose:

The QNN Quantizer should allow users to override quantization parameters for specific tensors, regardless of the data ranges observed during calibration or QAT. This override must respect the transitive closures established by SharedQuantizationSpec.

After a brief discussion with your team, we concluded that the rewriting stage should occur after calibration but before conversion. That's essentially the background.
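The sharing constraint described above can be illustrated with a small, torch-free sketch (the class and tensor names here are illustrative, not the PR's actual API): several tensor names alias one observer object, so a post-calibration override has to replace the shared object everywhere it appears.

```python
# Illustrative sketch of the SharedQuantizationSpec invariant: several
# tensors can point at the same observer object, so an override must go
# through the shared reference rather than per-tensor copies.

class MinMaxObserver:
    """Stand-in for a torch observer that tracks a running min/max."""
    def __init__(self, min_val=float("inf"), max_val=float("-inf")):
        self.min_val = min_val
        self.max_val = max_val

    def observe(self, values):
        self.min_val = min(self.min_val, min(values))
        self.max_val = max(self.max_val, max(values))

# Two tensors share one observer (the "transitive closure" case).
shared = MinMaxObserver()
observers = {"conv_out": shared, "add_out": shared}

# Calibration sees some data range...
observers["conv_out"].observe([-1.0, 2.0])

# ...but the user overrides the range after calibration. Because both
# names alias the same object, replacing every alias keeps the group
# consistent, regardless of the calibrated range.
override = MinMaxObserver(min_val=-4.0, max_val=4.0)
for name, obs in observers.items():
    if obs is shared:
        observers[name] = override
```

After the override, both tensor names still refer to one observer, which is the property the PR's rewrite must preserve.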

@cccclai cccclai requested review from sxu and billmguo April 17, 2025 19:35
@sxu
Contributor

sxu commented Apr 24, 2025

This looks like it would work to me; however, it would be great if someone from AO could confirm that overwriting the activation post-processing submodules this way is enough and that there are no other references to them.

@sxu
Contributor

sxu commented Apr 24, 2025

Maybe @jerryzh168 can confirm?

@cccclai
Contributor

cccclai commented May 1, 2025

@kirklandsign can you help check the CI signal for this one too? I remember there were some errors.

@cccclai
Contributor

cccclai commented May 23, 2025

Hey sorry for being late, can you rebase this PR?

@chunit-quic chunit-quic force-pushed the dev_rewrite_qconfig branch from 2c9c79c to b92c102 on May 26, 2025 00:47
@chunit-quic
Collaborator Author

Hey sorry for being late, can you rebase this PR?

Done! Thank you.

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai
Contributor

cccclai commented May 28, 2025

Hey, there are some errors likely due to the PyTorch pin update. Mind rebasing again? Sorry for the inconvenience.

@chunit-quic
Collaborator Author

No problem. Just rebased. Please feel free to let me know if there is any issue. Thank you!

qscheme=torch.per_tensor_affine,
)

rewrite_prepared_observer(prepared, {"activation_post_process_2": new_obs})
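For readers skimming the thread, a rough idea of what a rewrite helper like this could do, modeled on a plain dict instead of a prepared torch.fx GraphModule (everything below is an illustrative sketch, not the PR's implementation):

```python
def rewrite_prepared_observer(prepared, name_to_new_obs):
    """Replace named observer entries, keeping shared observers shared.

    `prepared` is modeled here as a plain dict of name -> observer; the
    real function in the PR walks the named submodules of a prepared
    GraphModule instead.
    """
    for target_name, new_obs in name_to_new_obs.items():
        old_obs = prepared[target_name]
        # Any name bound to the same old observer object gets the same
        # replacement, so SharedQuantizationSpec groups stay in sync.
        for name, obs in list(prepared.items()):
            if obs is old_obs:
                prepared[name] = new_obs

class FixedObserver:
    """Stand-in for an observer constructed with a fixed range."""
    def __init__(self, min_val, max_val):
        self.min_val, self.max_val = min_val, max_val

# _1 and _2 share one observer; _3 has its own.
shared = object()
prepared = {
    "activation_post_process_1": shared,
    "activation_post_process_2": shared,
    "activation_post_process_3": object(),
}
new_obs = FixedObserver(-8.0, 8.0)
rewrite_prepared_observer(prepared, {"activation_post_process_2": new_obs})
```

Rewriting `activation_post_process_2` also updates `activation_post_process_1`, since they shared an observer, while `activation_post_process_3` is untouched.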
Contributor

@jerryzh168 jerryzh168 May 28, 2025

@sxu does your callback work for this? maybe you can share your example

@cccclai
Contributor

cccclai commented May 28, 2025

No problem. Just rebased. Please feel free to let me know if there is any issue. Thank you!

Did you rebase? Seems like there is no new commit.

@chunit-quic chunit-quic force-pushed the dev_rewrite_qconfig branch from b92c102 to 943eb09 on May 28, 2025 04:07
@chunit-quic
Collaborator Author

Oops, sorry, I rebased locally without pushing. Thank you for pointing that out! Done.


@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai
Contributor

cccclai commented May 29, 2025

There seems to be a lint error, and that might be the reason merging fails. Since #11049 has landed, can you apply the corresponding changes and see if the error is gone?

@cccclai cccclai added the release notes: qualcomm Changes to the Qualcomm backend delegate label May 29, 2025
Chun-I Tsai added 2 commits June 2, 2025 08:26
- Add function to rewrite observer after prepare_pt2e
- Add corresponding test case
@chunit-quic chunit-quic force-pushed the dev_rewrite_qconfig branch from 943eb09 to 3caeaab on June 2, 2025 00:42
@chunit-quic
Collaborator Author

There seems to be a lint error, and that might be the reason merging fails. Since #11049 has landed, can you apply the corresponding changes and see if the error is gone?

Fixed. Thanks a lot for pointing out the error!

@facebook-github-bot
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot facebook-github-bot merged commit d5c4ba7 into pytorch:main Jun 2, 2025
96 of 97 checks passed
@kimishpatel
Contributor

Mind sharing a bit more detail about the request? I probably missed it.

No problem. :) We received some feature requests in a mail thread regarding quantization requirements. This particular request is for the following purpose:

The QNN Quantizer should allow users to override quantization parameters for specific tensors, regardless of the data ranges observed during calibration or QAT. This override must respect the transitive closures established by SharedQuantizationSpec.

After a brief discussion with your team, we concluded that the rewriting stage should occur after calibration but before conversion. That's essentially the background.

I don't quite follow why we cannot do this by allowing module-specific customization. For example, can I not say quantizer.set_module_qconfig(....)? And @jerryzh168, I really think we should make this part of the base class. Quantizers that don't implement this will fall back to the base class, which will just raise an error.
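The base-class fallback suggested here might look roughly like the following sketch (`set_module_qconfig` is the name used in the comment; the signature and the QnnQuantizer behavior are illustrative assumptions, not an existing API):

```python
# Sketch of the suggested design: quantizers that do not implement
# per-module qconfig overrides inherit a base-class method that raises.

class Quantizer:
    """Hypothetical base class; subclasses opt in to overrides."""
    def set_module_qconfig(self, module_name, qconfig):
        raise NotImplementedError(
            f"{type(self).__name__} does not support per-module "
            "qconfig overrides"
        )

class QnnQuantizer(Quantizer):
    """Backend that supports overrides records them for later use."""
    def __init__(self):
        self._module_qconfigs = {}

    def set_module_qconfig(self, module_name, qconfig):
        self._module_qconfigs[module_name] = qconfig

# A supporting backend accepts the override; the base class raises.
quantizer = QnnQuantizer()
quantizer.set_module_qconfig("conv1", {"dtype": "int8"})
```

The design choice being debated is whether this override surface should live on the shared Quantizer base class (failing loudly where unsupported) rather than as a backend-specific rewrite pass after prepare_pt2e.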

7 participants