Qualcomm AI Engine Direct - support skip quantization #5070

haowhsu-quic · 2024-09-04T12:01:51Z

Summary:

Utility to skip operator annotation, unskipped nodes will be gathered into submodules and lowered with quantization annotation. Skipped nodes could either fallback to cpu or delegated with HTP fp16.
Fix uplevel breakage.
Refactor & retire some outdated implmentation.

pytorch-bot · 2024-09-04T12:01:54Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5070

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 28ead61 with merge base 083b9e6 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

haowhsu-quic · 2024-09-04T13:48:18Z

Hi @cccclai, this PR provides an approach of keeping designated nodes from quantizer annotation, it's also possible to delegate the unannotated nodes with QNN fp16.
We notice the custom operator generated directly from AIHUB artifact will cause segfault in torch.export.export with 0901 pytorch nightly. Will address it in another PR if possible.

Please have a look, thank you.

cccclai · 2024-09-10T04:51:21Z

@haowhsu-quic hey sorry I miss the PR, could you rebase and I'll merge it?

Summary: - Utility to skip operator annotation, unskipped nodes will be gathered into submodules and lowered with quantization annotation. Skipped nodes could either fallback to cpu or delegated with HTP fp16. - Fix uplevel breakage. - Refactor & retire some outdated implmentation.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 4, 2024

haowhsu-quic force-pushed the dev_skip_annotation branch from 4274084 to 22050a4 Compare September 4, 2024 13:40

haowhsu-quic force-pushed the dev_skip_annotation branch from 22050a4 to 28ead61 Compare September 10, 2024 06:45

cccclai approved these changes Sep 10, 2024

View reviewed changes

cccclai merged commit 43e2f2d into pytorch:main Sep 10, 2024
36 checks passed

haowhsu-quic deleted the dev_skip_annotation branch February 7, 2025 09:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qualcomm AI Engine Direct - support skip quantization #5070

Qualcomm AI Engine Direct - support skip quantization #5070

Uh oh!

haowhsu-quic commented Sep 4, 2024

Uh oh!

pytorch-bot bot commented Sep 4, 2024 •

edited

Loading

Uh oh!

haowhsu-quic commented Sep 4, 2024

Uh oh!

cccclai commented Sep 10, 2024

Uh oh!

Uh oh!

Uh oh!

Qualcomm AI Engine Direct - support skip quantization #5070

Qualcomm AI Engine Direct - support skip quantization #5070

Uh oh!

Conversation

haowhsu-quic commented Sep 4, 2024

Uh oh!

pytorch-bot bot commented Sep 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5070

✅ No Failures

Uh oh!

haowhsu-quic commented Sep 4, 2024

Uh oh!

cccclai commented Sep 10, 2024

Uh oh!

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 4, 2024 •

edited

Loading