Qualcomm AI Engine Direct - FbNet enablement #2706
Conversation
chunit-quic
commented
Mar 27, 2024
- Add test cases
- Fix compile error
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2706
Note: links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV. If your PR is affected, please view it below.
✅ No Failures as of commit 3de82d5 with merge base 4111b3f.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@@ -39,6 +39,9 @@ def create_device_inputs(example_inputs, use_kv_cache):

if __name__ == "__main__":
    print(
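The hunk above is truncated at the `print(` call; judging from the discussion that follows, it adds a work-in-progress notice to the script's `__main__` entry point. A minimal sketch of such a notice (the helper name and message wording are assumptions, not the actual patch):

```python
# Hypothetical sketch of a work-in-progress notice printed when the script
# is run directly; the function name and wording are assumptions, not the
# actual patch in this PR.
import warnings


def wip_notice() -> str:
    # Summarize the known limitations discussed in the review below.
    return (
        "dummy_llama2 is still a work in progress: "
        "--ptq 8a8w output shapes may differ from earlier releases, "
        "and --ptq 16a4w export currently fails."
    )


if __name__ == "__main__":
    warnings.warn(wip_notice())
```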
did you run into any issue with the script?
I tested it last week and it seemed OK.
Hi @cccclai,
We found some non-ideal behavior in our CI. For the following reasons, we think it's better to have this warning:
- In the 8a8w case, the output shape seems to be different from what it was before:
python dummy_llama2.py --ptq 8a8w ...
- In the 16a4w case, it even fails to export now:
python dummy_llama2.py --ptq 16a4w ...
- It prevents too many issues from being filed, because users might want to try it while we are still working on some of its components.

> I tested it last week and it seemed OK.

Would you mind sharing your command, please? We can then reproduce it and find the difference. Thanks! :D
Ah, I take my word back. I just tried exporting the model and saw this error when loading it in the runtime:
[INFO] [Qnn ExecuTorch]: create QNN Logger with log_level 2
[WARNING] [Qnn ExecuTorch]: <W> Initializing HtpProvider
[WARNING] [Qnn ExecuTorch]: <W> Function not called, PrepareLib isn't loaded!
[INFO] [Qnn ExecuTorch]: Initialize Qnn backend parameters for Qnn executorch backend type 2
[INFO] [Qnn ExecuTorch]: Caching: Caching is in RESTORE MODE.
[WARNING] [Qnn ExecuTorch]: <W> sg_stubPtr is not null, skip loadRemoteSymbols
[ERROR] [Qnn ExecuTorch]: <E> DspTransport.openSession qnn_open failed, 0x80000406
[ERROR] [Qnn ExecuTorch]: <E> IDspTransport: Unable to load lib 0x80000406
[ERROR] [Qnn ExecuTorch]: <E> DspTransport failed,cannot open session, error 0x00000009
[ERROR] [Qnn ExecuTorch]: <E> Unable to load Skel Library. transportStatus: 9
[ERROR] [Qnn ExecuTorch]: <E> Failed to retrieve skel build id: err: 1008
[ERROR] [Qnn ExecuTorch]: <E> Failed to create transport for device, error: 1008
[ERROR] [Qnn ExecuTorch]: <E> Failed to load skel, error: 1008
[ERROR] [Qnn ExecuTorch]: <E> Transport layer setup failed: 1008
[ERROR] [Qnn ExecuTorch]: <E> Failed to parse default platform info: 1008
[ERROR] [Qnn ExecuTorch]: <E> Failed to load default platform info: 1008
[ERROR] [Qnn ExecuTorch]: <E> Failed to parse platform config: 1008
[ERROR] [Qnn ExecuTorch]: Failed to create device_handle for Backend ID 6, error=1008
E 00:00:00.245462 executorch:QnnManager.cpp:154] Fail to configure Qnn device
E 00:00:00.245471 executorch:QnnExecuTorchBackend.cpp:54] Fail to initialize Qnn Manager
E 00:00:00.245478 executorch:method.cpp:106] Init failed for backend QnnBackend: 0x1
F 00:00:00.245497 executorch:qnn_executor_runner.cpp:215] In function main(), assert failed (method.ok()): Loading of method forward failed with status 0x1
Aborted
Any chance you know the reason?
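As an aside, logs in this shape can be triaged mechanically. A minimal sketch (the helper name is an assumption) that pulls the status codes out of the `[ERROR]` lines of output like the above:

```python
# Sketch of a small triage helper for QNN ExecuTorch logs; the function
# name is an assumption for illustration, not part of the project.
import re


def qnn_error_codes(log: str) -> list[str]:
    """Collect hex codes (e.g. 0x80000406) and decimal 'error: NNNN'
    codes from [ERROR] lines of a QNN ExecuTorch log."""
    codes = []
    for line in log.splitlines():
        if "[ERROR]" not in line:
            continue
        codes += re.findall(r"0x[0-9A-Fa-f]+", line)
        codes += re.findall(r"error[:=]?\s*(\d+)", line)
    return codes


sample = (
    "[ERROR] [Qnn ExecuTorch]: <E> IDspTransport: Unable to load lib 0x80000406\n"
    "[ERROR] [Qnn ExecuTorch]: <E> Failed to load skel, error: 1008\n"
)
```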
Oh, also: I think the code change in llama_transformer.py might be the culprit for the issue you saw.
Actually, the error message might be specific to me, because I only have an SM8450. Just opened an issue here: #2788
> Oh, also: I think the code change in llama_transformer.py might be the culprit for the issue you saw.

Thank you for pointing out the possibility. We will investigate it later.

> Actually, the error message might be specific to me, because I only have an SM8450. Just opened an issue here: #2788

We will find an SM8450 device and try to reproduce it. Once we have any news, we will reply in issue #2788. Thank you for the report.
May I ask which device you've been using? Is it an SM8450?
No, I usually work on an SM8550. I haven't even tested on an SM8450 device personally.
@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@@ -71,7 +71,6 @@ if [ "$BUILD_AARCH64" = true ]; then
    -DCMAKE_INSTALL_PREFIX=$BUILD_ROOT \
    -DEXECUTORCH_BUILD_QNN=ON \
    -DEXECUTORCH_BUILD_SDK=ON \
    -DFLATCC_TEST=OFF \
Any specific reason we turn it on? I guess I didn't realize it was OFF before
We explicitly turned it OFF before. Since PR 2466 recently turned it off by default, we don't need to set it again here.
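For context, a sketch of the relevant part of the aarch64 build configuration with the now-redundant flag, based on the hunk quoted above (this is illustrative, not the full build script):

```shell
# Illustrative fragment of the aarch64 CMake configuration. After PR 2466
# made FLATCC_TEST default to OFF, the explicit -DFLATCC_TEST=OFF entry
# became redundant and could be dropped, as this PR does.
cmake .. \
    -DCMAKE_INSTALL_PREFIX=$BUILD_ROOT \
    -DEXECUTORCH_BUILD_QNN=ON \
    -DEXECUTORCH_BUILD_SDK=ON \
    -DFLATCC_TEST=OFF   # redundant: OFF is now the default
```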