Skip to content

Qualcomm AI Engine Direct - Refine max spill fill buffer setting #5989

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed

Conversation

cccclai
Copy link
Contributor

@cccclai cccclai commented Oct 8, 2024

  • Get required spillFillBufferSize from context binary and set to compiler_spec
  • Quantize embedding op in qnn.
  • If enable multi-contexts, maxSpillFillBuffer could not set to zero.

Copy link

pytorch-bot bot commented Oct 8, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5989

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 01afb5e with merge base f0112a2 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 8, 2024
@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai cccclai mentioned this pull request Oct 8, 2024
@cccclai cccclai force-pushed the refine_max_spill_fill_buffer_setting branch from 34e6fd6 to 8d4175a Compare October 8, 2024 18:59
@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai cccclai force-pushed the refine_max_spill_fill_buffer_setting branch from 8d4175a to bf6ccb4 Compare October 8, 2024 19:36
@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

- Get required spillFillBufferSize from context binary and set to compiler_spec
- Quantize embedding op in qnn.
- If enable multi-contexts, maxSpillFillBuffer could not set to zero.
@cccclai cccclai force-pushed the refine_max_spill_fill_buffer_setting branch from bf6ccb4 to 01afb5e Compare October 8, 2024 22:06
@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

dbort pushed a commit to dbort/executorch that referenced this pull request Oct 8, 2024
…orch#5989)

Summary:
- Get required spillFillBufferSize from context binary and set to compiler_spec
- Quantize embedding op in qnn.
- If enable multi-contexts, maxSpillFillBuffer could not set to zero.

Pull Request resolved: pytorch#5989

Reviewed By: kirklandsign

Differential Revision: D64056107

Pulled By: cccclai
@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@cccclai merged this pull request in 01fcdf4.

@cccclai
Copy link
Contributor Author

cccclai commented Oct 9, 2024

@pytorchbot cherry-pick --onto release/0.4 -c critical

pytorchbot pushed a commit that referenced this pull request Oct 9, 2024
Summary:
- Get required spillFillBufferSize from context binary and set to compiler_spec
- Quantize embedding op in qnn.
- If enable multi-contexts, maxSpillFillBuffer could not set to zero.

Pull Request resolved: #5989

Reviewed By: kirklandsign

Differential Revision: D64056107

Pulled By: cccclai

fbshipit-source-id: 9f9846e6ac7b4a27d734d2812ac3bbad32fb194f
(cherry picked from commit 01fcdf4)
@pytorchbot
Copy link
Collaborator

Cherry picking #5989

The cherry pick PR is at #6041 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:

Details for Dev Infra team Raised by workflow job

jackzhxng pushed a commit that referenced this pull request Oct 9, 2024
Qualcomm AI Engine Direct - Refine max spill fill buffer setting (#5989)

Summary:
- Get required spillFillBufferSize from context binary and set to compiler_spec
- Quantize embedding op in qnn.
- If enable multi-contexts, maxSpillFillBuffer could not set to zero.

Pull Request resolved: #5989

Reviewed By: kirklandsign

Differential Revision: D64056107

Pulled By: cccclai

fbshipit-source-id: 9f9846e6ac7b4a27d734d2812ac3bbad32fb194f
(cherry picked from commit 01fcdf4)

Co-authored-by: Sheng Feng Wu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants