[llama runner] add xnnpack backend #2585

Closed
wants to merge 13 commits into from

@mcr229 (Contributor) commented Mar 22, 2024

Adds support for OSS Dynamically Quantized (DQ) Linear via XNNPACK for llama models.

This now works with CMake on non-Mac devices. Additionally, we modified the test-llama-runner-linux workflow and script to test exporting and running via XNNPACK DQ Linear.
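
For readers unfamiliar with the term, the idea behind a dynamically quantized linear layer can be sketched in a few lines. This is an illustrative pure-Python sketch only, not the XNNPACK or ExecuTorch implementation: the helper names `quantize_symmetric` and `dq_linear` are hypothetical, and the symmetric int8 scheme with per-output-row weight scales is an assumption, since this PR does not state the bit widths or quantization scheme used.

```python
def quantize_symmetric(values, num_bits=8):
    """Quantize a list of floats to signed integers with a single scale.

    Assumption: symmetric quantization (zero point fixed at 0).
    """
    qmax = 2 ** (num_bits - 1) - 1  # 127 for int8
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def dq_linear(x, w_q, w_scales):
    """Compute y = W @ x with pre-quantized weights.

    x        : list of floats (one activation vector)
    w_q      : list of rows of ints (weights quantized ahead of time)
    w_scales : one float scale per weight row
    """
    # "Dynamic": the activation scale is computed at run time from x itself.
    x_q, x_scale = quantize_symmetric(x)
    out = []
    for row, w_scale in zip(w_q, w_scales):
        acc = sum(a * b for a, b in zip(x_q, row))  # integer accumulation
        out.append(acc * x_scale * w_scale)         # rescale back to float
    return out


# Weights are quantized once, offline.
weights = [[0.5, -1.0, 0.25], [1.5, 0.0, -0.75]]
quantized = [quantize_symmetric(row) for row in weights]
w_q = [q for q, _ in quantized]
w_scales = [s for _, s in quantized]

x = [1.0, 2.0, -3.0]
approx = dq_linear(x, w_q, w_scales)
exact = [sum(a * b for a, b in zip(x, row)) for row in weights]
```

Comparing `approx` with `exact` shows the small rounding error that quantization introduces. The weights are quantized once ahead of time, while the activations are quantized per call, which is what distinguishes dynamic from static quantization.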

pytorch-bot bot commented Mar 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2585

Note: Links to docs will display an error until the docs builds have been completed.

❌ 10 New Failures, 1 Unrelated Failure

As of commit 1b99d29 with merge base 24fe99c:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following job failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label Mar 22, 2024
@facebook-github-bot (Contributor)

@mcr229 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@mcr229 changed the base branch from main to llama2_xnnpack March 22, 2024 23:16
@mcr229 changed the base branch from llama2_xnnpack to main March 22, 2024 23:16
@facebook-github-bot (Contributor)

@mcr229 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor)

@mcr229 merged this pull request in 725c590.

dbort added a commit to dbort/executorch that referenced this pull request Jun 17, 2024
The only reference to this was removed in pytorch#2585

Test Plan:
CI
facebook-github-bot pushed a commit that referenced this pull request Jun 17, 2024
Summary:
The only reference to this was removed in #2585

Pull Request resolved: #3999

Test Plan:
- CI
- `./install_requirements.sh --pybind xnnpack` built successfully

Reviewed By: mcr229

Differential Revision: D58686093

Pulled By: dbort

fbshipit-source-id: b0e7c7092ea47468d6a5cd8c241f53a1ee4bf301
@mcr229 deleted the xnnpack_llama_runner branch March 24, 2025 17:14