[Cadence] Add scalar cases for binary ops (add, mul, sub, div) on HiFi #9411
Conversation
Helpful Links: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/9411
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures as of commit b8e3d48 with merge base ea43453.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D71495734
cc @cad-audio @dijopaul I'll merge this to unblock a couple of internal models that showed pretty bad regressions, but these "scalar" cases can likely be optimized further, so I'll leave it to you to assess that. No particular rush; this is such a simple op that compiler vectorization is apparently doing pretty well already (e.g. we've seen one model go from 40M to 123k cycles using mul).
Force-pushed from fb0a6e1 to fed29b4 (Compare)
Summary: As titled. Currently these cases fall through to the unoptimized broadcast call, which is extremely inefficient; a simple loop does much better and can be further optimized later if needed. Example of gains: the mul op goes from 40M to 123k cycles on the 27M ASR encoder. Differential Revision: D71495734
Force-pushed from fed29b4 to cf6497c (Compare)
Force-pushed from cf6497c to c5317e4 (Compare)
Force-pushed from c5317e4 to b8e3d48 (Compare)
Differential Revision: D71495734 Pull Request resolved: pytorch#9411
Summary:
As titled. Currently these cases fall through to the unoptimized broadcast call, which is extremely inefficient; a simple loop does much better and can be further optimized later if needed.
Differential Revision: D71495734
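The change described above amounts to a scalar fast path: when one operand of add/sub/mul/div is a single-element ("scalar") tensor, handle it with a plain loop instead of falling through to the generic broadcast routine. The sketch below is a hypothetical illustration of that idea in plain C++; the names (`BinaryOp`, `binary_op_scalar_rhs`) are invented for this example and are not the actual ExecuTorch or Cadence HiFi kernel API.

```cpp
// Minimal, hypothetical sketch of the scalar fast path described in the summary.
// Names are illustrative only, not the real ExecuTorch/Cadence HiFi kernels.
#include <cstddef>
#include <cstdio>
#include <vector>

enum class BinaryOp { Add, Sub, Mul, Div };

// When the right-hand operand is a single element, a plain loop over the other
// operand avoids the per-element index arithmetic of the generic broadcast path
// and is easy for the compiler to vectorize.
void binary_op_scalar_rhs(BinaryOp op, const float* a, float b, float* out,
                          std::size_t n) {
  switch (op) {
    case BinaryOp::Add:
      for (std::size_t i = 0; i < n; ++i) out[i] = a[i] + b;
      break;
    case BinaryOp::Sub:
      for (std::size_t i = 0; i < n; ++i) out[i] = a[i] - b;
      break;
    case BinaryOp::Mul:
      for (std::size_t i = 0; i < n; ++i) out[i] = a[i] * b;
      break;
    case BinaryOp::Div:
      for (std::size_t i = 0; i < n; ++i) out[i] = a[i] / b;
      break;
  }
}

int main() {
  std::vector<float> a = {1.f, 2.f, 3.f, 4.f};
  std::vector<float> out(a.size());
  // Multiply by a scalar: the case that previously fell through to broadcast.
  binary_op_scalar_rhs(BinaryOp::Mul, a.data(), 2.f, out.data(), a.size());
  for (float v : out) std::printf("%g ", v);  // prints: 2 4 6 8
  std::printf("\n");
  return 0;
}
```

A full kernel would also cover the mirrored case where the scalar is the left operand of the non-commutative ops (sub, div), and the inner loops could later be swapped for vendor-vectorized HiFi primitives, which is the further optimization the review comment above leaves to the HiFi maintainers to assess.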