[Executorch][perf-ci] Fix perf ci #8374
Conversation
Summary: Previous PR #7927 decoupled max_seq_length from the KV cache. That broke the perf CI workflow. Fix that. Test Plan: Trigger it manually and check. Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
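For context on the breakage, here is a minimal conceptual sketch of what the decoupling means. The class and argument names are illustrative assumptions, not the actual ExecuTorch implementation: the point is only that after #7927 the KV cache capacity is driven by a context length that is separate from max_seq_length, so export paths (such as the ones the perf CI invokes) need to supply both values.

```python
# Illustrative sketch only -- not the actual ExecuTorch code.
# After the decoupling, the KV cache is sized by a context length
# that is independent of max_seq_length.
import torch


class StaticKVCache(torch.nn.Module):
    def __init__(self, max_context_length: int, n_heads: int, head_dim: int):
        super().__init__()
        # Cache capacity comes from max_context_length, not max_seq_length.
        self.register_buffer(
            "k_cache", torch.zeros(1, n_heads, max_context_length, head_dim)
        )
        self.register_buffer(
            "v_cache", torch.zeros(1, n_heads, max_context_length, head_dim)
        )

    def update(self, start_pos: int, k: torch.Tensor, v: torch.Tensor):
        # Write the new key/value slices at the current position.
        seq_len = k.shape[2]
        self.k_cache[:, :, start_pos:start_pos + seq_len] = k
        self.v_cache[:, :, start_pos:start_pos + seq_len] = v
        return self.k_cache, self.v_cache
```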
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8374
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure as of commit 553d875 with merge base 78752a0.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
The linked job in the PR summary doesn't run with SpinQuant and QLoRA. You need to trigger the job using the model IDs on Hugging Face:
Also the README pages? https://github.com/pytorch/executorch/blob/main/examples/models/llama/README.md
Let me do this in a follow-up PR. Actually, let me just do it here.
What does this mean? Is there a description of how to trigger this? I followed the steps here: https://github.com/pytorch/executorch/tree/main/extension/benchmark
Summary: Previous PR #7927 decoupled max_seq_length from the KV cache. That broke the perf CI workflow. Fix that. Test Plan: Trigger it manually and check: apple perf: https://github.com/pytorch/executorch/actions/runs/13267110949 android perf: https://github.com/pytorch/executorch/actions/runs/13267110908 Reviewers: Subscribers: Tasks: Tags: cc guangy10 huydhn kirklandsign shoumikhin [ghstack-poisoned]
This is updated. But I think I am going to have to do one more round of scrubbing in subsequent PRs for the various incarnations of Llama.
You need to specify the models you want to benchmark against explicitly, separated by ",". In this case, they are "meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8,meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8". See the screenshot for an example. Updated the screenshot. You need to run against your branch, not on main.
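For anyone who prefers not to use the web UI, here is a hedged sketch of dispatching the on-demand benchmark workflow through the GitHub REST API. The workflow file name (android-perf.yml), the "models" input name, and the branch name are assumptions drawn from this thread, not verified against the repository; the endpoint itself is the standard GitHub Actions workflow_dispatch API.

```python
# Sketch: manually dispatching the on-demand benchmark workflow.
# Assumptions (not verified): workflow file "android-perf.yml", input "models",
# and the branch name "my-perf-fix-branch".
import requests

GITHUB_TOKEN = "ghp_..."  # personal access token with workflow scope

resp = requests.post(
    "https://api.github.com/repos/pytorch/executorch/actions/"
    "workflows/android-perf.yml/dispatches",
    headers={
        "Authorization": f"Bearer {GITHUB_TOKEN}",
        "Accept": "application/vnd.github+json",
    },
    json={
        # Run against your branch, not the default branch.
        "ref": "my-perf-fix-branch",
        "inputs": {
            # Comma-separated Hugging Face model IDs, as described above.
            "models": "meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8,"
            "meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8",
        },
    },
    timeout=30,
)
resp.raise_for_status()  # 204 No Content on success
```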
The changes look good to me
Oh, you are right. I forgot about that step.
Stack from ghstack (oldest at bottom):
Summary:
Previous PR #7927 decoupled max_seq_length from the KV cache. That broke
the perf CI workflow. Fix that.
Test Plan:
Trigger it manually and check
apple perf: https://github.com/pytorch/executorch/actions/runs/13267110949
android perf: https://github.com/pytorch/executorch/actions/runs/13267110908
Reviewers:
Subscribers:
Tasks:
Tags:
cc @guangy10 @huydhn @kirklandsign @shoumikhin