add proper calibration to pt2e flow #4452

cccclai · 2024-07-29T22:43:21Z

Stack from ghstack (oldest at bottom):

Differential Revision: D60419364

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

pytorch-bot · 2024-07-29T22:43:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4452

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 8 New Failures

As of commit 290ca5d with merge base a743a3b ():

NEW FAILURES - The following jobs have failed:

pull / test-export-llava-linux / linux-job (gh)
RuntimeError: Command docker exec -t ea83bd7c92622e0bbab46a51f59ad7547b7a10faba824abe3d13296aecbfd37a /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, buck2, portable) / linux-job (gh)
RuntimeError: Command docker exec -t 2f2326281ca7d4b0660b9f7414c3b5fea918e275e722861497bdd738f5fa8a0d /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, buck2, xnnpack+custom) / linux-job (gh)
RuntimeError: Command docker exec -t 7afaf45d19eec471e0d4f350e1ffc64e7b3ae562b5cce5b2043173c1e156d4bc /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, buck2, xnnpack+custom+qe) / linux-job (gh)
RuntimeError: Command docker exec -t a85f715c81b66c9a59edb2be53c5a0a8ec297dec1bfb68daba68a0f523794c61 /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, cmake, portable) / linux-job (gh)
RuntimeError: Command docker exec -t 446e700165b43d9133e40130a0d3f4a5de07e0026e2e49a813ce5c3a02d44f22 /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, cmake, xnnpack+custom) / linux-job (gh)
RuntimeError: Command docker exec -t 1c2aa6ef70387e016859837af8b769684393504c8357509648b05451bbb15750 /exec failed with exit code 1
pull / test-llama-runner-linux (fp32, cmake, xnnpack+custom+qe) / linux-job (gh)
RuntimeError: Command docker exec -t a27fa14c7e2e0bfcff9b5990f13314ce0bb34fc6372916deebce65edab0d0e7f /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 2

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) ghstack-source-id: 235717228 Pull Request resolved: #4452

facebook-github-bot · 2024-07-29T22:43:40Z

This pull request was exported from Phabricator. Differential Revision: D60419364

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

facebook-github-bot · 2024-07-30T19:25:46Z

This pull request was exported from Phabricator. Differential Revision: D60419364

Pull Request resolved: #4452 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) ghstack-source-id: 235862897

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

facebook-github-bot · 2024-07-30T19:40:41Z

This pull request was exported from Phabricator. Differential Revision: D60419364

Pull Request resolved: #4452 ghstack-source-id: 235865853 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/)

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

facebook-github-bot · 2024-08-01T18:58:58Z

This pull request was exported from Phabricator. Differential Revision: D60419364

Pull Request resolved: #4452 ghstack-source-id: 236234222 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/)

shewu-quic · 2024-08-01T23:00:41Z

extension/llm/export/builder.py

+            )
+
+        tokenizer = get_tokenizer(tokenizer_path)
+        eval_wrapper = EagerEvalWrapper(


Hi @cccclai,
Thanks for this change.
It seems to have a problem when I capture seq_len = 1 in kv_cache mode but calibrate with whole sequence in batch process.
From my understanding, after we capture, some variable will be fixed such as "batch, seqlen, _ = x.shape".
If I am mistaken, please correct me.

Hi thank you for checking out the pr! I pulled some changes from another local patch and haven't tested this pr properly. What you mentioned is true...I vaguelly remember I modified the code there to use kv cache instead, however it was too slow to calibrate (as you observed too)

cccclai · 2024-09-04T22:56:30Z

Create a new pr #5095 instead as this pr wasn't tested properly

add proper calibration to pt2e flow

a5a242c

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 29, 2024

cccclai mentioned this pull request Jul 29, 2024

move get tokenizer to export_llama_lib #4451

Closed

facebook-github-bot added the fb-exported label Jul 29, 2024

cccclai added a commit that referenced this pull request Jul 29, 2024

add proper calibration to pt2e flow

64d2078

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) ghstack-source-id: 235717228 Pull Request resolved: #4452

Update on "add proper calibration to pt2e flow"

9cc9a2c

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

cccclai added a commit that referenced this pull request Jul 30, 2024

add proper calibration to pt2e flow

b3d9006

Pull Request resolved: #4452 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) ghstack-source-id: 235862897

Update on "add proper calibration to pt2e flow"

fb38e87

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

cccclai mentioned this pull request Jul 30, 2024

fix eval llama #4469

Closed

cccclai added a commit that referenced this pull request Jul 30, 2024

add proper calibration to pt2e flow

2fc9001

Pull Request resolved: #4452 ghstack-source-id: 235865853 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/)

Update on "add proper calibration to pt2e flow"

290ca5d

Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/) [ghstack-poisoned]

cccclai added a commit that referenced this pull request Aug 1, 2024

add proper calibration to pt2e flow

69187cd

Pull Request resolved: #4452 ghstack-source-id: 236234222 Differential Revision: [D60419364](https://our.internmc.facebook.com/intern/diff/D60419364/)

shewu-quic reviewed Aug 1, 2024

View reviewed changes

shewu-quic mentioned this pull request Aug 12, 2024

Add stories ci for qnn #4662

Merged

cccclai closed this Sep 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add proper calibration to pt2e flow #4452

add proper calibration to pt2e flow #4452

Uh oh!

cccclai commented Jul 29, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 29, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 29, 2024

Uh oh!

facebook-github-bot commented Jul 30, 2024

Uh oh!

facebook-github-bot commented Jul 30, 2024

Uh oh!

facebook-github-bot commented Aug 1, 2024

Uh oh!

shewu-quic Aug 1, 2024 •

edited

Loading

Uh oh!

cccclai Aug 2, 2024

Uh oh!

cccclai commented Sep 4, 2024

Uh oh!

Uh oh!

add proper calibration to pt2e flow #4452

add proper calibration to pt2e flow #4452

Uh oh!

Conversation

cccclai commented Jul 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4452

❌ 8 New Failures

Uh oh!

facebook-github-bot commented Jul 29, 2024

Uh oh!

facebook-github-bot commented Jul 30, 2024

Uh oh!

facebook-github-bot commented Jul 30, 2024

Uh oh!

facebook-github-bot commented Aug 1, 2024

Uh oh!

shewu-quic Aug 1, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cccclai Aug 2, 2024

Choose a reason for hiding this comment

Uh oh!

cccclai commented Sep 4, 2024

Uh oh!

Uh oh!

cccclai commented Jul 29, 2024 •

edited

Loading

pytorch-bot bot commented Jul 29, 2024 •

edited

Loading

shewu-quic Aug 1, 2024 •

edited

Loading