loose sdk example verification requirement #906

Gasoonjia · 2023-10-13T20:12:45Z

Summary: previous requirement is too tight for every model.

Differential Revision: D50280300

netlify · 2023-10-13T20:12:51Z

✅ Deploy Preview for resplendent-gnome-14e531 ready!

Name	Link
🔨 Latest commit	`66788e9`
🔍 Latest deploy log	https://app.netlify.com/sites/resplendent-gnome-14e531/deploys/6529bf91aa3077000757eeb9
😎 Deploy Preview	https://deploy-preview-906--resplendent-gnome-14e531.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot · 2023-10-13T20:12:54Z

This pull request was exported from Phabricator. Differential Revision: D50280300

facebook-github-bot · 2023-10-13T20:59:02Z

This pull request was exported from Phabricator. Differential Revision: D50280300

facebook-github-bot · 2023-10-13T21:04:01Z

This pull request was exported from Phabricator. Differential Revision: D50280300

Summary: Pull Request resolved: pytorch/executorch#880 Update the source of flatten_tree functino from executorch.extension to torch core to support OrderedDict type. Reviewed By: tarun292 Differential Revision: D50245226 fbshipit-source-id: f85b1d1a8711f593155799d0652f9e48b8bbc5c3

Summary: Pull Request resolved: pytorch/executorch#879 1. Remove deprecated profiler stuffs. 2. Add print_output flag for sdk_example_runners. Printing output each scalar per line is sometimes pretty verbose, especially some outputs have thousands or even millions of scalars. Make the print as optional in this diff. Reviewed By: tarun292 Differential Revision: D50246087 fbshipit-source-id: 49c76f0ebcf9ec75ef070e66156b28f70b19dcd9

Summary: Pull Request resolved: pytorch/executorch#906 previous requirement is too tight for every model. Reviewed By: tarun292 Differential Revision: D50280300 fbshipit-source-id: 634ade693ea0e893c26ce705c868748eb4c73ec9

facebook-github-bot · 2023-10-13T22:07:06Z

This pull request was exported from Phabricator. Differential Revision: D50280300

facebook-github-bot · 2023-10-13T23:49:16Z

This pull request has been merged in 5c00a83.

This PR refactors the chat function in generate.py by creating a `Generator` class, removing unnecessary global variables and simplifying the code structure. Tokens and metrics are now yielded by the Generator rather than being printed directly to stdout, making it easier to re-use this code for non-CLI tools. **Tests:** Generate ``` python3 torchchat.py generate stories15M --prompt "Once upon a time," Using device=mps Loading model... Time to load model: 0.46 seconds ----------------------------------------------------------- Once upon a time, there was a little girl named Lily. One day, she went to the park with her mom. They saw a big tree with lots of pears on it. Lily wanted to eat a pear, but they were too high up. She tried to jump, but she couldn't reach them. Then, a boy came and took a pear from the tree. Lily was surprised! She thought the boy would be mean, but he was harmless and didn't mean to hurt her. She asked him if she could have a pear too. The boy said yes and gave her a pear. Lily was very happy and thanked the boy for sharing. They sat under the tree and ate the pear together. It was a good day at the park. Once upon a time, there was a little girl named Lily. She loved to play in the garden with her mommy. One day, they were planting some Time for inference 1: 3.55 sec total, time to first token 0.00 sec with parallel prefill, 199 tokens, 56.01 tokens/sec, 17.85 ms/token Bandwidth achieved: 2.73 GB/s *** This first iteration will include cold start effects for dynamic import, hardware caches. *** ======================================== Average tokens/sec: 56.01 ======================================== Average tokens/sec: 119.35 Memory used: 0.00 GB ``` Eval ``` python3 torchchat.py eval stories15M --tasks wikitext --limit 10 NumExpr defaulting to 10 threads. PyTorch version 2.5.0.dev20240629 available. Using device=mps Loading model... Time to load model: 0.38 seconds ----------------------------------------------------------- Using device 'mps' [Task: wikitext] metric word_perplexity is defined, but aggregation is not. using default aggregation=weighted_perplexity [Task: wikitext] metric word_perplexity is defined, but higher_is_better is not. using default higher_is_better=False [Task: wikitext] metric byte_perplexity is defined, but aggregation is not. using default aggregation=weighted_perplexity [Task: wikitext] metric byte_perplexity is defined, but higher_is_better is not. using default higher_is_better=False [Task: wikitext] metric bits_per_byte is defined, but aggregation is not. using default aggregation=bits_per_byte [Task: wikitext] metric bits_per_byte is defined, but higher_is_better is not. using default higher_is_better=False Repo card metadata block was not found. Setting CardData to empty. Repo card metadata block was not found. Setting CardData to empty. Building contexts for wikitext on rank 0... 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 10/10 [00:00<00:00, 541.59it/s] Running loglikelihood_rolling requests 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 10/10 [00:04<00:00, 2.25it/s] Time to run eval: 22.57s. Time in model.forward: 1.01s, over 33 model evaluations forward run time stats - Median: 0.02s Min: 0.01s Max: 0.27s For model /Users/puri/.torchchat/model-cache/stories15M/stories15M.pt wikitext: word_perplexity,none: 47350.8811 byte_perplexity,none: 7.7811 bits_per_byte,none: 2.9600 alias: wikitext ``` Co-authored-by: vmpuri <[email protected]>

Remove the existing browser UI and replace it with a UI built with Streamlit. This reduces complexity & leverages the functionality introduced in PR #906 to display chunked responses. **Testing** ``` streamlit run torchchat.py -- browser stories110M --compile --max-new-tokens 256 You can now view your Streamlit app in your browser. Local URL: http://localhost:8501 Network URL: http://192.0.0.2:8501 ``` <img width="1002" alt="image" src="https://github.com/user-attachments/assets/df305943-2326-4d01-a48b-61dd2006fa28">

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 13, 2023

facebook-github-bot added the fb-exported label Oct 13, 2023

Gasoonjia force-pushed the export-D50280300 branch from e228e68 to 17d32b7 Compare October 13, 2023 20:59

Gasoonjia force-pushed the export-D50280300 branch from 17d32b7 to 5b68669 Compare October 13, 2023 21:04

Gasoonjia added 3 commits October 13, 2023 15:05

loose sdk example verification requirement (#906)

66788e9

Summary: Pull Request resolved: pytorch/executorch#906 previous requirement is too tight for every model. Reviewed By: tarun292 Differential Revision: D50280300 fbshipit-source-id: 634ade693ea0e893c26ce705c868748eb4c73ec9

Gasoonjia force-pushed the export-D50280300 branch from 5b68669 to 66788e9 Compare October 13, 2023 22:07

facebook-github-bot closed this in 5c00a83 Oct 13, 2023

facebook-github-bot added the Merged label Oct 13, 2023

dbort mentioned this pull request Oct 14, 2023

[release/0.1] Cherrypick docs-only commits from main #923

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

loose sdk example verification requirement #906

loose sdk example verification requirement #906

Uh oh!

Gasoonjia commented Oct 13, 2023

Uh oh!

netlify bot commented Oct 13, 2023 •

edited

Loading

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

Uh oh!

loose sdk example verification requirement #906

loose sdk example verification requirement #906

Uh oh!

Conversation

Gasoonjia commented Oct 13, 2023

Uh oh!

netlify bot commented Oct 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for resplendent-gnome-14e531 ready!

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

facebook-github-bot commented Oct 13, 2023

Uh oh!

Uh oh!

netlify bot commented Oct 13, 2023 •

edited

Loading