[LLava] Fix stats for C++ runner #5147
Merged
Before (timings were never recorded, so elapsed times report as 0.000000 and rates as `inf`):
I 00:00:28.414816 executorch:stats.h:84] Prompt Tokens: 616 Generated Tokens: 33
I 00:00:28.414826 executorch:stats.h:90] Model Load Time: 9.244000 (seconds)
I 00:00:28.414835 executorch:stats.h:100] Total inference time: 0.000000 (seconds) Rate: inf (tokens/second)
I 00:00:28.414838 executorch:stats.h:108] Prompt evaluation: 0.000000 (seconds) Rate: inf (tokens/second)
I 00:00:28.414839 executorch:stats.h:119] Generated 33 tokens: 0.000000 (seconds) Rate: inf (tokens/second)
I 00:00:28.414841 executorch:stats.h:127] Time to first generated token: 0.000000 (seconds)
I 00:00:28.414842 executorch:stats.h:134] Sampling time over 649 tokens: 0.002000 (seconds)
After, with a real image on M1:
I 00:00:34.231017 executorch:stats.h:84] Prompt Tokens: 616 Generated Tokens: 33
I 00:00:34.231028 executorch:stats.h:90] Model Load Time: 9.108000 (seconds)
I 00:00:34.231038 executorch:stats.h:100] Total inference time: 25.103000 (seconds) Rate: 1.314584 (tokens/second)
I 00:00:34.231040 executorch:stats.h:108] Prompt evaluation: 11.544000 (seconds) Rate: 53.361053 (tokens/second)
I 00:00:34.231042 executorch:stats.h:119] Generated 33 tokens: 13.559000 (seconds) Rate: 2.433808 (tokens/second)
I 00:00:34.231043 executorch:stats.h:127] Time to first generated token: 11.544000 (seconds)
I 00:00:34.231045 executorch:stats.h:134] Sampling time over 649 tokens: 0.000000 (seconds)
After, with a bogus image (same dims) on an Android S23:
I 00:00:34.649120 executorch:stats.h:84] Prompt Tokens: 616 Generated Tokens: 33
I 00:00:34.649128 executorch:stats.h:90] Model Load Time: 12.337000 (seconds)
I 00:00:34.649169 executorch:stats.h:100] Total inference time: 22.301000 (seconds) Rate: 1.479754 (tokens/second)
I 00:00:34.649174 executorch:stats.h:108] Prompt evaluation: 17.964000 (seconds) Rate: 34.290804 (tokens/second)
I 00:00:34.649179 executorch:stats.h:119] Generated 33 tokens: 4.337000 (seconds) Rate: 7.608946 (tokens/second)
I 00:00:34.649183 executorch:stats.h:127] Time to first generated token: 17.964000 (seconds)
I 00:00:34.649186 executorch:stats.h:134] Sampling time over 649 tokens: 0.001000 (seconds)