adding run time info to eval and cleaning up output #422

HDCharles · 2024-04-23T20:22:20Z

Summary:

output now includes info on model run time distribution and a cleaned up result output.

also added info to evaluation.md

Test Plan:

python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth
--dtype bfloat16 --device cuda \

Time to run eval: 53.31s.
Time in model.forward: 20.29s, over 186 model evaluations 
forward run time stats - Median: 0.10s Min: 0.04s Max: 2.18s 
For model checkpoints/meta-llama/Llama-2-7b-hf/model.pth 
wikitext:
 word_perplexity,none: 9.1649
 byte_perplexity,none: 1.5133
 bits_per_byte,none: 0.5977
 alias: wikitext

Reviewers:

Subscribers:

Tasks:

Tags:

Summary: output now includes info on model run time distribution and a cleaned up result output. Test Plan: python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --dtype bfloat16 --device cuda \ Time to run eval: 53.31s. Time in model.forward: 20.29s, over 186 model evaluations forward run time stats - Median: 0.10s Min: 0.04s Max: 2.18s For model checkpoints/meta-llama/Llama-2-7b-hf/model.pth wikitext: word_perplexity,none: 9.1649 byte_perplexity,none: 1.5133 bits_per_byte,none: 0.5977 alias: wikitext Reviewers: Subscribers: Tasks: Tags:

Summary: see added content Test Plan: n/a Reviewers: Subscribers: Tasks: Tags:

Summary: removing install instructions Test Plan: Reviewers: Subscribers: Tasks: Tags:

mikekgfb

Thank you!

mikekgfb · 2024-04-25T07:43:19Z

runner-et is a known issue, not related to the present PR.

@dbort @larryliu0820

* adding run time info to eval and cleaning up output Summary: output now includes info on model run time distribution and a cleaned up result output. Test Plan: python eval.py --checkpoint-path checkpoints/$MODEL_REPO/model.pth \ --dtype bfloat16 --device cuda \ Time to run eval: 53.31s. Time in model.forward: 20.29s, over 186 model evaluations forward run time stats - Median: 0.10s Min: 0.04s Max: 2.18s For model checkpoints/meta-llama/Llama-2-7b-hf/model.pth wikitext: word_perplexity,none: 9.1649 byte_perplexity,none: 1.5133 bits_per_byte,none: 0.5977 alias: wikitext Reviewers: Subscribers: Tasks: Tags: * Adding evaluation.md content Summary: see added content Test Plan: n/a Reviewers: Subscribers: Tasks: Tags: * docs update Summary: removing install instructions Test Plan: Reviewers: Subscribers: Tasks: Tags:

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 23, 2024

HDCharles requested review from mikekgfb, malfet and jerryzh168 April 23, 2024 20:22

HDCharles added 2 commits April 24, 2024 13:23

Adding evaluation.md content

83877e6

Summary: see added content Test Plan: n/a Reviewers: Subscribers: Tasks: Tags:

HDCharles force-pushed the 098_eval_output branch from 6c25922 to 83877e6 Compare April 24, 2024 22:16

HDCharles mentioned this pull request Apr 24, 2024

[Release][documentation] torchchat model evaluation #339

Closed

docs update

a7fa195

Summary: removing install instructions Test Plan: Reviewers: Subscribers: Tasks: Tags:

mikekgfb approved these changes Apr 25, 2024

View reviewed changes

mikekgfb merged commit f4f315b into main Apr 25, 2024

mikekgfb deleted the 098_eval_output branch April 25, 2024 07:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adding run time info to eval and cleaning up output #422

adding run time info to eval and cleaning up output #422

Uh oh!

HDCharles commented Apr 23, 2024 •

edited

Loading

Uh oh!

mikekgfb left a comment

Uh oh!

mikekgfb commented Apr 25, 2024

Uh oh!

Uh oh!

adding run time info to eval and cleaning up output #422

adding run time info to eval and cleaning up output #422

Uh oh!

Conversation

HDCharles commented Apr 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikekgfb left a comment

Choose a reason for hiding this comment

Uh oh!

mikekgfb commented Apr 25, 2024

Uh oh!

Uh oh!

HDCharles commented Apr 23, 2024 •

edited

Loading