Clean up CLI output #473

Merged
merged 1 commit on Apr 25, 2024

Conversation

@GregoryComer (Member) commented on Apr 24, 2024

Move a few log messages to debug level, and update the chat messages to a slightly more minimal form.

With changes to chat:

Entering Chat Mode. Will continue chatting back and forth with the language model until the models max context length of 2048 tokens is hit or until the user says /bye
System Prompt [Optional]:
User: Hello llama. Please write a script to print the numbers 1 through 10 in python.
Model: Hello there! As a llama, I'd be delighted to help you with that. Here's a simple script to print the numbers 1 through 10 in Python:

for i in range(1, 11):
    print(i)

Let me explain what this script does:

...
User: /bye
Exiting Chat.

==========
Average tokens/sec: 2.32
Memory used: 0.00 GB

Also, I tested generate stories15m. Should be unchanged:

Using device=cpu Apple M1 Pro
Loading model...
Time to load model: 0.03 seconds
Quantizing the model with: { }
Time to quantize model: 0.00 seconds
Hello, my name is Sue. He is a brave bear. He is always ready to help.
Sue loves to bounce around in the forest. One day, she was bouncing around when she spotted some old, tough honey.
She hopped closer and said, "Oh no! What can I do?"
Suddenly, a bear appeared. He said, "Don't worry, Sue. I can get you some honey!"
Sue was very happy. She said, "Thank you!" The bear smiled, and then said, "Now I can help you eat."
Sue thanked the bear again and went off to bounce around in the forest. She had lots of fun and was so happy that the bear helped her. Once upon a time, there was a little girl named Lily. She loved to play outside in the snow. One day, she went outside to build a snowman.
Max Sequence Length Reached. Ending Conversation.
==========
Average tokens/sec: 238.14
Memory used: 0.00 GB

@facebook-github-bot added the "CLA Signed" label on Apr 24, 2024 (this label is managed by the Meta Open Source bot).
@@ -667,10 +667,10 @@ def callback(x):
tokens_generated = y.size(0) - prompt_length
tokens_sec = tokens_generated / t
aggregate_metrics["tokens_per_sec"].append(tokens_sec)
logging.info(
Contributor:

Hmm, what's wrong with this being an info? I think for generate it's pretty important to see the perf, isn't it?

Member Author:

These messages are printed after every model response, which breaks up the chat session. The metrics are still printed at the info level at the end of the conversation (verified for both chat and generate).
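The pattern described here can be sketched as follows. This is a minimal illustration of the idea, not the actual torchchat code: the function names and the sample numbers are hypothetical, and only the `aggregate_metrics["tokens_per_sec"]` bookkeeping mirrors the diff snippet above. Per-response stats go to debug so they no longer interrupt the chat session, while the averaged summary stays at info.

```python
import logging

logging.basicConfig(level=logging.INFO)

# Hypothetical sketch: mirrors the aggregate_metrics bookkeeping from the
# diff, with per-response logging demoted from info to debug.
aggregate_metrics = {"tokens_per_sec": []}

def log_response_metrics(tokens_generated: int, elapsed_sec: float) -> None:
    tokens_sec = tokens_generated / elapsed_sec
    aggregate_metrics["tokens_per_sec"].append(tokens_sec)
    # Demoted to debug: hidden during normal chat, visible with DEBUG logging.
    logging.debug(
        "Generated %d tokens in %.2f sec (%.2f tokens/sec)",
        tokens_generated, elapsed_sec, tokens_sec,
    )

def log_conversation_summary() -> None:
    # Still emitted at info level once, at the end of the conversation.
    rates = aggregate_metrics["tokens_per_sec"]
    if rates:
        logging.info("Average tokens/sec: %.2f", sum(rates) / len(rates))

# Hypothetical usage: two model responses, then the summary.
log_response_metrics(100, 43.1)
log_response_metrics(120, 50.0)
log_conversation_summary()
```

With the root logger at INFO, only the final average reaches the console; running with the level set to DEBUG restores the per-response lines.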

@GregoryComer GregoryComer merged commit 1d4841f into pytorch:main Apr 25, 2024
malfet pushed a commit that referenced this pull request on Jul 17, 2024