Continue on empty line #529

thement · 2023-03-26T14:55:01Z

Do not insert a "newline" token if user inputs empty line. This let's
user to continue the output after she has been asked by reverse prompt
for more data. Otherwise an empty-line input would insert a "newline"
token which would break the flow of the conversation.

Do not insert a "newline" token if user inputs empty line. This let's user to continue the output after she has been asked by reverse prompt for more data. Otherwise an empty-line input would insert a "newline" token which would break the flow of the conversation.

anzz1

This is a good change, however it needs some changes to work with the instruct mode.
As currently you wrap the suffix "### Response:" inside the buffer.length check but not the prefix "### Instruction:", breaking the model.
The prefix needs to be wrapped too. Something like this:

Line 437:

                if (params.instruct) {
                    printf("\n> ");
                }

Line 468:

                // Add tokens to buffer only if the line is non-empty.
                // This let's the user make the chat continue if it was stopped
                // on a reverse prompt.
                if (buffer.length() > 1) {
                    if (params.instruct) {
                        n_consumed = embd_inp.size();
                        embd_inp.insert(embd_inp.end(), inp_pfx.begin(), inp_pfx.end());
                    }

                    auto line_inp = ::llama_tokenize(ctx, buffer, false);
                    embd_inp.insert(embd_inp.end(), line_inp.begin(), line_inp.end());

                    if (params.instruct) {
                        embd_inp.insert(embd_inp.end(), inp_sfx.begin(), inp_sfx.end());
                    }

                    n_remain -= line_inp.size();
               }

Also replace the tabs on L480 with spaces

edit

or actually should it be:

Line 437:

                if (params.instruct) {
                    n_consumed = embd_inp.size();
                    printf("\n> ");
                }

Line 468:

                // Add tokens to buffer only if the line is non-empty.
                // This let's the user make the chat continue if it was stopped
                // on a reverse prompt.
                if (buffer.length() > 1) {
                    if (params.instruct) {
                        embd_inp.insert(embd_inp.end(), inp_pfx.begin(), inp_pfx.end());
                    }

                    auto line_inp = ::llama_tokenize(ctx, buffer, false);
                    embd_inp.insert(embd_inp.end(), line_inp.begin(), line_inp.end());

                    if (params.instruct) {
                        embd_inp.insert(embd_inp.end(), inp_sfx.begin(), inp_sfx.end());
                    }

                    n_remain -= line_inp.size();
               }

I'm not 100% certain of the n_consumed part which place is the right for it. Should test for both.

ping @blackhole89 @ggerganov Requesting backup 😄

thement · 2023-03-26T19:18:07Z

This is a good change, however it needs some changes to work with the instruct mode.

I wasn't aware of instruct mode. I'll look into it and update the pull request.

rabidcopy · 2023-03-26T20:59:15Z

This is a good change, however it needs some changes to work with the instruct mode.

I wasn't aware of instruct mode. I'll look into it and update the pull request.

Apparently @ggerganov is looking to move instruct mode out of main.cpp and as its own thing according to #508. May be worth waiting for progress on that or any code on this pertaining to instruct mode may be for naught.

thement · 2023-03-26T21:00:29Z

This is a good change, however it needs some changes to work with the instruct mode.

I wasn't aware of instruct mode. I'll look into it and update the pull request.

Apparently @ggerganov is looking to move instruct mode out of main.cpp and as its own thing according to #508. May be worth waiting for progress on that or any code on this pertaining to instruct mode may be for naught.

Very well. Let's close it for now then.

anzz1 · 2023-03-26T22:48:43Z

Hmmm the change is so small and the work is already done so why not just merge it? The only thing need to know anymore is the whether the option 1 or option 2 above for the placement of n_consumed is right, and thats why I pinged @blackhole89 since he originally made the instruct option and probably knows the answer on spot. The work would transfer over to the instruct implementation then when its moved.

rabidcopy · 2023-03-27T00:28:01Z

Ah, sorry. Didn't mean nudge this down. Above is right. The change is still good as far as I can tell.

anzz1 · 2023-03-27T05:50:47Z

I'll reopen this so that whichever happens first (this gets merged or instruct mode moved to another file), that in any case it doesnt get forgotten.

anzz1 · 2023-03-27T13:34:53Z

~~Will be implemented in #555 , closing.~~

Merged 7b8dbcb (#571)

thement marked this pull request as ready for review March 26, 2023 14:57

anzz1 self-requested a review March 26, 2023 15:51

anzz1 suggested changes Mar 26, 2023

View reviewed changes

anzz1 requested review from blackhole89 and ggerganov March 26, 2023 16:25

thement marked this pull request as draft March 26, 2023 19:18

thement closed this Mar 26, 2023

anzz1 reopened this Mar 27, 2023

anzz1 added enhancement New feature or request generation quality Quality of model output labels Mar 27, 2023

anzz1 closed this Mar 27, 2023

ScarletEmerald mentioned this pull request Mar 29, 2023

User should be able to return control without inserting a newline #587

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Continue on empty line #529

Continue on empty line #529

Uh oh!

thement commented Mar 26, 2023

Uh oh!

anzz1 left a comment •

edited

Loading

Uh oh!

thement commented Mar 26, 2023

Uh oh!

rabidcopy commented Mar 26, 2023 •

edited

Loading

Uh oh!

thement commented Mar 26, 2023

Uh oh!

anzz1 commented Mar 26, 2023 •

edited

Loading

Uh oh!

rabidcopy commented Mar 27, 2023

Uh oh!

anzz1 commented Mar 27, 2023

Uh oh!

anzz1 commented Mar 27, 2023 •

edited

Loading

Uh oh!

Uh oh!

Continue on empty line #529

Continue on empty line #529

Uh oh!

Conversation

thement commented Mar 26, 2023

Uh oh!

anzz1 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thement commented Mar 26, 2023

Uh oh!

rabidcopy commented Mar 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thement commented Mar 26, 2023

Uh oh!

anzz1 commented Mar 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rabidcopy commented Mar 27, 2023

Uh oh!

anzz1 commented Mar 27, 2023

Uh oh!

anzz1 commented Mar 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

anzz1 left a comment •

edited

Loading

rabidcopy commented Mar 26, 2023 •

edited

Loading

anzz1 commented Mar 26, 2023 •

edited

Loading

anzz1 commented Mar 27, 2023 •

edited

Loading