
main-interactive-mode: optionally allow for special tokens from user in interactive mode for fill-in-middle etal #7097


Merged · 2 commits · May 10, 2024

Conversation

hanishkvc
Contributor

Add a flag --interactive-specials to examples/main

This controls whether text entered by the user in interactive mode is tokenized with the parse_special flag set or not.

When main is run with this flag, it allows users to enter any special tokens for fill-in-middle or an equivalent mode supported by a given model, so that the model can fill in text at the appropriate location, based on the surrounding context in the user's input.

@ggerganov @teleprint-me

I haven't tested this yet, but logically it should allow for the above-mentioned use case.

@hanishkvc
Contributor Author

hanishkvc commented May 6, 2024

Tested the fill-in-middle sample prompt mentioned for Refact-1.6B.

With the --interactive-specials argument passed to main, it correctly generates a suitable comment, as expected, to be placed in between the user-provided code context in the sample prompt, and the proper token ids are generated for <fim_prefix>, <fim_suffix> and <fim_middle>.

Without the --interactive-specials argument, it generates a somewhat repetitive general note about the code in the sample prompt. As expected, <fim_prefix>, <fim_suffix> and <fim_middle> don't get converted to the proper tokens, because parse_special won't be set when tokenizing.

@hanishkvc hanishkvc changed the title main-interactive-mode: optionally allow for special tokens from user in interactive mode for fill-in etal main-interactive-mode: optionally allow for special tokens from user in interactive mode for fill-in-middle etal May 6, 2024
@mofosyne mofosyne added the labels enhancement (New feature or request) and Review Complexity: Low (Trivial changes to code that most beginner devs, or those who want a break, can tackle, e.g. UI fix) on May 9, 2024
Merge master as of 20240510IST1236 into this branch.

Fix a merge conflict with the newly added conversation flag
in master branch.
@hanishkvc hanishkvc force-pushed the hkvc_chat_interactivespecials branch from b6f2e53 to 9566de9 on May 10, 2024 at 07:15
@hanishkvc
Contributor Author

hanishkvc commented May 10, 2024

Have force-pushed a merge with master. @mofosyne, I think I overwrote your equivalent merge. Sorry, I hadn't noticed that "Allow edits by maintainers" was enabled.

@mofosyne
Collaborator

Not a problem. I would prefer that, since I can't always guess your full intent; you are a more accurate author than I am. But I was hoping to at least make your life easier :)

Collaborator

@mofosyne mofosyne left a comment


Double-checked that llama_tokenize()'s fourth argument is indeed for special/control tokens in the method definitions. Looks right.

Can confirm the code change adds a new flag that enables inserting special tokens into the middle of the token stream.
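For reference, the fourth argument in question is the parse_special parameter of the common tokenize wrapper. The signature below is paraphrased from my reading of the helpers at the time and may not match the current header exactly; the stub types stand in for llama.h:

```cpp
#include <string>
#include <vector>

struct llama_context;      // opaque in the real API
typedef int llama_token;   // stand-in; the real type is an integer token id

// Paraphrased declaration, not verbatim: the last parameter controls
// whether special/control token strings in `text` (e.g. "<fim_prefix>")
// are parsed into their token ids, which is what --interactive-specials
// toggles for user input in interactive mode.
std::vector<llama_token> llama_tokenize(
        const llama_context * ctx,
        const std::string   & text,
        bool add_special,      // add BOS/etc. per model configuration
        bool parse_special);   // parse special-token strings into control tokens
```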

@mofosyne mofosyne merged commit f89fe27 into ggml-org:master May 10, 2024