Server: Handle n_keep parameter in the request #6174

jkarthic · 2024-03-20T09:59:19Z

No description provided.

phymbert · 2024-03-20T10:13:46Z

examples/server/utils.hpp

@@ -371,6 +371,7 @@ static json oaicompat_completion_params_parse(
    llama_params["repeat_last_n"]     = json_value(body,   "repeat_last_n",     default_sparams.penalty_last_n);
    llama_params["ignore_eos"]        = json_value(body,   "ignore_eos",        false);
    llama_params["tfs_z"]             = json_value(body,   "tfs_z",             default_sparams.tfs_z);
+    llama_params["n_keep"]            = json_value(body,   "n_keep",            0);


Hello, thanks but @ggerganov @ngxson I worry this is actually not OAI compatible ?

we can consider it as an "extension" to OAI, for example tfs_z or mirostat that we're having, they are not available on OAI.

In fact this code is duplicated to the one inside launch_slot_with_task, I planned to refactor all of OAI-related logic to one place, maybe I'll do this during weekend.

ngxson

LGTM. It's quite surprise to know that server does not have --n-keep argument, maybe we need to add that in the future.

Server: Handle n_keep parameter in the request

3e67baa

phymbert reviewed Mar 20, 2024

View reviewed changes

ngxson approved these changes Mar 20, 2024

View reviewed changes

phymbert approved these changes Mar 20, 2024

View reviewed changes

phymbert merged commit 47cc7a7 into ggml-org:master Mar 20, 2024

jkarthic deleted the server_n_keep branch March 20, 2024 13:26

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

Server: Handle n_keep parameter in the request (ggml-org#6174)

ff1279c

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 3, 2024

Server: Handle n_keep parameter in the request (ggml-org#6174)

786af84

tybalex pushed a commit to rubra-ai/tools.cpp that referenced this pull request Apr 17, 2024

Server: Handle n_keep parameter in the request (ggml-org#6174)

f268226

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Server: Handle n_keep parameter in the request #6174

Server: Handle n_keep parameter in the request #6174

Uh oh!

jkarthic commented Mar 20, 2024

Uh oh!

phymbert Mar 20, 2024

Uh oh!

ngxson Mar 20, 2024

Uh oh!

ngxson Mar 20, 2024 •

edited

Loading

Uh oh!

ngxson left a comment

Uh oh!

Uh oh!

Server: Handle n_keep parameter in the request #6174

Server: Handle n_keep parameter in the request #6174

Uh oh!

Conversation

jkarthic commented Mar 20, 2024

Uh oh!

phymbert Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

ngxson Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

ngxson Mar 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ngxson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ngxson Mar 20, 2024 •

edited

Loading