fix(cli): allow passing n_ctx=0 to openAI API server CLI arguments #1093
What?
This PR updates the OpenAI API server's command-line argument configuration to allow passing `--n_ctx 0`. Currently, the `--n_ctx` parameter has a minimum value of `1`: in `llama_cpp/server/settings.py` you can see the following context CLI parameter configuration: `n_ctx: int = Field(default=2048, ge=1, description="The context size.")`
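A minimal sketch of the intended change, relaxing the lower bound so that `0` is accepted (the surrounding settings class and the updated description text are illustrative, not the exact code in this PR):

```python
# llama_cpp/server/settings.py -- sketch of the relaxed constraint
from pydantic import Field
from pydantic_settings import BaseSettings


class ModelSettings(BaseSettings):  # class name is illustrative
    # Before: ge=1 rejected `--n_ctx 0` at argument-parsing time.
    # After: ge=0 lets 0 through, so Llama can infer the context size
    # from the model's n_ctx_train (see "Why?" below).
    n_ctx: int = Field(
        default=2048,
        ge=0,
        description="The context size. A value of 0 uses the model's n_ctx_train.",
    )
```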
Why?
As of `[email protected]`, per #1015 by @DanieleMorotti, passing `n_ctx=0` to the `Llama` class in `llama_cpp/llama.py` automatically sets `n_ctx` to the model's `n_ctx_train` parameter from the GGUF KV metadata, and also updates the model's `n_batch` to `min(n_ctx, n_batch)`. This is intentional: it allows `llama-cpp-python` to infer the context size from the GGUF model file's KV parameters.
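A rough sketch of that inference step (simplified and illustrative; the real logic lives inside `Llama`'s initialization and reads `n_ctx_train` from the loaded model):

```python
# Simplified illustration of the n_ctx=0 behavior introduced in #1015.
n_ctx = 0       # value passed by the caller (e.g. --n_ctx 0)
n_batch = 512   # illustrative default batch size

n_ctx_train = 4096  # in practice, read from the model's KV metadata

if n_ctx == 0:
    n_ctx = n_ctx_train             # fall back to the model's training context size
    n_batch = min(n_ctx, n_batch)   # keep n_batch no larger than n_ctx
```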
However, when this change was made, the OpenAI API server's CLI argument configuration was not updated, so the minimum value for the option remained at `1`, making the patch in #1015 unavailable to users of the OpenAI API server.
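With the constraint relaxed, an illustrative invocation (model path is a placeholder) such as `python3 -m llama_cpp.server --model ./models/model.gguf --n_ctx 0` would let the server pick up the context size from the GGUF file's `n_ctx_train`.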