sampling: fix top_k <= 0 #5388

Merged

Conversation

JohannesGaessler
Collaborator

Fixes abetlen/llama-cpp-python#1154.
The issue, I assume, is that top_k sampling is called directly rather than via the sampling queue.
top_k values <= 0 are then rounded up to min_keep instead of being set to candidates.size.
This PR adds a corresponding check and two more related test cases.
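For context, a minimal sketch of the clamping logic in question (`effective_top_k` is a hypothetical helper distilling what the top_k sampler does to `k`; it is not the verbatim llama.cpp code). Without the `k <= 0` check, a requested `top_k` of 0 was rounded up to `min_keep`, so only `min_keep` candidates survived and sampling became effectively deterministic:

```cpp
#include <algorithm>
#include <cstdio>

// Hypothetical helper (assumption, not the actual llama.cpp function):
// given the requested top_k, the number of candidates, and min_keep,
// compute how many candidates survive truncation.
static int effective_top_k(int k, int n_candidates, int min_keep) {
    // The fix: k <= 0 is defined as "top-k disabled", i.e. keep all candidates.
    // Without this check, the std::max below rounded k <= 0 up to min_keep.
    if (k <= 0) {
        k = n_candidates;
    }
    k = std::max(k, min_keep);
    k = std::min(k, n_candidates);
    return k;
}

int main() {
    // With a 32000-token vocabulary and min_keep = 1:
    printf("top_k =  0 -> keep %d\n", effective_top_k( 0, 32000, 1)); // 32000 with the check, 1 without
    printf("top_k = -1 -> keep %d\n", effective_top_k(-1, 32000, 1)); // 32000 with the check, 1 without
    printf("top_k = 40 -> keep %d\n", effective_top_k(40, 32000, 1)); // 40 either way
}
```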

@Green-Sky
Collaborator


Does this fix the server behavior when you set the "Show Probabilities" option?

@JohannesGaessler
Collaborator Author

I don't know which issue you're talking about, so I can't say.

@Green-Sky
Collaborator

No, it does not. Ignore me :)

@neerajprad

I still see the same issue on the latest version (0.2.44), where sampling is deterministic for K=0.

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Mar 13, 2024
* sampling: fix top_k <= 0

* Update llama.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>
hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024
* sampling: fix top_k <= 0

* Update llama.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>

Successfully merging this pull request may close these issues.

[0.2.38] Models always produce the same output when setting top_k to 0, still using min_p