feat: improve temperature logic in chat #1749

drbh · 2024-04-15T17:08:00Z

This PR adds support for `do_sample` in chat to enable greedy sampling.

router/src/validation.rs

Narsil · 2024-04-17T13:41:33Z

router/src/server.rs

@@ -775,6 +776,10 @@ async fn chat_completions(
    let logprobs = logprobs.unwrap_or(false);
    let tool_prompt = tool_prompt.unwrap_or_default();
    let stop = stop.unwrap_or_default();
+    // rescale temperature starting from 0.0 to 1.0
+    let adjusted_temperature = temperature.map_or(1.0, |t| t + 1.0);


I do not understand this. Why are you adding 1.0?

1 is added since OpenAI's API is essentially zero-indexed: it treats 0 as deterministic and higher values as more random.

In our case 1 is deterministic and greater is more random, so adding 1 allows users to get deterministic output with temp 0, and more random with larger values.

temperature:
What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.

No it's not.

logits.pow(temperature).

temperature == 0 -> Pretty much forces deterministic sampling (with float issues).
temperature == 1 -> Regular distribution.
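
To make the two cases above concrete, here is a minimal sketch of temperature scaling. The comment above writes it as `logits.pow(temperature)`; this sketch instead assumes the common formulation, where logits are divided by the temperature before a softmax. The function name `softmax_with_temperature` is hypothetical, and this is not taken from the TGI code base.

```rust
/// Hypothetical helper: softmax over `logits` after dividing each logit
/// by `temperature` (assumed formulation, not necessarily what TGI does).
/// `temperature` must be strictly positive; 0.0 would divide by zero,
/// which is one reason greedy decoding is handled as a separate path.
fn softmax_with_temperature(logits: &[f32], temperature: f32) -> Vec<f32> {
    assert!(temperature > 0.0, "temperature must be > 0");
    // Scale the logits by the temperature.
    let scaled: Vec<f32> = logits.iter().map(|l| l / temperature).collect();
    // Subtract the maximum scaled logit for numerical stability.
    let max = scaled.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = scaled.iter().map(|s| (s - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    // Normalize so the probabilities sum to 1.
    exps.iter().map(|e| e / sum).collect()
}
```

Under this assumption, a temperature of 1.0 leaves the distribution unchanged, while a temperature close to 0 pushes almost all probability mass onto the largest logit, which approximates greedy decoding.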

Makes sense! PR updated to:

- only set do_sample to false when temperature is 0
- rescale temperature from 0 to 1 only when temp is 0

I believe this fits the requirements, and enables greedy ONLY when temperature is 0. Please lmk if I should change! 🙏

drbh · 2024-04-17T14:13:22Z

tiny reproduction script

import requests

headers = {
    "Content-Type": "application/json",
}

data = {
    "model": "tgi",
    "messages": [
        {
            "role": "user",
            "content": "Summarize the main ideas of Jeff Walker's Product Launch Formula into bullet points as it pertains to a growth marketing agency implementing these strategies and tactics for their clients...",
        }
    ],
    "stream": False,
    "max_tokens": 100,
    "temperature": 0,
}

url = "http://localhost:3000/v1/chat/completions"

response = requests.post(url, headers=headers, json=data)

print(response.text)

HuggingFaceDocBuilderDev · 2024-04-17T14:18:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Narsil · 2024-04-17T18:35:57Z

router/src/server.rs

+    let adjusted_temperature = temperature.map_or(1.0, |t| if t == 0.0 { 1.0 } else { t });
+    let temperature = Some(adjusted_temperature);


Isn't that:

let temperature = if temperature == 0.0 { None } else { Some(temperature) };

Better yet, make the `== 0.0` condition common to both do_sample and temperature (makes it more readable imho).

I don't think that's exactly equivalent, since we need the following:

- do_sample to be true in all cases except when the incoming temp is 0
- temperature to be an Option<f32>, with 0 replaced by 1

The most readable/simple version I've come up with so far is:

let do_sample: bool = temperature.map_or(true, |t| t != 0.0);
let temperature = temperature.map(|t| if t == 0.0 { 1.0 } else { t });

whatcha think? happy to make any changes!
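
For reference, the two requirements discussed above can be wrapped in a small standalone function. This is only a sketch: the name `resolve_sampling` and the tuple return type are hypothetical and are not part of the PR, which applies the same two expressions inline in `chat_completions`.

```rust
/// Hypothetical helper (not part of the PR): maps the incoming
/// OpenAI-style temperature to a `(do_sample, temperature)` pair.
fn resolve_sampling(temperature: Option<f32>) -> (bool, Option<f32>) {
    // Sample in every case except when the caller explicitly sent 0.
    let do_sample = temperature.map_or(true, |t| t != 0.0);
    // Replace an explicit 0 with 1 so downstream code never sees 0;
    // an absent temperature stays absent.
    let temperature = temperature.map(|t| if t == 0.0 { 1.0 } else { t });
    (do_sample, temperature)
}
```

With this sketch, `resolve_sampling(Some(0.0))` yields greedy decoding (`do_sample` false, temperature 1.0), `resolve_sampling(Some(0.7))` keeps sampling at 0.7, and `resolve_sampling(None)` keeps sampling with no temperature set.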

This PR adds support for `do_sample` to chat to enable greedy sampling --------- Co-authored-by: Nicolas Patry <[email protected]>

drbh changed the title ~~feat: support do_sample param in ChatRequest~~ feat: improve temperature logic in chat Apr 15, 2024

drbh self-assigned this Apr 16, 2024

drbh force-pushed the greedy-chat-tokens branch from 7c8b473 to a1df6c5 Compare April 16, 2024 16:36

drbh requested a review from Narsil April 17, 2024 13:38

Narsil reviewed Apr 17, 2024

router/src/validation.rs (outdated)

Narsil reviewed Apr 17, 2024

drbh requested a review from Narsil April 17, 2024 14:12

drbh added 8 commits April 17, 2024 10:42

feat: support do_sample param in ChatRequest

0520bde

fix: update temperature and sampling logic in chat

27cd254

fix: reduce and refactor changes

24a5588

fix: revise temp scaling logic

7879365

fix: adjust conditional after rebase

9387b3b

fix: add missing comma typo

6b9a257

fix: adjust req typo

4ea2a98

fix: simplify changes

a2c935d

drbh force-pushed the greedy-chat-tokens branch from a461eaa to a2c935d Compare April 17, 2024 14:42

fix: update conditional to be more specific

9906b03

Narsil reviewed Apr 17, 2024

drbh and others added 3 commits April 17, 2024 20:20

fix: make logic more readable

7ecda46

Lint.

7d07e92

Fmt + clippy.

ec0d913

Narsil merged commit 0acac5c into main Apr 25, 2024

Narsil deleted the greedy-chat-tokens branch April 25, 2024 13:31

kdamaszk pushed a commit to kdamaszk/tgi-gaudi that referenced this pull request Jun 10, 2024

feat: improve temperature logic in chat (huggingface#1749)

ab59a5e

This PR adds support for `do_sample` to chat to enable greedy sampling --------- Co-authored-by: Nicolas Patry <[email protected]>
