fix: tokenizer config should use local model path when possible #1518

drbh · 2024-02-01T13:58:30Z

This PR fixes the issue with loading a local tokenizer config. Previously the default functionality would look in the current working directory. Now if a local model path is specified we will check that directory for the tokenizer_config.

Examples of valid commands

uses tokenizer_config from hub

text-generation-launcher --model-id HuggingFaceH4/zephyr-7b-beta

use tokenizer_config from local model path

text-generation-launcher \
  --model-id ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/

use specific tokenizer_config file

 text-generation-launcher \
  --model-id ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/ \
  --tokenizer-config-path ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/tokenizer_config.json

router/src/main.rs

Co-authored-by: Nicolas Patry <[email protected]>

router/src/main.rs

Co-authored-by: Nicolas Patry <[email protected]>

Narsil

LGTM

…ingface#1518) This PR fixes the issue with loading a local tokenizer config. Previously the default functionality would look in the current working directory. Now if a local model path is specified we will check that directory for the tokenizer_config. ## Examples of valid commands uses tokenizer_config from hub ``` text-generation-launcher --model-id HuggingFaceH4/zephyr-7b-beta ``` use tokenizer_config from local model path ``` text-generation-launcher \ --model-id ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/ ``` use specific tokenizer_config file ``` text-generation-launcher \ --model-id ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/ \ --tokenizer-config-path ~/.cache/huggingface/hub/models--HuggingFaceH4--zephyr-7b-beta/snapshots/dc24cabd13eacd3ae3a5fe574bd645483a335a4a/tokenizer_config.json ``` --------- Co-authored-by: Nicolas Patry <[email protected]>

fix: tokenizer config should use local model path when possible

6e08f5b

Narsil reviewed Feb 1, 2024

View reviewed changes

router/src/main.rs Outdated Show resolved Hide resolved

Update router/src/main.rs

da3f8a4

Co-authored-by: Nicolas Patry <[email protected]>

Narsil reviewed Feb 1, 2024

View reviewed changes

router/src/main.rs Outdated Show resolved Hide resolved

drbh and others added 2 commits February 1, 2024 09:09

Update router/src/main.rs

57e27bc

Co-authored-by: Nicolas Patry <[email protected]>

fix: simplify logic and remove vars

9ad6b57

Narsil approved these changes Feb 1, 2024

View reviewed changes

drbh merged commit ee1cf51 into main Feb 1, 2024

drbh deleted the tokenizer-config-prefer-local-model-path branch February 1, 2024 14:39

drbh mentioned this pull request Feb 2, 2024

feat: supports openai chat completions API #1427

Merged

edwardzjl mentioned this pull request Feb 4, 2024

Migrate to chat-openai edwardzjl/chatbot#294

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: tokenizer config should use local model path when possible #1518

fix: tokenizer config should use local model path when possible #1518

Uh oh!

drbh commented Feb 1, 2024

Uh oh!

Uh oh!

Uh oh!

Narsil left a comment

Uh oh!

Uh oh!

fix: tokenizer config should use local model path when possible #1518

fix: tokenizer config should use local model path when possible #1518

Uh oh!

Conversation

drbh commented Feb 1, 2024

Examples of valid commands

Uh oh!

Uh oh!

Uh oh!

Narsil left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!