Support InfiniAI Megrez 3b #10893

dixyes · 2024-12-19T09:06:44Z

This pr is to add InfiniAI Megrez support into llama.cpp

The model now(@58f1df16523cb2a9acb225aa808146e052f2b5b2) seems have a wrong eos_token set in its tokenizer_config.json.
( <|turn_end|> at template and <|turn_end> in json) Not sure if this is on purpose. Also metioned here

So the converted model will not stop generating in chat mode. Modify it to <|turn_end|> in tokenizer_config.json, then the generated gguf will work.

ngxson

I'm not sure about the tokenizer_pre == "megrez" part (if other collaborators know, please feel free to review this PR).

The template part looks good to me.

arch-btw · 2024-12-20T04:37:42Z

Thanks for doing this, I was trying it myself but didn't finish it. Just so you know they fixed the eos 30 minutes ago.

src/llama.cpp

* Support InfiniAI Megrez 3b * Fix tokenizer_clean_spaces for megrez

Support InfiniAI Megrez 3b

a02c63d

github-actions bot added testing Everything test related python python script changes labels Dec 19, 2024

ngxson reviewed Dec 19, 2024

View reviewed changes

slaren reviewed Dec 20, 2024

View reviewed changes

src/llama.cpp Outdated Show resolved Hide resolved

dixyes force-pushed the megrez branch 2 times, most recently from 048d345 to 73f3d01 Compare December 22, 2024 06:55

Fix tokenizer_clean_spaces for megrez

01a0c36

dixyes force-pushed the megrez branch from 73f3d01 to 01a0c36 Compare December 22, 2024 06:56

slaren approved these changes Dec 23, 2024

View reviewed changes

slaren merged commit b92a14a into ggml-org:master Dec 23, 2024
50 checks passed

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025

llama : support InfiniAI Megrez 3b (ggml-org#10893)

c0cba1a

* Support InfiniAI Megrez 3b * Fix tokenizer_clean_spaces for megrez

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

llama : support InfiniAI Megrez 3b (ggml-org#10893)

aad44e6

* Support InfiniAI Megrez 3b * Fix tokenizer_clean_spaces for megrez

mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025

llama : support InfiniAI Megrez 3b (ggml-org#10893)

c4d6296

* Support InfiniAI Megrez 3b * Fix tokenizer_clean_spaces for megrez

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support InfiniAI Megrez 3b #10893

Support InfiniAI Megrez 3b #10893

Uh oh!

dixyes commented Dec 19, 2024

Uh oh!

ngxson left a comment •

edited

Loading

Uh oh!

arch-btw commented Dec 20, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Support InfiniAI Megrez 3b #10893

Support InfiniAI Megrez 3b #10893

Uh oh!

Conversation

dixyes commented Dec 19, 2024

Uh oh!

ngxson left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

arch-btw commented Dec 20, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ngxson left a comment •

edited

Loading