
reset glmedge chat template #13253


Merged · 2 commits merged on May 2, 2025

Conversation

@piDack (Contributor) commented on May 2, 2025


Hi @ngxson, I'd like to know why [gMASK] needs to be added for the glmedge model. I tried the code below, and through testing I found that the token does not exist in the edge model's chat template. I think we shouldn't add it, right?

from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_PATH = "<path>"

tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
model = AutoModelForCausalLM.from_pretrained(MODEL_PATH, device_map="auto")

message = [{"role": "user", "content": "hello!"}]

inputs = tokenizer.apply_chat_template(
    message,
    return_tensors="pt",
    add_generation_prompt=True,
    return_dict=True,
).to(model.device)
print(tokenizer.decode(inputs["input_ids"][0], skip_special_tokens=False))

The output is:

(screenshot: the decoded prompt, with no [gMASK] token present)
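
For completeness, a direct programmatic check (a small sketch continuing the snippet above; per the test described here it should print False for glm-edge):

decoded = tokenizer.decode(inputs["input_ids"][0], skip_special_tokens=False)
# Expected to print False for the glm-edge model, i.e. no [gMASK] in the template
print("[gMASK]" in decoded)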

Also, what performance problems does adding it cause?

@github-actions bot added the "testing (Everything test related)" label on May 2, 2025
@ngxson (Collaborator) commented on May 2, 2025

There is a test in llava/test.sh, and without adding [gMASK] the model never responds.

I don't know why, but tbh the chat template of glmedge is quite confusing.

Did you test it (the glmedge GGUF model with vision input) without [gMASK]?

@piDack (Contributor, author) commented on May 2, 2025

> There is a test in llava/test.sh, and without adding [gMASK] the model never responds.
>
> I don't know why, but tbh the chat template of glmedge is quite confusing.
>
> Did you test it (the glmedge GGUF model with vision input) without [gMASK]?

I think it's because of the changes to BOI and EOI in your other PR #13081. In all GLM multimodal models, BOI and EOI are stored in the weights as learned embeddings rather than using the <|begin_of_image|> and <|end_of_image|> token IDs directly: https://huggingface.co/THUDM/glm-edge-v-2b/blob/main/siglip.py#L53. This is quite strange, but they indeed do it this way.
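
Roughly, the pattern in the linked siglip.py looks like the following (a minimal illustrative sketch, not the actual THUDM code; the class name, hidden_size argument, and image_features shape are assumptions):

import torch
import torch.nn as nn

class VisionAdapter(nn.Module):
    """Illustrative only: boi/eoi live in the checkpoint as trainable
    parameters, not as text tokens in the chat template."""
    def __init__(self, hidden_size: int):
        super().__init__()
        self.boi = nn.Parameter(torch.zeros(1, 1, hidden_size))
        self.eoi = nn.Parameter(torch.zeros(1, 1, hidden_size))

    def forward(self, image_features: torch.Tensor) -> torch.Tensor:
        # image_features: (batch, num_patches, hidden_size)
        b = image_features.shape[0]
        # Wrap the image features between the learned boi/eoi embeddings.
        return torch.cat(
            [self.boi.expand(b, -1, -1), image_features, self.eoi.expand(b, -1, -1)],
            dim=1,
        )

If that reading is right, treating boi/eoi as ordinary text tokens in the template would not match what the weights expect.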

@ngxson (Collaborator) commented on May 2, 2025

Hmm, OK, I see. I didn't know about this Python code. This is quite messy because all other models use boi/eoi as text tokens.

I'll bring back the boi/eoi embeddings in a follow-up PR.

@ngxson (Collaborator) commented on May 2, 2025

Btw, I'm not sure why the CI failed; let's try to make it green before merging.

@piDack (Contributor, author) commented on May 2, 2025

> Hmm, OK, I see. I didn't know about this Python code. This is quite messy because all other models use boi/eoi as text tokens.

Yes, I find it confusing too.

@piDack (Contributor, author) commented on May 2, 2025

> Btw, I'm not sure why the CI failed; let's try to make it green before merging.

Ok

@ngxson merged commit 2af6880 into ggml-org:master on May 2, 2025
50 checks passed