llama : make Qwen2MoE QKV bias optional #12477

CISC · 2025-03-20T10:56:52Z

I'm guessing someone is cooking something, see huggingface/transformers#36735 :)

CISC · 2025-03-20T11:12:43Z

Hm, I see test-backend-ops fails a lot randomly lately, what's up with that?

ggerganov · 2025-03-20T12:24:09Z

25: CPY(type_src=f32,type_dst=q5_1,ne=[256,2,3,4],permute=[0,2,1,3]): [CPY] NMSE = 0.000002013 > 0.000001000 FAIL

This failure is expected. I think it is caused by a discrepancy in how floating-point numbers are rounded on the CPU versus on Metal (if I remember correctly).
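
To make the log line above easier to read, here is a minimal sketch of how an NMSE check of this kind could work. It assumes NMSE is the summed squared difference divided by the summed squared reference values. The function names `nmse` and `passes` are hypothetical and are not taken from test-backend-ops. Only the 1e-6 threshold comes from the log.

```python
def nmse(reference, candidate):
    """Normalized mean squared error: sum((a - b)^2) / sum(a^2).

    Assumed definition for illustration; not copied from test-backend-ops.
    """
    if len(reference) != len(candidate):
        raise ValueError("inputs must have the same length")
    squared_error = sum((a - b) ** 2 for a, b in zip(reference, candidate))
    reference_energy = sum(a * a for a in reference)
    if reference_energy == 0.0:
        raise ValueError("reference must not be all zeros")
    return squared_error / reference_energy


def passes(reference, candidate, max_nmse=1e-6):
    """Return True when the error stays at or below the threshold.

    The default threshold mirrors the 0.000001000 limit shown in the log.
    """
    return nmse(reference, candidate) <= max_nmse
```

Under this reading, the reported value 0.000002013 is about twice the allowed limit, which fits a small systematic rounding difference between two backends rather than a grossly wrong result.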

Make Qwen2MoE QKV bias optional

5a8d4c5

CISC requested a review from slaren March 20, 2025 10:57

ggerganov approved these changes Mar 20, 2025


CISC merged commit dbb3a47 into ggml-org:master Mar 20, 2025
47 of 48 checks passed

CISC deleted the qwen2moe_optional_qkv_bias branch March 20, 2025 11:50

Ivy233 pushed a commit to Ivy233/llama.cpp that referenced this pull request Mar 23, 2025

llama : make Qwen2MoE QKV bias optional (ggml-org#12477)

886e3c2
