
convert : fix context length for nomic-embed-text-v2-moe #13216

Merged: 1 commit merged into master on May 2, 2025

Conversation

cebtenzzre (Collaborator)

As noted by ggerganov, nomic-embed-text-v2-moe is correctly documented to be trained with up to 512 tokens of context, so the hardcoded value of 2048 used in the convert script is not accurate.

With this change, nomic-embed-text-v1 and v1.5 still convert with context_length=2048, and nomic-embed-text-v2-moe now converts with context_length=512.
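A minimal sketch of the kind of override described above, written in the style of llama.cpp's convert_hf_to_gguf.py. The class layout (`set_gguf_parameters`, `self.hparams`, `self.gguf_writer.add_context_length`) follows that script's conventions, but the MoE-detection key `moe_every_n_layers` and the exact branching are assumptions for illustration, not the merged patch:

```python
# Illustrative sketch only: assumes it lives inside convert_hf_to_gguf.py,
# where BertModel and the Model/GGUFWriter plumbing already exist.
class NomicBertModel(BertModel):
    def set_gguf_parameters(self):
        super().set_gguf_parameters()

        # v1 and v1.5 are served with a 2048-token context (via RoPE scaling),
        # while v2-moe is documented as trained with at most 512 tokens.
        if self.hparams.get("moe_every_n_layers", 0) > 0:  # hypothetical MoE marker
            context_length = 512
        else:
            context_length = 2048

        self.gguf_writer.add_context_length(context_length)
```

With a check along these lines, v1 and v1.5 keep the 2048-token context they are served with, while the MoE variant gets its documented 512-token training length.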

@cebtenzzre requested a review from ggerganov on Apr 30, 2025
@github-actions bot added the python (python script changes) label on Apr 30, 2025
@cebtenzzre merged commit 7d21234 into master on May 2, 2025
7 checks passed
@cebtenzzre deleted the jared/fix-nomic-embed-v2-nctx branch on May 2, 2025
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request May 2, 2025
* GraniteMoEShared:
fix: Fix the input to the shared experts
fix: Cleaner (maybe more correct?) splitting for gate/up
feat: First WIP cut at model arch in cpp
fix: Split MoE fused tensors for shared experts in conversion
feat: hparam and arch plumbing for granitemoeshared
feat: Add GGUF conversion for granitemoeshared
llama-model : support Qwen2 embedding models and pooling_mode_lasttoken (ggml-org#13245)
convert : use correct context length for nomic-embed-text-v2 (ggml-org#13216)
Labels: python (python script changes)
2 participants