Commit 1c8b3e4

fix: Allow "output" layer in granite moe architecture (convert and cpp)
Branch: GraniteMoE
Co-Authored-By: [email protected]
Signed-off-by: Gabe Goodhart <[email protected]>
1 parent 317b15b commit 1c8b3e4

2 files changed, +2 -0 lines changed


gguf-py/gguf/constants.py

Lines changed: 1 addition & 0 deletions
@@ -1247,6 +1247,7 @@ class MODEL_TENSOR(IntEnum):
     MODEL_ARCH.GRANITE_MOE: [
         MODEL_TENSOR.TOKEN_EMBD,
         MODEL_TENSOR.OUTPUT_NORM,
+        MODEL_TENSOR.OUTPUT,
         MODEL_TENSOR.ATTN_NORM,
         MODEL_TENSOR.ATTN_Q,
         MODEL_TENSOR.ATTN_K,
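
As a rough illustration of what a per-architecture tensor list like this one does, the following sketch models it as an allow-list. The `ModelArch` and `ModelTensor` enums and the `is_tensor_allowed` helper are hypothetical stand-ins, not the real `gguf` module; only the member names come from the hunk, and the idea that a converter consults the list before writing a tensor is an assumption.

```python
from enum import IntEnum, auto


class ModelArch(IntEnum):
    """Hypothetical stand-in for the MODEL_ARCH enum."""
    GRANITE_MOE = auto()


class ModelTensor(IntEnum):
    """Hypothetical stand-in for MODEL_TENSOR (only the members in the hunk)."""
    TOKEN_EMBD = auto()
    OUTPUT_NORM = auto()
    OUTPUT = auto()
    ATTN_NORM = auto()
    ATTN_Q = auto()
    ATTN_K = auto()


# Allow-list keyed by architecture, shaped like the table the commit edits.
ARCH_TENSORS = {
    ModelArch.GRANITE_MOE: [
        ModelTensor.TOKEN_EMBD,
        ModelTensor.OUTPUT_NORM,
        ModelTensor.OUTPUT,  # the entry this commit adds
        ModelTensor.ATTN_NORM,
        ModelTensor.ATTN_Q,
        ModelTensor.ATTN_K,
    ],
}


def is_tensor_allowed(arch: ModelArch, tensor: ModelTensor) -> bool:
    """Return True if `tensor` is listed for `arch` (hypothetical helper)."""
    return tensor in ARCH_TENSORS.get(arch, [])
```

Under this model, `is_tensor_allowed(ModelArch.GRANITE_MOE, ModelTensor.OUTPUT)` is true only because of the added entry.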

src/llama.cpp

Lines changed: 1 addition & 0 deletions
@@ -1485,6 +1485,7 @@ static const std::map<llm_arch, std::map<llm_tensor, std::string>> LLM_TENSOR_NA
         {
             { LLM_TENSOR_TOKEN_EMBD, "token_embd" },
             { LLM_TENSOR_OUTPUT_NORM, "output_norm" },
+            { LLM_TENSOR_OUTPUT, "output" },
             { LLM_TENSOR_ATTN_NORM, "blk.%d.attn_norm" },
             { LLM_TENSOR_ATTN_Q, "blk.%d.attn_q" },
             { LLM_TENSOR_ATTN_K, "blk.%d.attn_k" },
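
The name table in this hunk mixes plain names (`"output"`) with per-layer templates (`"blk.%d.attn_q"`). The Python sketch below shows one way such entries could be resolved into concrete tensor names. The `TENSOR_NAMES` dictionary and the `tensor_name` helper are hypothetical and only mirror the entries visible in the hunk; they are not the lookup code in `src/llama.cpp`.

```python
from typing import Optional

# Hypothetical mirror of the name-table entries shown in the hunk.
TENSOR_NAMES = {
    "LLM_TENSOR_TOKEN_EMBD": "token_embd",
    "LLM_TENSOR_OUTPUT_NORM": "output_norm",
    "LLM_TENSOR_OUTPUT": "output",  # the entry this commit adds
    "LLM_TENSOR_ATTN_NORM": "blk.%d.attn_norm",
    "LLM_TENSOR_ATTN_Q": "blk.%d.attn_q",
    "LLM_TENSOR_ATTN_K": "blk.%d.attn_k",
}


def tensor_name(key: str, layer: Optional[int] = None) -> str:
    """Resolve a table entry to a concrete name (hypothetical helper).

    Per-layer templates contain "%d" and need a layer index;
    model-level names such as "output" are returned unchanged.
    """
    template = TENSOR_NAMES[key]  # KeyError if the table has no such entry
    if "%d" in template:
        if layer is None:
            raise ValueError(f"{key} is a per-layer tensor; pass a layer index")
        return template % layer
    return template


print(tensor_name("LLM_TENSOR_OUTPUT"))     # prints "output"
print(tensor_name("LLM_TENSOR_ATTN_Q", 3))  # prints "blk.3.attn_q"
```

In this sketch, a key that is missing from the table raises `KeyError`, which is the situation the added `LLM_TENSOR_OUTPUT` entry avoids for the `"output"` tensor.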
