Skip to content

Commit a88abd5

Browse files
Adding back query pre attn scalar override.
1 parent 47eba2b commit a88abd5

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

gemma/config.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -117,6 +117,7 @@ def get_config_for_9b() -> GemmaConfig:
117117
head_dim=256,
118118
attn_types=[AttentionType.LOCAL_SLIDING, AttentionType.GLOBAL] * 21,
119119
sliding_window_size=4096,
120+
query_pre_attn_scalar=224, # hidden_size / num_attention_heads
120121
)
121122

122123

0 commit comments

Comments
 (0)