Skip to content

Commit 80a04a4

Browse files
slarenteleprint-me
authored andcommitted
phi3 : duplicate rope factors in each layer (ggml-org#7447)
* phi3 : duplicate rope factors in each layer phi3 : set phi-3 model type as 14B model loader : simplify the process for duplicating model tensors llama-bench : remove default pg test * replace bool parameters in llama_model_loader with named flags
1 parent 367e703 commit 80a04a4

File tree

2 files changed

+83
-97
lines changed

2 files changed

+83
-97
lines changed

examples/llama-bench/llama-bench.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -195,7 +195,7 @@ static const cmd_params cmd_params_defaults = {
195195
/* model */ {"models/7B/ggml-model-q4_0.gguf"},
196196
/* n_prompt */ {512},
197197
/* n_gen */ {128},
198-
/* n_pg */ {{512, 128}},
198+
/* n_pg */ {},
199199
/* n_batch */ {2048},
200200
/* n_ubatch */ {512},
201201
/* type_k */ {GGML_TYPE_F16},

0 commit comments

Comments
 (0)