
Commit a39217d

common : print --flash-attn in help
1 parent 871fcb6 commit a39217d

1 file changed, +1 -0 lines changed

common/common.cpp

Lines changed: 1 addition & 0 deletions
@@ -1482,6 +1482,7 @@ void gpt_print_usage(int /*argc*/, char ** argv, const gpt_params & params) {
     printf(" -ns N, --sequences N number of sequences to decode (default: %d)\n", params.n_sequences);
     printf(" -ps N, --p-split N speculative decoding split probability (default: %.1f)\n", (double)params.p_split);
     printf(" -cb, --cont-batching enable continuous batching (a.k.a dynamic batching) (default: disabled)\n");
+    printf(" -fa, --flash-attn enable Flash Attention (default: %s)\n", params.flash_attn ? "enabled" : "disabled");
     printf(" --mmproj MMPROJ_FILE path to a multimodal projector file for LLaVA. see examples/llava/README.md\n");
     printf(" --image IMAGE_FILE path to an image file. use with multimodal models\n");
     if (llama_supports_mlock()) {
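
The added line reports the flag's default by mapping the boolean `params.flash_attn` to a string with a ternary. A minimal standalone sketch of that pattern follows. `demo_params` and `flash_attn_help_line` are hypothetical names used only for illustration, not part of common.cpp, and the `false` default is an assumption.

```cpp
#include <cstdio>
#include <string>

// Hypothetical stand-in for gpt_params: only the field used by the
// help line is modeled. The default value here is an assumption.
struct demo_params {
    bool flash_attn = false;
};

// Builds the help line the same way the added printf does: the bool is
// turned into "enabled"/"disabled" and substituted for %s.
std::string flash_attn_help_line(const demo_params & params) {
    char buf[128];
    std::snprintf(buf, sizeof(buf),
                  " -fa, --flash-attn enable Flash Attention (default: %s)\n",
                  params.flash_attn ? "enabled" : "disabled");
    return std::string(buf);
}
```

Calling `flash_attn_help_line` on a default-constructed `demo_params` yields a line ending in `(default: disabled)`. Setting `flash_attn = true` first changes the suffix to `(default: enabled)`, so the help text follows whatever default the params struct carries instead of hard-coding it.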
