Skip to content

Commit 9470c01

Browse files
committed
fix note
1 parent 7885df6 commit 9470c01

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/tensorrt_llm/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -143,7 +143,7 @@ dynamo serve graphs.agg:Frontend -f configs/deepseek_r1/mtp/mtp_agg.yaml
143143
Notes:
144144
- MTP is only available within the container built with the experimental TensorRT-LLM commit. Please add --use-default-experimental-tensorrtllm-commit to the arguments of the build.sh script.
145145

146-
ex: `./container/build.sh --framework tensorrtllm --use-default-experimental-tensorrtllm-commit`
146+
Example: `./container/build.sh --framework tensorrtllm --use-default-experimental-tensorrtllm-commit`
147147

148148
- There is a noticeable latency for the first two inference requests. Please send warm-up requests before starting the benchmark.
149149
- MTP performance may vary depending on the acceptance rate of predicted tokens, which is dependent on the dataset or queries used while benchmarking. Additionally, `ignore_eos` should generally be omitted or set to `false` when using MTP to avoid speculating garbage outputs and getting unrealistic acceptance rates.

0 commit comments

Comments
 (0)