Commit

[Misc] Update benchmark_prefix_caching.py fixed example usage (vllm-project#11920)

Signed-off-by: Ren MinMin <[email protected]>
Co-authored-by: Ren MinMin <[email protected]>
Signed-off-by: Harry Mellor <[email protected]>
2 people authored and hmellor committed Jan 12, 2025
1 parent 5227c68 commit e5d2f35
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion benchmarks/benchmark_prefix_caching.py
@@ -10,7 +10,8 @@
     --model meta-llama/Llama-2-7b-chat-hf \
     --enable-prefix-caching \
     --num-prompts 1 \
-    --repeat-count 100
+    --repeat-count 100 \
+    --input-length-range 128:256
 ShareGPT example usage:
     # This command samples 20 prompts with input lengths
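
For readers unfamiliar with the `128:256` form, the following is a minimal sketch of how a `min:max` range argument could be parsed with `argparse`. It assumes the value is read as an inclusive pair of integer lengths. The helper name `parse_length_range` is hypothetical and is not taken from `benchmark_prefix_caching.py`, which may handle the flag differently.

```python
import argparse


def parse_length_range(value: str) -> tuple[int, int]:
    """Parse a 'min:max' string such as '128:256' into a pair of ints.

    Hypothetical helper for illustration only; not the script's own code.
    """
    parts = value.split(":")
    if len(parts) != 2:
        raise argparse.ArgumentTypeError(
            f"expected 'min:max', got {value!r}")
    try:
        low, high = int(parts[0]), int(parts[1])
    except ValueError:
        raise argparse.ArgumentTypeError(
            f"range bounds must be integers, got {value!r}")
    if low <= 0 or low > high:
        raise argparse.ArgumentTypeError(
            f"need 0 < min <= max, got {value!r}")
    return low, high


parser = argparse.ArgumentParser()
# A string default is passed through `type`, so it is parsed like user input.
parser.add_argument("--input-length-range", type=parse_length_range,
                    default="128:256")

args = parser.parse_args(["--input-length-range", "128:256"])
low, high = args.input_length_range
```

Validating inside the `type` callable lets `argparse` report a malformed value as a normal usage error instead of a traceback.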
