Commit
[Misc] sort torch profiler table by kernel timing (vllm-project#11813)
divakar-amd authored and frreiss committed Jan 10, 2025
1 parent 550b91c commit 887282f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion benchmarks/benchmark_latency.py
@@ -52,7 +52,7 @@ def run_to_completion(profile_dir: Optional[str] = None):
                 llm.generate(dummy_prompts,
                              sampling_params=sampling_params,
                              use_tqdm=False)
-            print(p.key_averages())
+            print(p.key_averages().table(sort_by="self_cuda_time_total"))
         else:
             start_time = time.perf_counter()
             llm.generate(dummy_prompts,
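
For readers who want to try the changed call outside the benchmark, here is a minimal sketch of printing a profiler summary sorted by self time. It is not part of this commit: `profile_matmul`, the matrix size, and the `row_limit` value are hypothetical choices for illustration, and the fallback to `self_cpu_time_total` on machines without CUDA is an assumption added so the sketch runs anywhere.

```python
import torch
from torch.profiler import ProfilerActivity, profile


def profile_matmul(size: int = 128, row_limit: int = 10) -> str:
    """Profile a few matmuls and return the summary table as a string.

    Hypothetical helper for illustration; not part of benchmark_latency.py.
    """
    activities = [ProfilerActivity.CPU]
    sort_key = "self_cpu_time_total"  # assumed fallback when no GPU is present
    device = "cpu"
    if torch.cuda.is_available():
        activities.append(ProfilerActivity.CUDA)
        sort_key = "self_cuda_time_total"  # the key used in the commit
        device = "cuda"

    x = torch.randn(size, size, device=device)
    with profile(activities=activities) as prof:
        for _ in range(5):
            _ = x @ x
        if device == "cuda":
            # Wait for queued kernels so they are captured by the profiler.
            torch.cuda.synchronize()

    # key_averages() groups events by name; table() renders them as text,
    # ordered by the chosen column in descending order.
    return prof.key_averages().table(sort_by=sort_key, row_limit=row_limit)


if __name__ == "__main__":
    print(profile_matmul())
```

Sorting by self time puts the most expensive operations at the top of the table, which is the point of the one-line change above.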