optimize cuda graph max_bs_settings on low-end gpus #2901
Annotations
2 errors
|
Benchmark single latency (TP=2)
The operation was canceled.
|
Loading