Skip to content

Marlin 24 prefill performance improvement (about 25% better on average)#4983

Merged
mgoin merged 9 commits intovllm-project:mainfrom neuralmagic:marlin_24_improve_prefillMay 23, 2024

Commits

Commits on May 22, 2024