Marlin 24 prefill performance improvement (about 25% better on average)#4983
Merged
mgoin merged 9 commits intovllm-project:mainfrom neuralmagic:marlin_24_improve_prefillMay 23, 2024
+107-32
Commits
Commits on May 22, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed