Skip to content

Optimize Triton decoding kernel for long context#2394

Merged
merrymercy merged 7 commits intosgl-project:mainfrom ispobock:flash-decodingDec 8, 2024

Commits

Commits on Dec 7, 2024

Commits on Dec 8, 2024