What's Changed
- 20250127 docs update by @arakowsk-amd in #392
- Faster Custom Paged Attention kernels by @sanyalington in #372
- Improved memory profiling by @gshtras in #394
- Aiter readme by @gshtras in #400
- fix None dict for quark by @hliuca in #402
- Upstream merge 25 02 03 by @gshtras in #403
- Mbatch p3l by @Alexei-V-Ivanov-AMD in #401
- Fix quark fp8 format loading. by @fxmarty-amd in #395
- WARP_SIZE in sgl moe kernel by @gshtras in #406
- Update README.md 20250205_aiter by @arakowsk-amd in #407
- fix rocm get_device name by @divakar-amd in #359
- Fixing the output formatting in P3L by @gshtras in #414
- Add tuned moe config for qwen1.5_moe_A2.7B by @sky0530 in #398
- Update Benchmark Profiling Scripts by @AdrianAbeyta in #417
- updating 20250207 image manifest by @arakowsk-amd in #416
- Upstream merge 25 02 10 by @gshtras in #418
- Aiter base by @gshtras in #419
New Contributors
- @arakowsk-amd made their first contribution in #392
- @fxmarty-amd made their first contribution in #395
- @sky0530 made their first contribution in #398
Full Changelog: v0.7.0+rocm...v0.7.2+rocm