Skip to content

[Model] [Quantization] Support deepseek_v3 w8a8 fp8 block-wise quantization#11523

Merged
simon-mo merged 9 commits intomainfrom deepseek_v3-fp8-supportDec 26, 2024

Commits

Commits on Dec 26, 2024