Skip to content

v0.8.2: PiSSA, Parallel Functions

Compare
Choose a tag to compare
@hiyouga hiyouga released this 19 Jun 13:06
· 638 commits to main since this release

New features

New models

  • Base models
    • DeepSeek-Coder-V2 (16B MoE/236B MoE) 📄
  • Instruct/Chat models
    • MiniCPM-2B 📄🤖
    • DeepSeek-Coder-V2-Instruct (16B MoE/236B MoE) 📄🤖

New datasets

Bug fix