v0.6.2: ORPO and Qwen1.5-32B

hiyouga released this 11 Apr 12:27

· 1243 commits to main since this release

New features

Support ORPO algorithm by @hiyouga in #3066
Support inferring BNB 4-bit models on multiple GPUs via the quantization_device_map argument
Reorganize README files, move example scripts to the examples folder
Support saving & loading arguments quickly in LlamaBoard by @hiyouga and @marko1616 in #3046
Support load alpaca-format dataset from the hub without dataset_info.json by specifying --dataset_dir ONLINE
Add a parameter moe_aux_loss_coef to control the coefficient of auxiliary loss in MoE models.

New models

Base models
- Breeze-7B-Base
- Qwen1.5-MoE-A2.7B (14B)
- Qwen1.5-32B
Instruct/Chat models
- Breeze-7B-Instruct
- Qwen1.5-MoE-A2.7B-Chat (14B)
- Qwen1.5-32B-Chat

Bug fix

Fix pile dataset download config by @lealaxy in #3053
Fix model generation config by @marko1616 in #3057
Fix qwen1.5 models DPO training by @changingivan and @hiyouga in #3083
Support Qwen1.5-32B by @sliderSun in #3160
Support Breeze-7B by @codemayq in #3161
Fix addtional_target in unsloth by @kno10 in #3201
Fix #2807 #3022 #3023 #3046 #3077 #3085 #3116 #3200 #3225

Contributors

kno10, changingivan, and 5 other contributors

Assets 2

0 Join discussion