train params save feature #3046

marko1616 · 2024-03-29T11:24:23Z

这个Pr做了什么

添加了训练参数保存功能 #2807

第一次commit仅用于核心代码的code review

Before submitting

Did you read the contributor guideline?

marko1616 · 2024-03-29T20:31:08Z

控件列表获取改为input_elems对base_elems求区别(应该不会有太大问题了)
然后控件名称获取改为使用id在manager里查找
[TODO]训练参数预设

marko1616 · 2024-03-29T20:37:03Z

[TODO]一个看上去更好的交互UI

@marko1616

some ideas are borrowed from @marko1616

@marko1616

* fix packages * Update wechat.jpg * Updated README with new information * Updated README with new information * Updated README with new information * Follow HF_ENDPOINT environment variable * fix hiyouga#2346 * fix hiyouga#2777 hiyouga#2895 * add orca_dpo_pairs dataset * support fsdp + qlora * update readme * update tool extractor * paper release * add citation * move file * Update README.md, fix the release date of the paper * Update README_zh.md, fix the release date of the paper * Update wechat.jpg * fix hiyouga#2941 * fix hiyouga#2928 * fix hiyouga#2936 * fix Llama lora merge crash * fix Llama lora merge crash * fix Llama lora merge crash * pass ruff check * tiny fix * Update requirements.txt * Update README_zh.md * release v0.6.0 * add arg check * Update README_zh.md * Update README.md * update readme * tiny fix * release v0.6.0 (real) * Update wechat.jpg * fix hiyouga#2961 * fix bug * fix hiyouga#2981 * fix ds optimizer * update trainers * fix hiyouga#3010 * update readme * fix hiyouga#2982 * add project * update readme * release v0.6.1 * Update wechat.jpg * fix pile datset hf hub url * upgrade gradio to 4.21.0 * support save args in webui hiyouga#2807 hiyouga#3046 some ideas are borrowed from @marko1616 * Fix Llama model save for full param train * fix blank line contains whitespace * tiny fix * support ORPO * support orpo in webui * update readme * use log1p in orpo loss huggingface/trl#1491 * fix plots * fix IPO and ORPO loss * fix ORPO loss * update webui * support infer 4bit model on GPUs hiyouga#3023 * fix hiyouga#3077 * add qwen1.5 moe * fix hiyouga#3083 * set dev version * Update SECURITY.md * fix hiyouga#3022 * add moe aux loss control hiyouga#3085 * simplify readme * update readme * update readme * update examples * update examples * add zh readme * update examples * update readme * update vllm example * Update wechat.jpg * fix hiyouga#3116 * fix resize vocab at inference hiyouga#3022 * fix requires for windows * fix bug in latest gradio * back to gradio 4.21 and fix chat * tiny fix * update examples * update readme * support Qwen1.5-32B * support Qwen1.5-32B * fix spell error * support hiyouga#3152 * rename template to breeze * rename template to breeze * add empty line * Update wechat.jpg * tiny fix * fix quant infer and qwen2moe * Pass additional_target to unsloth Fixes hiyouga#3200 * Update adapter.py * Update adapter.py * fix hiyouga#3225 --------- Co-authored-by: hiyouga <[email protected]> Co-authored-by: 刘一博 <[email protected]> Co-authored-by: khazic <[email protected]> Co-authored-by: SirlyDreamer <[email protected]> Co-authored-by: Sanjay Nadhavajhala <[email protected]> Co-authored-by: sanjay920 <[email protected]> Co-authored-by: 0xez <[email protected]> Co-authored-by: marko1616 <[email protected]> Co-authored-by: Remek Kinas <[email protected]> Co-authored-by: Tsumugii24 <[email protected]> Co-authored-by: li.yunhao <[email protected]> Co-authored-by: sliderSun <[email protected]> Co-authored-by: codingma <[email protected]> Co-authored-by: Erich Schubert <[email protected]>

marko1616 added 3 commits March 29, 2024 19:21

train params save feature's core code(only for code review)

3006df3

Make a better manager

8e97e68

Add locals

b58e5ad

marko1616 changed the title ~~train params save feature's core code~~ train params save feature Mar 29, 2024

Add path input

bf3c269

marko1616 and others added 2 commits March 30, 2024 22:56

resolved conflict

5629285

Merge branch 'hiyouga:main' into feature/train_params_save

bcede7f

hiyouga added a commit that referenced this pull request Mar 30, 2024

support save args in webui #2807 #3046

7a086ed

some ideas are borrowed from @marko1616

marko1616 closed this Mar 30, 2024

hiyouga added the solved This problem has been already solved label Mar 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train params save feature #3046

train params save feature #3046

marko1616 commented Mar 29, 2024

marko1616 commented Mar 29, 2024

marko1616 commented Mar 29, 2024

train params save feature #3046

train params save feature #3046

Conversation

marko1616 commented Mar 29, 2024

这个Pr做了什么

第一次commit仅用于核心代码的code review

Before submitting

marko1616 commented Mar 29, 2024

marko1616 commented Mar 29, 2024