--additional_target not supported with unsloth? #3200

kno10 · 2024-04-09T15:37:12Z

Reminder

I have read the README and searched the existing issues.

Reproduction

It appears the additional_target parameter is not applied when unsloth is used.
The number of trainable parameters only includes the enabled lora layers, not the embedding layer.
For all I can tell, unsloth does support optimizing the embedding layers, too.

LLaMA-Factory/src/llmtuner/model/adapter.py

Lines 144 to 157 in 7f6c248

    
           if model_args.use_unsloth: 
        
               from unsloth import FastLanguageModel  # type: ignore 
        
               unsloth_peft_kwargs = {"model": model, "max_seq_length": model_args.model_max_length} 
        
               model = FastLanguageModel.get_peft_model(**peft_kwargs, **unsloth_peft_kwargs) 
        
           else: 
        
               lora_config = LoraConfig( 
        
                   task_type=TaskType.CAUSAL_LM, 
        
                   inference_mode=False, 
        
                   modules_to_save=finetuning_args.additional_target, 
        
                   use_dora=finetuning_args.use_dora, 
        
                   **peft_kwargs, 
        
               ) 
        
               model = get_peft_model(model, lora_config)

Maybe this is sufficient?

--- a/src/llmtuner/model/adapter.py
+++ b/src/llmtuner/model/adapter.py
@@ -145,6 +145,8 @@ def init_adapter(
                 from unsloth import FastLanguageModel  # type: ignore
 
                 unsloth_peft_kwargs = {"model": model, "max_seq_length": model_args.model_max_length}
+                if finetuning_args.additional_target:
+                    unsloth_peft_kwargs["modules_to_save"] = finetuning_args.additional_target
                 model = FastLanguageModel.get_peft_model(**peft_kwargs, **unsloth_peft_kwargs)
             else:
                 lora_config = LoraConfig(

Expected behavior

No response

System Info

No response

Others

No response

The text was updated successfully, but these errors were encountered:

Fixes hiyouga#3200

Pass additional_target to unsloth

@marko1616

* fix packages * Update wechat.jpg * Updated README with new information * Updated README with new information * Updated README with new information * Follow HF_ENDPOINT environment variable * fix hiyouga#2346 * fix hiyouga#2777 hiyouga#2895 * add orca_dpo_pairs dataset * support fsdp + qlora * update readme * update tool extractor * paper release * add citation * move file * Update README.md, fix the release date of the paper * Update README_zh.md, fix the release date of the paper * Update wechat.jpg * fix hiyouga#2941 * fix hiyouga#2928 * fix hiyouga#2936 * fix Llama lora merge crash * fix Llama lora merge crash * fix Llama lora merge crash * pass ruff check * tiny fix * Update requirements.txt * Update README_zh.md * release v0.6.0 * add arg check * Update README_zh.md * Update README.md * update readme * tiny fix * release v0.6.0 (real) * Update wechat.jpg * fix hiyouga#2961 * fix bug * fix hiyouga#2981 * fix ds optimizer * update trainers * fix hiyouga#3010 * update readme * fix hiyouga#2982 * add project * update readme * release v0.6.1 * Update wechat.jpg * fix pile datset hf hub url * upgrade gradio to 4.21.0 * support save args in webui hiyouga#2807 hiyouga#3046 some ideas are borrowed from @marko1616 * Fix Llama model save for full param train * fix blank line contains whitespace * tiny fix * support ORPO * support orpo in webui * update readme * use log1p in orpo loss huggingface/trl#1491 * fix plots * fix IPO and ORPO loss * fix ORPO loss * update webui * support infer 4bit model on GPUs hiyouga#3023 * fix hiyouga#3077 * add qwen1.5 moe * fix hiyouga#3083 * set dev version * Update SECURITY.md * fix hiyouga#3022 * add moe aux loss control hiyouga#3085 * simplify readme * update readme * update readme * update examples * update examples * add zh readme * update examples * update readme * update vllm example * Update wechat.jpg * fix hiyouga#3116 * fix resize vocab at inference hiyouga#3022 * fix requires for windows * fix bug in latest gradio * back to gradio 4.21 and fix chat * tiny fix * update examples * update readme * support Qwen1.5-32B * support Qwen1.5-32B * fix spell error * support hiyouga#3152 * rename template to breeze * rename template to breeze * add empty line * Update wechat.jpg * tiny fix * fix quant infer and qwen2moe * Pass additional_target to unsloth Fixes hiyouga#3200 * Update adapter.py * Update adapter.py * fix hiyouga#3225 --------- Co-authored-by: hiyouga <[email protected]> Co-authored-by: 刘一博 <[email protected]> Co-authored-by: khazic <[email protected]> Co-authored-by: SirlyDreamer <[email protected]> Co-authored-by: Sanjay Nadhavajhala <[email protected]> Co-authored-by: sanjay920 <[email protected]> Co-authored-by: 0xez <[email protected]> Co-authored-by: marko1616 <[email protected]> Co-authored-by: Remek Kinas <[email protected]> Co-authored-by: Tsumugii24 <[email protected]> Co-authored-by: li.yunhao <[email protected]> Co-authored-by: sliderSun <[email protected]> Co-authored-by: codingma <[email protected]> Co-authored-by: Erich Schubert <[email protected]>

kno10 added a commit to kno10/LLaMA-Factory that referenced this issue Apr 9, 2024

Pass additional_target to unsloth

b5eefe5

Fixes hiyouga#3200

kno10 mentioned this issue Apr 9, 2024

Pass additional_target to unsloth #3201

Merged

hiyouga closed this as completed in #3201 Apr 9, 2024

hiyouga added a commit that referenced this issue Apr 9, 2024

Merge pull request #3201 from kno10/patch-1 and fix #3200

caf8373

Pass additional_target to unsloth

hiyouga added the solved This problem has been already solved label Apr 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

--additional_target not supported with unsloth? #3200

--additional_target not supported with unsloth? #3200

kno10 commented Apr 9, 2024

--additional_target not supported with unsloth? #3200

--additional_target not supported with unsloth? #3200

Comments

kno10 commented Apr 9, 2024

Reminder

Reproduction

Expected behavior

System Info

Others