
fix some features in llava-style training #3835

Merged: 11 commits merged into hiyouga:main on May 27, 2024

Conversation

@BUAADreamer (Collaborator) commented May 21, 2024

What does this PR do?

  1. Support LLaVA-style pretraining by fine-tuning only the mm_projector while keeping the LM frozen.
  2. Add the mllm_pt_demo dataset, which contains only single-turn image-captioning data; see BUAADreamer/mllm_pt_demo.
  3. Change target_modules to a regex string so that only the LoRA parameters of the LM and the mm_proj in a LLaVA-style MLLM are tuned (see the sketch after this list).
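
A minimal sketch of the regex idea in item 3, assuming the Hugging Face LlavaForConditionalGeneration module layout and PEFT's behaviour of full-matching a string `target_modules` as a regular expression; the checkpoint name, module names, and pattern below are illustrative, not the PR's exact code:

```python
# Illustrative only: scope LoRA to the language model's attention projections
# and the multimodal projector, while skipping the vision tower entirely.
from peft import LoraConfig, get_peft_model
from transformers import LlavaForConditionalGeneration

model = LlavaForConditionalGeneration.from_pretrained("llava-hf/llava-1.5-7b-hf")

# PEFT treats a string target_modules as a regex that is full-matched against
# each module name, so a negative lookahead can exclude the vision tower.
target_pattern = (
    r"^(?!.*vision_tower).*"           # skip every module under the vision tower
    r"(?:q_proj|k_proj|v_proj|o_proj"  # LM attention projections
    r"|linear_1|linear_2)$"            # multi_modal_projector layers
)

peft_model = get_peft_model(
    model, LoraConfig(r=8, lora_alpha=16, target_modules=target_pattern)
)
peft_model.print_trainable_parameters()
```

The pattern the PR actually constructs may differ; the point is that a single regex string can cover the LM and the projector while leaving the vision encoder untouched.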

Usage

  1. Pre-train: use the sft stage and set tune_mm_proj=true, finetuning_type="full", and dataset="mllm_pt_demo" (a rough illustration follows this list).
  2. LoRA usage is the same as before.
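
A rough illustration of what the pre-training setting in item 1 amounts to, as a sketch under assumed module names rather than LLaMA-Factory's actual implementation: with the mm projector as the only tunable part, every other parameter is frozen before full-parameter training starts.

```python
# Hypothetical helper showing the effect of "tune only the mm projector":
# freeze the language model and vision tower, leave the projector trainable.
def freeze_all_but_projector(model, projector_keyword="multi_modal_projector"):
    for name, param in model.named_parameters():
        param.requires_grad = projector_keyword in name

# Usage sketch: after loading a LLaVA-style model as in the previous snippet,
# freeze_all_but_projector(model) leaves only the projector weights updatable,
# which mirrors the LLaVA stage-1 (feature alignment) recipe.
```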

Before submitting

@hiyouga added the pending (This problem is yet to be addressed) label on May 21, 2024
@BUAADreamer mentioned this pull request on May 26, 2024
@BUAADreamer changed the title from "support pretraining of mllm like llava style" to "fix some features in llava-style training" on May 27, 2024
@hiyouga (Owner) commented May 27, 2024

LGTM

@hiyouga merged commit 838f2fb into hiyouga:main on May 27, 2024
1 check passed
@hiyouga added the solved (This problem has been already solved) label and removed the pending label on May 27, 2024