
Add Multimodal LLM Finetuning #3450

Merged

merged 38 commits into hiyouga:mllm from BUAADreamer:mllm on Apr 25, 2024
Conversation

@BUAADreamer (Collaborator) commented Apr 25, 2024

What does this PR do?

Add finetuning for multimodal LLMs, especially LLaVA, by leveraging AutoModelForVision2Seq and AutoProcessor from transformers.

This PR is a work in progress and will need further improvement in the future, e.g. support for other MLLMs.

For more usage examples, you can refer to MLLM-Finetuning-Demo.
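For intuition, here is a minimal sketch of the loading path this PR relies on, not the LLaMA-Factory internals themselves; the checkpoint name is illustrative.

```python
# Minimal sketch: load a LLaVA checkpoint through the generic Auto classes
# from transformers. The checkpoint name is illustrative.
from transformers import AutoModelForVision2Seq, AutoProcessor

model_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(model_id)       # tokenizer + image processor
model = AutoModelForVision2Seq.from_pretrained(model_id)  # resolves to the LLaVA model class
```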

Supported Models

  • LLaVA-1.5

Make your own instruction dataset

Just organize your content in the same format as data/mllm_demo.json (a sketch follows below).
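As an illustration, each record pairs a conversation with the image(s) it refers to. The exact keys below are an assumption; treat data/mllm_demo.json in the repo as the authoritative format.

```python
# Illustrative sketch of one record in the mllm_demo.json style; the exact
# keys are an assumption — check data/mllm_demo.json for the real format.
import json

records = [
    {
        "messages": [
            {"role": "user", "content": "What is shown in this picture?"},
            {"role": "assistant", "content": "A dog playing in a park."},
        ],
        "images": ["data/my_images/0001.jpg"],
    }
]

with open("data/my_mllm_dataset.json", "w", encoding="utf-8") as f:
    json.dump(records, f, ensure_ascii=False, indent=2)
```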

Finetuning

See the example config at examples/lora_single_gpu/llava1_5_lora_sft.yaml.

Before submitting

@hiyouga self-requested a review on April 25, 2024 at 18:50

@hiyouga (Owner) commented Apr 25, 2024

LGTM! Thanks for your contribs!

@hiyouga merged commit c20f750 into hiyouga:mllm on Apr 25, 2024
@hiyouga added the "solved" label (This problem has been already solved) on Apr 25, 2024
@BUAADreamer deleted the mllm branch on May 23, 2024 at 05:40
@whyiug commented May 26, 2024

Hi @BUAADreamer, thanks for your work. Can you explain how this HF version differs from the original (I mean https://github.com/haotian-liu/LLaVA) during training, and whether the HF version trains the mm_projector layers?
Thanks a lot.

@BUAADreamer (Collaborator, Author) commented May 26, 2024

This HF version is nearly the same as the original; it was ported by the official Hugging Face researchers together with Haotian Liu.
Our current SFT of MLLMs is the same as the ft stage in the LLaVA paper: only the mm_proj and the LM are fine-tuned.
You can refer to this Zhihu blog to learn more about fine-tuning MLLMs,
and to this fine-tuned PaliGemma by @hiyouga for a successful example.
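For intuition, a minimal sketch of that trainable/frozen split, assuming the HF LLaVA module layout (vision_tower / multi_modal_projector / language_model); this is not the actual LLaMA-Factory code, and the checkpoint name is illustrative.

```python
# Sketch only: freeze the vision encoder, train the projector and the LM,
# mirroring the LLaVA ft stage described above.
from transformers import LlavaForConditionalGeneration

model = LlavaForConditionalGeneration.from_pretrained("llava-hf/llava-1.5-7b-hf")

for param in model.vision_tower.parameters():            # vision encoder stays frozen
    param.requires_grad = False
for param in model.multi_modal_projector.parameters():   # mm_proj is trained
    param.requires_grad = True
for param in model.language_model.parameters():          # LM is trained
    param.requires_grad = True
```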

@BUAADreamer
Copy link
Collaborator Author

BUAADreamer commented May 26, 2024

Besides, if you want to pre-train like LLaVA, you can refer to #3835 to fine-tune only the mm_proj.
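A hedged sketch of that pre-train-style setup (an assumption, not the exact code behind #3835): keep everything frozen except the projector.

```python
# Sketch only: train just the multi-modal projector, freezing the vision
# encoder and the language model (assumes the HF LLaVA parameter names).
from transformers import LlavaForConditionalGeneration

model = LlavaForConditionalGeneration.from_pretrained("llava-hf/llava-1.5-7b-hf")
for name, param in model.named_parameters():
    param.requires_grad = name.startswith("multi_modal_projector")
```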

@whyiug commented May 26, 2024

Thanks, that really helps me a lot.
