Support Several MLLM Models #4136

BUAADreamer · 2024-06-07T02:47:41Z

What does this PR do?

This PR is working!!

If you are interested, you can use my branch https://github.com/BUAADreamer/LLaMA-Factory for now.

Support models:

Video-LLaVA/LLaVA-Video-Next with videos and images as inputs
- sft with only video/image inputs (mllm_demo/video_demo)
- sft with video/image mix inputs (visual_mix_demo)
LLaVA-Next
Idefics-2

Features:

fine-tuning: sft/ppo/dpo/kto/orpo/simpo
inference: add video inference

Before submitting

Did you read the contributor guideline?

Davidchu11381 · 2024-06-10T19:49:49Z

Hi! By LlaVA-NEXT did you mean specifically this model? Similarly, does LlaVA-NEXT-Video refer to this model?

If not, would you be able to clarify which model you referred to? If yes, please let me know when they're good to use. Thank you!

BUAADreamer · 2024-06-11T04:09:51Z

All models are hf official models
for LLaVA, you could refer to https://huggingface.co/llava-hf
for idefics2, you could refer to https://huggingface.co/HuggingFaceM4

Davidchu11381 · 2024-06-11T06:58:45Z

Thanks for the clarification! Please let me know when you support the LlaVA-Next-Video model. Thank you!

BUAADreamer · 2024-06-13T01:52:06Z

This PR has not been merged into main, so every feature in this PR can not be tried yet using latest llama-factory.
You could wait until hiyouga releases the new version of llama-factory or this PR is closed!
@Davidchu11381 Please delete the above two comments so as not to mislead others!! Thanks!! 🤗

Davidchu11381 · 2024-06-13T04:34:13Z

I did! Thank you for your help! Could you also ping hiyouga for this issue? Sorry for bothering you. My lab (USC ISI) is working on something and we would really appreciate your help if we could use your tool to use LlaVA-Next. Please let me know when this is done :D Thanks again!

…uning problem of idefics2

JianbangZ · 2024-07-18T18:27:39Z

How about miniCPM V2.5?

BUAADreamer · 2024-07-21T02:24:18Z

How about miniCPM V2.5?

On the road!

1

Add special handling conditions to the llava-next-video model.

add visual model config for llava-next-video

BUAADreamer added 7 commits June 7, 2024 01:45

add videollava

9abd1b8

add videollava and demo video data

ddad20f

add videollava and demo video data

115ffbe

fix processor conflict

7cdc262

fix supervised conflict

0b7535e

support video-llava

4e97a83

support video-llava

adb3b26

BUAADreamer changed the title ~~Add Video Llava~~ Add Video-LLaVA Jun 7, 2024

hiyouga added the pending This problem is yet to be addressed label Jun 7, 2024

BUAADreamer and others added 6 commits June 8, 2024 09:34

Merge branch 'hiyouga:main' into main

ef76387

add av to requirements

76c6379

add llava-next/idefics2

3a53b3c

support video-llava/llava-next/idefics2(4.42)

3188a56

modify idefics2 template

daeffb4

Update requirements.txt

3fc87e8

hiyouga requested review from hiyouga and removed request for hiyouga June 8, 2024 13:26

BUAADreamer and others added 4 commits June 8, 2024 21:47

Merge branch 'hiyouga:main' into main

0f46edb

modify position of idefics2 in template

d2e4362

Merge branch 'main' of https://github.com/BUAADreamer/LLaMA-Factory

307e423

align preprocess_supervised_dataset implementation

7d2a8f3

BUAADreamer changed the title ~~Add Video-LLaVA~~ Support Several MLLM Models Jun 8, 2024

BUAADreamer mentioned this pull request Jun 10, 2024

Do you have plans to add fine-tuning scripts for other multimodal large models? For example, Qwen_VL, LLaVA1.6, MiniGPT4, etc. #4174

Closed

1 task

Merge branch 'hiyouga:main' into main

a0fe536

Merge branch 'hiyouga:main' into main

7689c9d

BUAADreamer linked an issue Jul 2, 2024 that may be closed by this pull request

Do you have plans to add fine-tuning scripts for other multimodal large models? For example, Qwen_VL, LLaVA1.6, MiniGPT4, etc. #4174

Closed

1 task

BUAADreamer and others added 2 commits July 2, 2024 16:07

add model constants

e65537d

Merge branch 'main' into main

5023974

Ben81828 mentioned this pull request Jul 10, 2024

ValueError: The model's config file has neither hidden_size nor hidden_sizes entry #4754

Closed

1 task

BUAADreamer mentioned this pull request Jul 13, 2024

LLama Factory现在是否支持LLava-Next系列 #4799

Closed

Merge branch 'hiyouga:main' into main

d5563d3

BUAADreamer had a problem deploying to tests July 15, 2024 08:23 — with GitHub Actions Failure

solve the predict problem of llava-next-video and the multi-gpu finet…

ca44c8d

…uning problem of idefics2

BUAADreamer had a problem deploying to tests July 15, 2024 09:27 — with GitHub Actions Failure

Merge branch 'hiyouga:main' into main

abdc2fa

BUAADreamer temporarily deployed to tests July 15, 2024 15:09 — with GitHub Actions Inactive

Repository owner deleted a comment from zjysteven Jul 21, 2024

Merge branch 'main' into main

7b5b32f

BUAADreamer temporarily deployed to tests July 22, 2024 01:24 — with GitHub Actions Inactive

Repository owner deleted a comment from zjysteven Jul 22, 2024

Merge branch 'main' into main

66980bf

BUAADreamer temporarily deployed to tests August 22, 2024 04:33 — with GitHub Actions Inactive

Kuangdd01 and others added 6 commits August 24, 2024 17:55

add if condition for llava-video

f033b3d

Merge pull request #1 from BUAADreamer/main

a96e29e

1

Merge branch 'main' of https://github.com/Kuangdd01/LLaMA-Factory-X

800793a

fix some errors

9eac318

remove redundant import

e116d34

Merge pull request #2 from Kuangdd01/main

7e59b76

Add special handling conditions to the llava-next-video model.

BUAADreamer temporarily deployed to tests August 25, 2024 15:08 — with GitHub Actions Inactive

Kuangdd01 and others added 2 commits August 28, 2024 17:46

add visual model config for llava-next-video

201593d

Merge pull request #3 from Kuangdd01/main

24526fe

add visual model config for llava-next-video

BUAADreamer temporarily deployed to tests August 28, 2024 10:57 — with GitHub Actions Inactive

BUAADreamer closed this Sep 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Several MLLM Models #4136

Support Several MLLM Models #4136

BUAADreamer commented Jun 7, 2024 •

edited

Loading

Davidchu11381 commented Jun 10, 2024

BUAADreamer commented Jun 11, 2024 •

edited

Loading

Davidchu11381 commented Jun 11, 2024

BUAADreamer commented Jun 13, 2024 •

edited

Loading

Davidchu11381 commented Jun 13, 2024

JianbangZ commented Jul 18, 2024

BUAADreamer commented Jul 21, 2024

Support Several MLLM Models #4136

Support Several MLLM Models #4136

Conversation

BUAADreamer commented Jun 7, 2024 • edited Loading

What does this PR do?

Before submitting

Davidchu11381 commented Jun 10, 2024

BUAADreamer commented Jun 11, 2024 • edited Loading

Davidchu11381 commented Jun 11, 2024

BUAADreamer commented Jun 13, 2024 • edited Loading

Davidchu11381 commented Jun 13, 2024

JianbangZ commented Jul 18, 2024

BUAADreamer commented Jul 21, 2024

BUAADreamer commented Jun 7, 2024 •

edited

Loading

BUAADreamer commented Jun 11, 2024 •

edited

Loading

BUAADreamer commented Jun 13, 2024 •

edited

Loading