Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Several MLLM Models #4136

Closed
wants to merge 46 commits into from
Closed

Conversation

BUAADreamer
Copy link
Collaborator

@BUAADreamer BUAADreamer commented Jun 7, 2024

What does this PR do?

This PR is working!!

If you are interested, you can use my branch https://github.com/BUAADreamer/LLaMA-Factory for now.

Support models:

  • Video-LLaVA/LLaVA-Video-Next with videos and images as inputs
    • sft with only video/image inputs (mllm_demo/video_demo)
    • sft with video/image mix inputs (visual_mix_demo)
  • LLaVA-Next
  • Idefics-2

Features:

  • fine-tuning: sft/ppo/dpo/kto/orpo/simpo
  • inference: add video inference

Before submitting

@BUAADreamer BUAADreamer changed the title Add Video Llava Add Video-LLaVA Jun 7, 2024
@hiyouga hiyouga added the pending This problem is yet to be addressed label Jun 7, 2024
@hiyouga hiyouga requested review from hiyouga and removed request for hiyouga June 8, 2024 13:26
@Davidchu11381
Copy link

Hi! By LlaVA-NEXT did you mean specifically this model? Similarly, does LlaVA-NEXT-Video refer to this model?

If not, would you be able to clarify which model you referred to? If yes, please let me know when they're good to use. Thank you!

@BUAADreamer
Copy link
Collaborator Author

BUAADreamer commented Jun 11, 2024

All models are hf official models
for LLaVA, you could refer to https://huggingface.co/llava-hf
for idefics2, you could refer to https://huggingface.co/HuggingFaceM4

@Davidchu11381
Copy link

Thanks for the clarification! Please let me know when you support the LlaVA-Next-Video model. Thank you!

@BUAADreamer
Copy link
Collaborator Author

BUAADreamer commented Jun 13, 2024

This PR has not been merged into main, so every feature in this PR can not be tried yet using latest llama-factory.
You could wait until hiyouga releases the new version of llama-factory or this PR is closed!
@Davidchu11381 Please delete the above two comments so as not to mislead others!! Thanks!! 🤗

@Davidchu11381
Copy link

I did! Thank you for your help! Could you also ping hiyouga for this issue? Sorry for bothering you. My lab (USC ISI) is working on something and we would really appreciate your help if we could use your tool to use LlaVA-Next. Please let me know when this is done :D Thanks again!

@JianbangZ
Copy link

How about miniCPM V2.5?

Repository owner deleted a comment from zjysteven Jul 21, 2024
@BUAADreamer
Copy link
Collaborator Author

How about miniCPM V2.5?

On the road!

Repository owner deleted a comment from zjysteven Jul 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
pending This problem is yet to be addressed
Projects
None yet
5 participants