-
Notifications
You must be signed in to change notification settings - Fork 27.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix: Qwen2-VL training on video datasets #33307
Conversation
Oh yeah, and this also prevents compile. Can we use |
@zucchini-nlp We have updated the implementation, how about the new one? |
@hiyouga sorry, what do you mean by "the new one"? |
@zucchini-nlp sorry, i say the latest commit in this PR 96286c3 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, looks good to me!
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
NIce! Would want to add a test to make sure this works, but good to merge as well
@ArthurZucker Hi, I think this slow test case already covers this patch, it may require non-trivial effort to implement a test case for training I think. transformers/tests/models/qwen2_vl/test_modeling_qwen2_vl.py Lines 324 to 359 in e48e5f1
|
Imo the fullgraph compile tests should capture the test, but those are part of GenerationTester and Qwen2VL yet doesn't have it. For that we need generation tests with image + text + (optional video) inputs, will add those soon. Tracker here #33374 |
Okay all thanks ! Let's merge 🤗 We'll improve the ecosytem for video and related tests on the fly! 🚀 |
* fix video finetuning * Update modeling_qwen2_vl.py * Update modeling_qwen2_vl.py * fix
* fix video finetuning * Update modeling_qwen2_vl.py * Update modeling_qwen2_vl.py * fix
What does this PR do?
We should clone the leaf tensor before doing the in-place operation, otherwise it raises exception in training.
Similar to
transformers/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py
Lines 1578 to 1584 in 9230d78
Before submitting
Pull Request section?
to it if that's the case.
documentation guidelines, and
here are tips on formatting docstrings.
Who can review?
@ArthurZucker @zucchini-nlp @simonJJJ