-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Latte: Latent Diffusion Transformer for Video Generation #7223
Comments
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
@sayakpaul Hi, I am the first author of Latte and we have updated the unwatermarked version of the LatteT2V model. We want to integrate Latte into |
Ccing @DN6 into this thread for further comments. I am happy to have the model integrated :) |
Thanks for integrating Latte and your awesome work maxin! |
Model/Pipeline/Scheduler description
Latte is a text2video diffusion transformer (similar to Sora), improving past the DiT and PixArt-alpha text2image models
The implementation is already based on diffusers (see latte_t2v.py), so adding it here should be a straightforward task
Open source status
Provide useful links for the implementation
The official repo https://github.com/Vchitect/Latte
Model on Huggingface: https://huggingface.co/maxin-cn/Latte
Paper: https://arxiv.org/abs/2401.03048v1
Project page: https://maxin-cn.github.io/latte_project/
The text was updated successfully, but these errors were encountered: