Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Latte: Latent Diffusion Transformer for Video Generation #7223

Closed
2 tasks done
kabachuha opened this issue Mar 5, 2024 · 5 comments
Closed
2 tasks done

Add Latte: Latent Diffusion Transformer for Video Generation #7223

kabachuha opened this issue Mar 5, 2024 · 5 comments
Labels
stale Issues that haven't received updates

Comments

@kabachuha
Copy link
Contributor

Model/Pipeline/Scheduler description

Latte is a text2video diffusion transformer (similar to Sora), improving past the DiT and PixArt-alpha text2image models

The implementation is already based on diffusers (see latte_t2v.py), so adding it here should be a straightforward task

Open source status

  • The model implementation is available.
  • The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

The official repo https://github.com/Vchitect/Latte
Model on Huggingface: https://huggingface.co/maxin-cn/Latte
Paper: https://arxiv.org/abs/2401.03048v1
Project page: https://maxin-cn.github.io/latte_project/

@sayakpaul
Copy link
Member

sayakpaul commented Mar 6, 2024

Thanks for bringing this to our notice. But as far as I understand it from here, the current model suffers from the issue of producing watermarked videos. Maybe let's wait till they release the unwatermarked version? Cc: @DN6

Copy link

github-actions bot commented Apr 5, 2024

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@maxin-cn
Copy link
Contributor

maxin-cn commented Jun 3, 2024

@sayakpaul Hi, I am the first author of Latte and we have updated the unwatermarked version of the LatteT2V model. We want to integrate Latte into diffusers library, what should I do? The pre-trained LatteT2V models are here and the codes are here.

@sayakpaul
Copy link
Member

Ccing @DN6 into this thread for further comments. I am happy to have the model integrated :)

@a-r-r-o-w
Copy link
Member

Thanks for integrating Latte and your awesome work maxin!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
stale Issues that haven't received updates
Projects
None yet
Development

No branches or pull requests

4 participants