Add Latte: Latent Diffusion Transformer for Video Generation #7223

kabachuha · 2024-03-05T16:03:52Z

Model/Pipeline/Scheduler description

Latte is a text2video diffusion transformer (similar to Sora), improving past the DiT and PixArt-alpha text2image models

The implementation is already based on diffusers (see latte_t2v.py), so adding it here should be a straightforward task

Open source status

The model implementation is available.
The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

The official repo https://github.com/Vchitect/Latte
Model on Huggingface: https://huggingface.co/maxin-cn/Latte
Paper: https://arxiv.org/abs/2401.03048v1
Project page: https://maxin-cn.github.io/latte_project/

sayakpaul · 2024-03-06T04:50:49Z

Thanks for bringing this to our notice. But as far as I understand it from here, the current model suffers from the issue of producing watermarked videos. Maybe let's wait till they release the unwatermarked version? Cc: @DN6

github-actions · 2024-04-05T15:02:44Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

maxin-cn · 2024-06-03T07:13:17Z

@sayakpaul Hi, I am the first author of Latte and we have updated the unwatermarked version of the LatteT2V model. We want to integrate Latte into diffusers library, what should I do? The pre-trained LatteT2V models are here and the codes are here.

sayakpaul · 2024-06-03T07:38:53Z

Ccing @DN6 into this thread for further comments. I am happy to have the model integrated :)

a-r-r-o-w · 2024-07-19T09:44:19Z

Thanks for integrating Latte and your awesome work maxin!

sayakpaul added the contributions-welcome label Mar 6, 2024

sayakpaul removed the contributions-welcome label Mar 6, 2024

github-actions bot added the stale Issues that haven't received updates label Apr 5, 2024

Quest4AiJ mentioned this issue Jun 3, 2024

Any plan to implement Latte in HuggingFace's diffusers library? Vchitect/Latte#83

Closed

maxin-cn mentioned this issue Jun 5, 2024

Latte: Latent Diffusion Transformer for Video Generation #8404

Merged

5 tasks

a-r-r-o-w closed this as completed Jul 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Latte: Latent Diffusion Transformer for Video Generation #7223

Add Latte: Latent Diffusion Transformer for Video Generation #7223

kabachuha commented Mar 5, 2024

sayakpaul commented Mar 6, 2024 •

edited

Loading

github-actions bot commented Apr 5, 2024

maxin-cn commented Jun 3, 2024

sayakpaul commented Jun 3, 2024

a-r-r-o-w commented Jul 19, 2024

Add Latte: Latent Diffusion Transformer for Video Generation #7223

Add Latte: Latent Diffusion Transformer for Video Generation #7223

Comments

kabachuha commented Mar 5, 2024

Model/Pipeline/Scheduler description

Open source status

Provide useful links for the implementation

sayakpaul commented Mar 6, 2024 • edited Loading

github-actions bot commented Apr 5, 2024

maxin-cn commented Jun 3, 2024

sayakpaul commented Jun 3, 2024

a-r-r-o-w commented Jul 19, 2024

sayakpaul commented Mar 6, 2024 •

edited

Loading