fix cogvideox autoencoder decode #9569

Xiang-cd · 2024-10-02T06:56:08Z

What does this PR do?

fix bug of #9568

Fixes # (issue)
it is a simple fix with a line code change.
cogvideox train with mixsure of image and video, so the vae must support reconstruction of image, I fix the image case.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

any one who familiar with vae.
@sayakpaul @DN6 @yiyixuxu

Core library:

Schedulers: @yiyixuxu
Pipelines and pipeline callbacks: @yiyixuxu and @asomoza
Training examples: @sayakpaul
Docs: @stevhliu and @sayakpaul
JAX and MPS: @pcuenca
Audio: @sanchit-gandhi
General functionalities: @sayakpaul @yiyixuxu @DN6

Integrations:

deepspeed: HF Trainer/Accelerate: @SunMarc
PEFT: @sayakpaul @BenjaminBossan

HF projects:

accelerate: different repo
datasets: different repo
transformers: different repo
safetensors: different repo

-->

a-r-r-o-w

thank you, this looks correct!

a-r-r-o-w · 2024-10-02T08:00:20Z

The commit was to resolve the merge conflict with the following conv_cache line

HuggingFaceDocBuilderDev · 2024-10-02T08:05:40Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2024-10-02T09:34:13Z

@a-r-r-o-w could we have caught it with some tests? Or not really? In the former case, let's also add a test?

a-r-r-o-w · 2024-10-02T09:38:23Z

could we have caught it with some tests? Or not really? In the former case, let's also add a test?

Previously, we did not need to decode 1-frame video so this was not required, so testing for this case was not necessary. For Image-to-Video training, however, we need to be able to encode single frames (which is already supported). I think having decode work for a single frame is good to have too, since it could help with debugging.

sayakpaul · 2024-10-02T09:42:16Z

Cool. If you want to feel free to support a test.

Xiang-cd · 2024-10-02T09:49:56Z

thank you, the support of image decode is useful for image-video mix tasks and further research, and this was my first pr to huggingface, hope it could merge. :)

yiyixuxu · 2024-10-02T19:08:24Z

thanks for your PR!! @Xiang-cd

Co-authored-by: Aryan <[email protected]>

fix cogvideox autoencoder decode

8ca9105

Xiang-cd mentioned this pull request Oct 2, 2024

cogvideox autoencoder could not reconstruct an image #9568

Closed

a-r-r-o-w approved these changes Oct 2, 2024

View reviewed changes

a-r-r-o-w requested a review from yiyixuxu October 2, 2024 07:58

Merge branch 'main' into cogvideox-vae-fix

ae14a96

yiyixuxu merged commit 7f323f0 into huggingface:main Oct 2, 2024
15 checks passed

leisuzz pushed a commit to leisuzz/diffusers that referenced this pull request Oct 11, 2024

fix cogvideox autoencoder decode (huggingface#9569)

51adece

Co-authored-by: Aryan <[email protected]>

sayakpaul pushed a commit that referenced this pull request Dec 23, 2024

fix cogvideox autoencoder decode (#9569)

bdeff1e

Co-authored-by: Aryan <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix cogvideox autoencoder decode #9569

fix cogvideox autoencoder decode #9569

Xiang-cd commented Oct 2, 2024

a-r-r-o-w left a comment

a-r-r-o-w commented Oct 2, 2024

HuggingFaceDocBuilderDev commented Oct 2, 2024

sayakpaul commented Oct 2, 2024

a-r-r-o-w commented Oct 2, 2024

sayakpaul commented Oct 2, 2024

Xiang-cd commented Oct 2, 2024

yiyixuxu commented Oct 2, 2024

fix cogvideox autoencoder decode #9569

fix cogvideox autoencoder decode #9569

Conversation

Xiang-cd commented Oct 2, 2024

What does this PR do?

Before submitting

Who can review?

a-r-r-o-w left a comment

Choose a reason for hiding this comment

a-r-r-o-w commented Oct 2, 2024

HuggingFaceDocBuilderDev commented Oct 2, 2024

sayakpaul commented Oct 2, 2024

a-r-r-o-w commented Oct 2, 2024

sayakpaul commented Oct 2, 2024

Xiang-cd commented Oct 2, 2024

yiyixuxu commented Oct 2, 2024