This repository houses the official PyTorch implementation of the NeurIPS 2024 paper "PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher", covering ImageNet at resolutions from 64x64 to 512x512. Our code is heavily based on CTM.
Contacts:
- Dongjun Kim: [email protected]
- Chieh-Hsin (Jesse) Lai: [email protected]
We train a one-step text-to-image generator that progressively grows in resolution. For that, we only need low-resolution diffusion models.
- You may find PaGoDA's checkpoints on ImageNet. They include:
  - Stage 1's pretrained diffusion models at resolutions 32x32 and 64x64
  - Stage 2's PaGoDA generators at resolutions 32x32 and 64x64
  - Stage 3's PaGoDA generators for super-resolution (1) from 64x64 → 128x128; (2) from 64x64 → 256x256; (3) from 64x64 → 512x512
- You may find the preprocessed data-to-noise datasets here (to be released soon) for training; a conceptual sketch of how such image-noise pairs are built follows this list.
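Until the preprocessed datasets are released, the sketch below shows the general idea behind a data-to-noise pair: each training image is pushed backwards through the Stage 1 teacher's deterministic (DDIM-style) sampling ODE, and the resulting latent noise is stored together with the image as the generator's input. This is only an illustrative sketch under generic DDPM/DDIM assumptions; `teacher_eps`, the noise schedule, and the step count are placeholders rather than the repository's actual preprocessing code.

```python
import torch

def teacher_eps(x, t):
    """Placeholder for the Stage 1 teacher's noise prediction eps_theta(x_t, t)."""
    return torch.zeros_like(x)

T = 1000
betas = torch.linspace(1e-4, 2e-2, T)           # illustrative linear noise schedule
alpha_bars = torch.cumprod(1.0 - betas, dim=0)  # cumulative products of (1 - beta_t)

@torch.no_grad()
def ddim_invert(x0, num_steps=50):
    """Map an image x0 to latent noise xT by running the DDIM update from low to high noise."""
    ts = torch.linspace(0, T - 1, num_steps).long()
    x = x0
    for i in range(num_steps - 1):
        t_cur, t_next = ts[i], ts[i + 1]
        a_cur, a_next = alpha_bars[t_cur], alpha_bars[t_next]
        eps = teacher_eps(x, t_cur)
        x0_pred = (x - (1 - a_cur).sqrt() * eps) / a_cur.sqrt()  # predicted clean image
        x = a_next.sqrt() * x0_pred + (1 - a_next).sqrt() * eps  # move one step toward pure noise
    return x

image = torch.rand(1, 3, 64, 64) * 2 - 1  # stand-in for a normalized 64x64 training image
noise = ddim_invert(image)                # the "noise" half of the data-to-noise pair
```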
- For Stage 2 distillation, run `bash commands/res64to64.sh`.
- For Stage 3 super-resolution, run the scripts from `bash commands/64to128.sh` up to `bash commands/64to512.sh` sequentially (a minimal launcher sketch follows this list).
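For convenience, the Stage 3 scripts can be launched back to back with a small helper like the sketch below. Only `commands/64to128.sh` and `commands/64to512.sh` are named above; the intermediate `commands/64to256.sh` is assumed to follow the same naming pattern, so adjust the list to the scripts your checkout actually contains.

```python
import subprocess

# Sequential launcher for the Stage 3 super-resolution runs.
# NOTE: only 64to128.sh and 64to512.sh are named in this README;
# 64to256.sh is an assumed intermediate script following the same pattern.
stage3_scripts = [
    "commands/64to128.sh",
    "commands/64to256.sh",
    "commands/64to512.sh",
]
for script in stage3_scripts:
    subprocess.run(["bash", script], check=True)  # check=True stops on the first failure
```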
Please see `commands/sampling.sh` for detailed sampling commands.
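Since the distilled generator is one-step, sampling amounts to a single forward pass: latent noise (and a class label) in, an image out, with no iterative denoising loop. The sketch below only illustrates that interface with a dummy stand-in network; the real architecture, checkpoint loading, and post-processing are handled by the scripts referenced in `commands/sampling.sh`.

```python
import torch

class OneStepGenerator(torch.nn.Module):
    """Dummy stand-in for the PaGoDA generator (the real one is loaded from the released checkpoints)."""
    def __init__(self, channels=3):
        super().__init__()
        self.net = torch.nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, z, class_labels=None):
        # One forward pass: latent noise in, image out; no iterative refinement.
        return self.net(z)

generator = OneStepGenerator().eval()
z = torch.randn(4, 3, 64, 64)            # latent noise at the 64x64 base resolution
labels = torch.randint(0, 1000, (4,))    # ImageNet class labels
with torch.no_grad():
    images = generator(z, class_labels=labels)
print(images.shape)                      # torch.Size([4, 3, 64, 64])
```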
```bibtex
@article{kim2024pagoda,
  title={PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher},
  author={Kim, Dongjun and Lai, Chieh-Hsin and Liao, Wei-Hsiang and Takida, Yuhta and Murata, Naoki and Uesaka, Toshimitsu and Mitsufuji, Yuki and Ermon, Stefano},
  journal={arXiv preprint arXiv:2405.14822},
  year={2024}
}
```