Skip to content

Latest commit

 

History

History
68 lines (44 loc) · 3.03 KB

README.md

File metadata and controls

68 lines (44 loc) · 3.03 KB

Varformer: Adapting VAR’s Generative Prior for Image Restoration

This repository contains the official implementation of Varformer.

Abstract

Generative models trained on extensive high-quality datasets effectively capture the structural and statistical properties of clean images, rendering them powerful priors for transforming degraded features into clean ones in image restoration. VAR, a novel image generative paradigm, surpasses diffusion models in generation quality by applying a next-scale prediction approach. It progressively captures both global structures and fine-grained details through the autoregressive process, consistent with the multi-scale restoration principle widely acknowledged in the restoration community. Furthermore, we observe that during the image reconstruction process utilizing VAR, scale predictions automatically modulate the input, facilitating the alignment of representations at subsequent scales with the distribution of clean images. To harness VAR’s adaptive distribution alignment capability in image restoration tasks, we formulate the multi-scale latent representations within VAR as the restoration prior, thus advancing our delicately designed VarFormer framework. The strategic application of these priors enables our VarFormer to achieve remarkable generalization on unseen tasks while also reducing training computational costs.

Teaser

Data preparing

we refer to DiffUIR, the other datasets you could download from AdaIR and DF2K

  • The data structure is like this:
    dataset
    ├── GoPro
    │  ├── train
    │  ├── test
    ├── LOL
    │  ├── our485
    │  ├── eval15
       ---
    

Stage 1

Download VAR into experiments/pretrained_models/


python -m torch.distributed.launch --nproc_per_node=gpu_num --master_port=4344 basicsr/pretrain.py -opt basicsr/options/VarFormer_train_stage1.yml --launcher pytorch

Stage 2

You can train based on stage 1 or just start from scratch. Modify the configuration file options/VarFormer_train_stage2.yml accordingly.

python -m torch.distributed.launch --nproc_per_node=gpu_num --master_port=4344 basicsr/train.py -opt basicsr/options/VarFormer_train_stage2.yml --launcher pytorch

Inference

Modify the configuration file options/test.yml accordingly.


python basicsr/varformer_test.py -opt basicsr/options/test.yml

Citation

@article{wang2024varformer,
  title={Varformer: Adapting VAR's Generative Prior for Image Restoration},
  author={Wang, Siyang and Zhao, Feng},
  journal={arXiv preprint arXiv:2412.21063},
  year={2024}
}

Acknowledgements

This code is built upon VAR and CodeFormer, thanks for their excellent work!