This repository contains the code for the paper ["Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss"](https://arxiv.org/abs/2403.16728).
(NOTE: only the text2image experiment code is present in this repo.)
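In short, the method replaces the usual L2 diffusion training loss with a pseudo-Huber loss whose transition parameter follows a schedule over the diffusion timesteps, so training stays close to L2 on clean data while bounding the influence of corrupted samples. Below is a minimal PyTorch sketch of the idea (the function name, the exponential schedule shape, and the constants are illustrative assumptions; see the paper and the scripts for the exact formulation):

```python
import torch

def scheduled_pseudo_huber_loss(pred, target, timesteps, num_train_timesteps, delta_0=0.03):
    # Schedule the pseudo-Huber parameter delta over timesteps
    # (the exponential decay here is an assumption, not necessarily the paper's schedule).
    delta = delta_0 * torch.exp(-timesteps.float() / num_train_timesteps)
    # Broadcast delta over the non-batch dimensions of `pred`.
    delta = delta.view(-1, *([1] * (pred.dim() - 1)))
    # Pseudo-Huber loss: delta^2 * (sqrt(1 + ((pred - target) / delta)^2) - 1).
    # It behaves like L2 for small residuals and like a scaled L1 for large ones (outliers).
    loss = delta**2 * (torch.sqrt(1.0 + ((pred - target) / delta) ** 2) - 1.0)
    return loss.mean()
```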
The content is composed of two parts:

- Ready-to-use Diffusers scripts in the `diffusers_scripts` folder. (They are going to be pushed to the Diffusers library itself soon, after which it will be handier to use those instead. Track huggingface/diffusers#7488.)
- Code for mass training sweeps, statistics collection, and analysis, in case you'd like to replicate the results.
Proceed to `diffusers_scripts` and launch the desired training script following the same instructions as in http://github.com/huggingface/diffusers/examples/. Don't forget to specify `loss_type` in the training arguments!
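For example, a run might look like the following sketch (the script name, model, paths, hyperparameters, and the `--loss_type` value are placeholders; check the chosen script's `--help` for the actual argument choices):

```bash
cd diffusers_scripts
accelerate launch train_text_to_image.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5" \
  --train_data_dir="./datasets/my_concept" \
  --resolution=512 \
  --train_batch_size=1 \
  --learning_rate=1e-05 \
  --max_train_steps=1000 \
  --output_dir="./output" \
  --loss_type="huber_scheduled"  # placeholder value; see the script's --help
```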
Install the requirements:

```bash
pip install -r requirements.txt
```
Make a `concepts` folder and put in it any number of subfolders, each containing images of the same concept (clean datasets). You can also include a folder of random pictures and exclude it from the dataset once it's formed.
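For illustration, the layout could look like this (all folder names here are arbitrary examples):

```
concepts/
├── cat/        # images of one concept
├── dog/        # images of another concept
└── random/     # optional random pictures, to be excluded later
```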
Run `dataset_composer.py` (see its argparse args); by default it will create a `datasets` folder with the results.
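Since the exact flags are defined in the script itself, it is safest to inspect them first (only `--help`, provided by argparse, is guaranteed here):

```bash
python dataset_composer.py --help   # list the available options
python dataset_composer.py          # with defaults, writes the `datasets` folder
```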
Run `script.sh`.
!!! If you would like to receive a Telegram message when each job completes, remember to log in to `telegram-send` before you start! Depending on your GPU, this can take hours or days!
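The one-time setup could look like this (assuming the standard `telegram-send` package from PyPI, whose documented setup flag is `--configure`):

```bash
pip install telegram-send
telegram-send --configure   # one-time login so script.sh can message you
```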
Once it's completed, you will see a folder named `stats` (by default).
Then you can use `analyzer.ipynb` to parse the stats, analyze them, and make the plots.
PM me on Discord or email me if you have any questions.
```bibtex
@misc{khrapov2024improving,
      title={Improving Diffusion Models's Data-Corruption Resistance using Scheduled Pseudo-Huber Loss},
      author={Artem Khrapov and Vadim Popov and Tasnima Sadekova and Assel Yermekova and Mikhail Kudinov},
      year={2024},
      eprint={2403.16728},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}
```