Deep Learning Project's Template

🚀 Introduction

Welcome to our Deep Learning Project Template, crafted for researchers and developers working with PyTorch. This template is designed to streamline the setup, execution, and modification of deep learning experiments, allowing you to focus more on model development and less on boilerplate code.

✨ Features

Multi-GPU Support: Utilize the power of multiple GPUs or devices to accelerate your training using accelerate.
Flexible Configuration: Easily configure your experiments with the tyro configuration system, enabling easy to use and type validation.
Clear Architecture: Our template is structured for clarity and ease of use, ensuring you can understand and modify the code with minimal effort.
Transparent Training Process: Enjoy a clear display of the training process, helping you monitor performance and make necessary tweaks in real-time.
Using uv for better and faster package management: We adopt uv for better package management which is written in Rust.

📂 Folder Structure

Our project is organized as follows to help you navigate and manage the codebase effectively:

📦deep-learning-template
 ├── 📂configs                # Configuration files for experiments
 │   ├── 📄config_utils.py    # Utils for showning or saving configs
 │   └── 📄config.py          # Main configuraiton script
 ├── 📂configuration          # Configuration files for experiments
 │   ├── 📂cifar
 │   │   ├── cifar_big.json   # Configuration for a larger model (example)
 │   │   └── cifar_small.json # Configuration for a smaller model (example)
 ├── 📂dataset                # Modules for data handling
 │   └── 📄data_loader.py     # Data loader script
 ├── 📂modeling               # Neural network models and loss functions
 │   └── 📄model.py           # Example model file
 ├── 📂utils                  # Utility scripts for various tasks
 │   ├── 📄logger.py          # Logging utilities
 │   └── 📄metrics.py         # Performance metrics
 ├── 📂engine                 # Utility scripts for various tasks
 │   ├── 📄base_engine.py     # Base engine class for repeat tasks
 │   └── 📄engine.py          # Training functions here
 ├── 📄.gitignore             # Specifies intentionally untracked files to ignore
 ├── 📄LICENSE                # License file for the project
 ├── 📄README.md              # README file with project details
 ├── 📄linter.sh              # Shell script for formating the code
 ├── 📄requirements.txt       # Dependencies and libraries
 └── 📄main.py                # Starting point for training

⚙️ Configuration (requires update)

Configure your models and training setups with ease. Modify the config.py file to suit your experimental needs. Our system uses YACS, which allows for a hierarchical configuration with overrides for command-line options. The recommeneded structure we used:

# Basic setup of the project
cfg = CN()
cfg._BASE_ = None
cfg.PROJECT_DIR = None
cfg.PROJECT_LOG_WITH = ["tensorboard"]

# Control the modeling settings
cfg.MODEL = CN()
# ...

# Control the loss settings
cfg.LOSS = CN()
# ...

# Control the dataset settings (e.g., path)
cfg.DATA = CN()
# ...

# Control the training setup (e.g., lr, epoch)
cfg.TRAIN = CN()
# ...

# Control the training setup (e.g., batch size)
cfg.EVAL = CN()
# ...

🏋️‍♂️ Training (requires update)

Basic Usage

To start a training, run:

python engine.py --config configs/your_config.yaml

# Concrete example
python traing.py --config configs/cifar/cifar-small.yaml

After the training start, users can find the training folder called logs. To modify the default setting, please change the option log_dir. Followed by logs is the project_dir defined in the config file.

📦{LOG_DIR}/{PROJECT_DIR}
 ├── 📂checkpoint           # Folder for saving checkpoints 
 └── 📂...                  # Other files setup by tracker(s)

Override the config with command line

Users can override the options with the --opts flag. For instance, to resume the training:

python engine.py --config configs/your_config.yaml --opts TRAIN.RESUME_CHECKPOINT path/to/checkpoint

# Concrete example
python engine.py --config configs/cifar/cifar-small.yaml --opts TRAIN.RESUME_CHECKPOINT logs/cifar-small/checkpoint/best_model_epoch_10.pth

Please check the config setup section for more details.

Multi-GPU Training

This project template is made based on accelerate to provide multi-GPU training. A simple example to train a model with 2 GPUs:

accelerate launch --multi_gpu --num_processes=2 engine.py --config configs/your_config.json --opts (optional)

# Concrete example
accelerate launch --multi_gpu --num_processes=2 engine.py --config configs/cifar/cifar-small.json

Tracker

Trackers such as tensorboard and wandb can be setup from the project_log_with option. We support multiple trackers at once through accelerate! Users are encouraged to find our which is the best for the project from here. Below are some examples to open the local monitor:

# tensorboard
tensorboard --logdir logs

🛠 How to Add Your Code?

Integrating New Models: Place your model files in the modeling/ folder and update the configurations accordingly.
Adding New Datasets: Implement data handling in the dataset/ folder and reference it in your config files.
Utility Scripts: Enhance functionality by adding utility scripts in the utils/ folder.
Customized Training Process: Please change the engine/engine.py to modify the training process.

TODO

Support iteration based training with infinite loader.

🙌 Special Thanks

Thanks to the creators of:

Feel free to modify and adapt this README to better fit the specifics and details of your project.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Learning Project's Template

🚀 Introduction

✨ Features

📂 Folder Structure

⚙️ Configuration (requires update)

🏋️‍♂️ Training (requires update)

Basic Usage

Override the config with command line

Multi-GPU Training

Tracker

🛠 How to Add Your Code?

TODO

🙌 Special Thanks

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
configs		configs
configuration/cifar		configuration/cifar
dataset		dataset
engine		engine
modeling		modeling
utils		utils
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
linter.sh		linter.sh
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

Justin900429/deep-learning-template

Folders and files

Latest commit

History

Repository files navigation

Deep Learning Project's Template

🚀 Introduction

✨ Features

📂 Folder Structure

⚙️ Configuration (requires update)

🏋️‍♂️ Training (requires update)

Basic Usage

Override the config with command line

Multi-GPU Training

Tracker

🛠 How to Add Your Code?

TODO

🙌 Special Thanks

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages