Documentation:https://paddledetection.readthedocs.io

PaddleDetection

PaddleDetection is an end-to-end object detection development kit based on PaddlePaddle, which aims to help developers in the whole development of training models, optimizing performance and inference speed, and deploying models. PaddleDetection provides varied object detection architectures in modular design, and wealthy data augmentation methods, network components, loss functions, etc. PaddleDetection supported practical projects such as industrial quality inspection, remote sensing image object detection, and automatic inspection with its practical features such as model compression and multi-platform deployment.

Now all models in PaddleDetection require PaddlePaddle version 1.7 or higher, or suitable develop version.

Introduction

Features:

Rich models:

PaddleDetection provides rich of models, including 100+ pre-trained models such as object detection, instance segmentation, face detection etc. It covers the champion models, the practical detection models for cloud and edge device.
Production Ready:

Key operations are implemented in C++ and CUDA, together with PaddlePaddle's highly efficient inference engine, enables easy deployment in server environments.
Highly Flexible:

Components are designed to be modular. Model architectures, as well as data preprocess pipelines, can be easily customized with simple configuration changes.
Performance Optimized:

With the help of the underlying PaddlePaddle framework, faster training and reduced GPU memory footprint is achieved. Notably, YOLOv3 training is much faster compared to other frameworks. Another example is Mask-RCNN (ResNet50), we managed to fit up to 4 images per GPU (Tesla V100 16GB) during multi-GPU training.

Supported Architectures:

	ResNet	ResNet-vd ¹	ResNeXt-vd	SENet	MobileNet	HRNet	Res2Net
Faster R-CNN	✓	✓	x	✓	✗	✗	✗
Faster R-CNN + FPN	✓	✓	✓	✓	✗	✓	✓
Mask R-CNN	✓	✓	x	✓	✗	✗	✗
Mask R-CNN + FPN	✓	✓	✓	✓	✗	✗	✓
Cascade Faster-RCNN	✓	✓	✓	✗	✗	✗	✗
Cascade Mask-RCNN	✓	✗	✗	✓	✗	✗	✗
Libra R-CNN	✗	✓	✗	✗	✗	✗	✗
RetinaNet	✓	✗	✗	✗	✗	✗	✗
YOLOv3	✓	✗	✗	✗	✓	✗	✗
SSD	✗	✗	✗	✗	✓	✗	✗
BlazeFace	✗	✗	✗	✗	✗	✗	✗
Faceboxes	✗	✗	✗	✗	✗	✗	✗

[1] ResNet-vd models offer much improved accuracy with negligible performance cost.

More models:

EfficientDet
FCOS
CornerNet-Squeeze
YOLOv4

More Backbones:

DarkNet
VGG
GCNet
CBNet

Advanced Features:

Synchronized Batch Norm
Group Norm
Modulated Deformable Convolution
Deformable PSRoI Pooling
Non-local and GCNet

NOTE: Synchronized batch normalization can only be used on multiple GPU devices, can not be used on CPU devices or single GPU device.

The following is the relationship between COCO mAP and FPS on Tesla V100 of representative models of each architectures and backbones.

NOTE:

CBResNet stands for Cascade-Faster-RCNN-CBResNet200vd-FPN, which has highest mAP on COCO as 53.3% in PaddleDetection models
Cascade-Faster-RCNN stands for Cascade-Faster-RCNN-ResNet50vd-DCN, which has been optimized to 20 FPS inference speed when COCO mAP as 47.8%
The enhanced YOLOv3-ResNet50vd-DCN is 10.6 absolute percentage points higher than paper on COCO mAP, and inference speed is nearly 70% faster than the darknet framework
All these models can be get in Model Zoo

Tutorials

Get Started

Installation guide
Quick start on small dataset
Train/Evaluation/Inference
FAQ

Advanced Tutorial

Guide to preprocess pipeline and custom dataset
Models technical
Transfer learning document
Parameter configuration:
- Introduction to the configuration workflow
- Parameter configuration for RCNN model
IPython Notebook demo
Model compression
Deployment

Model Zoo

Pretrained models are available in the PaddleDetection model zoo.
Mobile models
Anchor free models
Face detection models
Pretrained models for pedestrian detection
Pretrained models for vehicle detection
YOLOv3 enhanced model: Compared to MAP of 33.0% in paper, enhanced YOLOv3 reaches the MAP of 43.6%, and inference speed is improved as well
Objects365 2019 Challenge champion model
Best single model of Open Images 2019-Object Detction
Practical Server-side detection method: Inference speed on single V100 GPU can reach 20FPS when COCO mAP is 47.8%.

License

PaddleDetection is released under the Apache 2.0 license.

Updates

v0.3.0 was released at 05/2020, add anchor-free, EfficientDet, YOLOv4, etc. Launched mobile and server-side practical and efficient multiple models. For example, the YOLOv3-MobileNetv3 mobile side model is accelerated 3.5 times, the server side has optimized the two-stage model, and the speed and accuracy have high cost performance. We also refactored predictive deployment functions, and improved ease of use, fix many known bugs, etc. Please refer to 版本更新文档 for details.

Contributing

Contributions are highly welcomed and we would really appreciate your feedback!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README_en.md

README_en.md

PaddleDetection

Introduction

Tutorials

Get Started

Advanced Tutorial

Model Zoo

License

Updates

Contributing

Files

README_en.md

Latest commit

History

README_en.md

File metadata and controls

PaddleDetection

Introduction

Tutorials

Get Started

Advanced Tutorial

Model Zoo

License

Updates

Contributing