[PaddlePaddle Hackathon] Task 70: Add DALI GPU processing for PP-YOLO training #4546

gbstack · 2021-11-11T01:44:28Z

PR types

New features

PR changes

APIs

Describe

Hi,

This PR adds DALI processing for PP-YOLO training according to Task #4221

Changing COCODataSet to DALICOCODataSet in configuration file configs/datasets/coco_detection.yml will enable DALI preprocessing.

e.g.

TrainDataset:
  !DALICOCODataSet
    image_dir: train2017
    anno_path: annotations/instances_train2017.json
    dataset_dir: /dataset/coco2017
    data_fields: ['image', 'gt_bbox', 'gt_class', 'is_crowd']

And the training command is same as before

python tools/train.py --config configs/ppyolo/ppyolo_r50vd_dcn_1x_coco.yml

Thanks,

…esize, flip, expand and crop operations

… removing requirement of all samples shape in one batch need be same)

… setting resize size from outer code

…ar import

…RandomFlip, RandomDistort, NormalizeImage and Permute operations

Set use_dali for all batch_transforms. Apply batch transforms after loading data from DALI pipeline. Convert image loaded from DALI pipeline to paddle Tensor.

lyuwenyu · 2021-11-11T05:31:57Z

训练速度这块有测过对比嘛 w/ vs. w/o

gbstack · 2021-11-12T08:50:37Z

训练速度这块有测过对比嘛 w/ vs. w/o

抱歉，刚看到消息。。

我的测试环境是这样的

CPU: Intel i5-7400
GPU: Nvidia 1080 Ti

w/o DALI
bs2  9-10 images/s
bs4  12-13 images/s
bs6  13-14 images/s cpu 60% load average: 3.18, 3.26, 3.13

w/ DALI
bs2  6-7 images/s
bs4  8-9 images/s
bs6  9-10 images/s cpu 25% load average: 1.97, 2.96, 2.99

在batch size 6的时候, 使用DALI时GPU显存占用比不使用DALI多了大约2G，再继续提高batch size就显存不足了。。

根据上面的信息，估计到batch size>10，cpu可能就会满载了（batch size为6时，GPU还没有满载）

lyuwenyu · 2021-11-16T04:01:05Z

根据上面的信息，估计到batch size>10，cpu可能就会满载了（batch size为6时，GPU还没有满载）

这个结果的意思是加了DALI变慢了嘛😂

gbstack · 2021-11-16T05:13:37Z

根据上面的信息，估计到batch size>10，cpu可能就会满载了（batch size为6时，GPU还没有满载）

这个结果的意思是加了DALI变慢了嘛joy

就是batch size变大使用DALI可能会提高速度，我试试看能不能在AI Studio上运行吧，用32G的显存。我本地的显存12G使用DALI时，batch size最大只能到6..

paddle-bot · 2024-02-06T06:42:05Z

Automatically closed by Paddle-bot.

gbstack added 9 commits November 10, 2021 13:45

add DALICOCOIterator, COCOPipeline. add feed function generator for r…

ae20e1e

…esize, flip, expand and crop operations

add dali_default_collate_fn (nearly same as default_collate_fn except…

b0d3f9e

… removing requirement of all samples shape in one batch need be same)

add DALICOCODataSet

b9bed85

add missing import

b639eb0

add use_dali parameter for BatchRandomResize and Gt2YoloTarget. allow…

adaaa65

… setting resize size from outer code

extract Compose and BatchCompose into separate file to prevent circul…

93d1337

…ar import

allow using dali for NormalizeBox, RandomCrop, RandomExpand, Resize, …

a4518ef

…RandomFlip, RandomDistort, NormalizeImage and Permute operations

Read resize option from DALI pipeline.

334ec68

Set use_dali for all batch_transforms. Apply batch transforms after loading data from DALI pipeline. Convert image loaded from DALI pipeline to paddle Tensor.

use DALI for data preprocessing if dataset is DALICOCODataSet

f9d9bfe

gbstack mentioned this pull request Nov 11, 2021

【PaddlePaddle Hackathon】任务总览 PaddlePaddle/Paddle#35940

Closed

qingqing01 added the PaddlePaddle Hackathon label Nov 12, 2021

paddle-bot bot closed this Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PaddlePaddle Hackathon] Task 70: Add DALI GPU processing for PP-YOLO training #4546

[PaddlePaddle Hackathon] Task 70: Add DALI GPU processing for PP-YOLO training #4546

gbstack commented Nov 11, 2021

lyuwenyu commented Nov 11, 2021

gbstack commented Nov 12, 2021

lyuwenyu commented Nov 16, 2021

gbstack commented Nov 16, 2021

paddle-bot bot commented Feb 6, 2024

[PaddlePaddle Hackathon] Task 70: Add DALI GPU processing for PP-YOLO training #4546

[PaddlePaddle Hackathon] Task 70: Add DALI GPU processing for PP-YOLO training #4546

Conversation

gbstack commented Nov 11, 2021

lyuwenyu commented Nov 11, 2021

gbstack commented Nov 12, 2021

lyuwenyu commented Nov 16, 2021

gbstack commented Nov 16, 2021

paddle-bot bot commented Feb 6, 2024