Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create README.md #37

Open
wants to merge 10 commits into
base: master
Choose a base branch
from
Open
Changes from 3 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
98 changes: 98 additions & 0 deletions mrcnn/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,98 @@
# Model Training

## Table of Contents

1. [Introduction](#introduction)
1. [Examples](#examples)
1. [Usage](#usage)


## Introduction


## Setup

Run data syntheszer module to generate the training datasets in the following folder structure:
pshivraj marked this conversation as resolved.
Show resolved Hide resolved

~~~~~~~
project
|-- mrcnn
|-- scipts
|-- config.py
|-- model.py
|-- train.py
|-- pre_process.py
|-- requirements.txt
|-- utils.py
|-- visualize.py
|-- Inference_notebook.ipynb
|-- utils.py

|--- mask_data
|-- id_map.json
|-- logs/
|-- mask_rcnn_coco.h5
|-- test_image
|-- train_image
~~~~~~~

## Training


### Pre-processing
Data pre-processing using pre_process.py to generate .h5 file for masks.


### Model and Training

```
pshivraj marked this conversation as resolved.
Show resolved Hide resolved
-- Modified Matterport's implementation of Mask-RCNN deep neural network for object instance segmentation.
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you replace all -- with *, these will appear as proper bullet points

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for this, updated the file now with bullets.

Model - Added two new methods to train just specific masks.
mrcnn_mask: Just mask layers
mask_heads: Mask layers or rpn/fpn
Multiclass- Pre_processed images and mask files accordingly to prepare for multi classification.
-- Increased maximum number of predicted objects since an image can contain 200 or more bottles/bags/boxes.
-- Increased POST_NMS_ROIS_TRAINING to get more region proposals during training.
-- Resized images and masks to 512x512.
-- Used Default anchor size as we do not expect small objects: RPN_ANCHOR_SCALES = (16, 32, 64, 128, 256)
-- Relied heavily on deep image augmentation due to small training set:
Random horizontal or vertical flips
Random 90 or -90 degrees rotation
Random rotations in the range of (-20, 20) degrees
Random scaling of image and mask scaling in the range (0.5, 2.0)
-- Used Resnet101 architecture as a backbone encoder.
-- Trained the model with Adam optimizer for 65 epochs:
-- 5 epochs of heads with learning rate 1e-4 (To speed up the training process)
-- 30 epochs with learning rate 1e-5
-- 30 epochs with learning rate 1e-6
-- changed mAP computation to be (0.5 - 0.8)
-- weighted mAP
-- weighted loss
LOSS_WEIGHTS = {
"rpn_class_loss": 20.,
"rpn_bbox_loss": 1.,
"mrcnn_class_loss": 10.,
"mrcnn_bbox_loss": 1.,
"mrcnn_mask_loss": 10.
}

```
pshivraj marked this conversation as resolved.
Show resolved Hide resolved

### Model Execution and Run-Time
Run python pre_process.py to pre-process data
pshivraj marked this conversation as resolved.
Show resolved Hide resolved

Run python train.py to train the model. Model weights are saved at ../data/logs/kaggle_bowl/mask_rcnn.h5.
pshivraj marked this conversation as resolved.
Show resolved Hide resolved

Run python inference_notebook.ipynb.py to evaluate model performance on test set


The following execution times are measured on Nvidia P100 GPUs provided by AWS Deep learning AMI

```
pshivraj marked this conversation as resolved.
Show resolved Hide resolved
Each training epoch takes about 25 minutes.
It takes about 18 hours to train the model from scratch.
```
pshivraj marked this conversation as resolved.
Show resolved Hide resolved

## Example model predictions

[put graphs from notebook]