- Scene Parsing through ADE20K Dataset
- Semantic Understanding of Scenes through ADE20K Dataset
- Official website
Challenges using ADE20K data:
Evaluation is usually done in the framework of competitions. For the target metric in both Scene Parsing and Places the authors use the average of Pixel Accuracy and Mean Intersection over Union.
The data comes from original ADE20K dataset
- 20 210 images in training set, 2000 images in validation set, 3000 images in testing set.
- 20 object classes and 434 826 object instances.
- Finer hierarchical labeling beyond 20 object classes: 150 stuff/object labels (e.g. wall, sky, tree) and 1038 image-level labels (e.g. airport terminal, bedroom, street)
- Varying spatial resolution: median image size - 307 200 pixels with median aspect ratio of 1.33.
Each folder contains images separated by scene category (same scene categories than the Places Database). For each image, the object and part segmentations are stored in two different png files. All object and part instances are annotated sparately.
For each image there are the following files:
Input RGB image.
RGB image with object segmentation mask. The mask has the following structure:
Parts segmentation mask, where N is a number (1,2,3,...) indicating the level in the part hierarchy. N=1 - parts of objects. As N can be > 1, parts can consist of parts too.
Text file describing the content of each image (describing objects and parts).
This information is redundant with other files. But in addition contains also information about object attributes.
The function loadAde20K.m
provided in official toolkit (see below) also parses the content of this file.
Each line in the text file contains six columns with the following content:
- Instance number
- Part level (0 for objects),
- Indicator of occlusion (1 for true)
- Class name (parsed using wordnet)
- Original raw name (might provide a more detailed categorization)
- Comma separated attributes list
Link | Description |
---|---|
Segmentation models for ADE20K (PyTorch) | PyTorch implementation of segmentation models and training routines |
MATLAB Development Kit | Evaluation/Visualization scripts |
Baselines from MIT Team | A bunch of pretrained PyTorch models |
Official MATLAB tools for loading | MATLAB scripts for loading and exploring the data |
A sample of images (row 1) with corresponding object-level (row 2) and parts-level (row 3) segmentations:
A web-based tool for browsing different data splits: