Footprints and Free Space from a Single Color Image

Jamie Watson, Michael Firman, Aron Monszpart and Gabriel J. Brostow – CVPR 2020 (Oral presentation)

We introduce Footprints, a method for estimating the visible and hidden traversable space from a single RGB image

Understanding the shape of a scene from a single color image is a formidable computer vision task. Most methods aim to predict the geometry of surfaces that are visible to the camera, which is of limited use when planning paths for robots or augmented reality agents. Models which predict beyond the line of sight often parameterize the scene with voxels or meshes, which can be expensive to use in machine learning frameworks.

Our method predicts the hidden ground geometry and extent from a single image:

Our predictions enable virtual characters to more realistically explore their environment.


Baseline: The virtual character can only explore the ground visible to the camera	Ours: The penguin can explore both the visible and hidden ground

⚙️ Setup

Our code and models were developed with PyTorch 1.3.1. The environment.yml and requirements.txt list our dependencies.

We recommend installing and activating a new conda environment from these files with:

conda env create -f environment.yml -n footprints
conda activate footprints

🖼️ Prediction

We provide three pretrained models:

kitti, a model trained on the KITTI driving dataset with a resolution of 192x640,
matterport, a model trained on the indoor Matterport dataset with a resolution of 512x640, and
handheld, a model trained on our own handheld stereo footage with a resolution of 256x448.

We provide code to make predictions for a single image, or a whole folder of images, using any of these pretrained models. Models will be automatically downloaded when required, and input images will be automatically resized to the correct input resolution for each model.

Single image prediction:

python -m footprints.predict --image test_data/cyclist.jpg --model kitti

Multi image prediction:

python -m footprints.predict --image test_data --model handheld

By default, .npy predictions and .jpg visualisations will be saved to the predictions folder; this can be changed with the --save_dir flag.

Training code is coming soon

Method and further results

We learn from stereo video sequences, using camera poses, per-frame depth and semantic segmentation to form training data, which is used to supervise an image-to-image network.

More results on the KITTI dataset:

✏️ 📄 Citation

If you find our work useful or interesting, please consider citing our paper:

@inproceedings{watson-2020-footprints,
 title   = {Footprints and Free Space from a Single Color Image},
 author  = {Jamie Watson and
            Michael Firman and
            Aron Monszpart and
            Gabriel J. Brostow},
 booktitle = {Computer Vision and Pattern Recognition ({CVPR})},
 year = {2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
footprints		footprints
readme_ims		readme_ims
test_data		test_data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Footprints and Free Space from a Single Color Image

⚙️ Setup

🖼️ Prediction

Method and further results

✏️ 📄 Citation

👩‍⚖️ License

About

Releases

Packages

Languages

License

amonszpart/footprints

Folders and files

Latest commit

History

Repository files navigation

Footprints and Free Space from a Single Color Image

⚙️ Setup

🖼️ Prediction

Method and further results

✏️ 📄 Citation

👩‍⚖️ License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages