This repo contains the code for our RSS2020 paper, OverlapNet.
OverlapNet is modified Siamese Network that predicts the overlap and relative yaw angle of a pair of range images generated by 3D LiDAR scans.
Developed by Xieyuanli Chen and Thomas Läbe.
Pipeline overview of OverlapNet.
- Introduction
- Publication
- Logs
- Dependencies
- How to use
- Application
- License
If you use our implementation in your academic work, please cite the corresponding paper (PDF):
@inproceedings{chen2020rss,
author = {X. Chen and T. L\"abe and A. Milioto and T. R\"ohling and O. Vysotska and A. Haag and J. Behley and C. Stachniss},
title = {{OverlapNet: Loop Closing for LiDAR-based SLAM}},
booktitle = {Proceedings of Robotics: Science and Systems (RSS)},
year = {2020}
}
The extended journal version of OverlapNet is here (PDF):
@article{chen2021auro,
author = {X. Chen and T. L\"abe and A. Milioto and T. R\"ohling and J. Behley and C. Stachniss},
title = {{OverlapNet: A Siamese Network for Computing LiDAR Scan Similarity with Applications to Loop Closing and Localization}},
journal = {{Autonomous Robots}},
year = {2021},
doi = {10.1007/s10514-021-09999-0},
issn = {1573-7527},
volume={46},
pages={61--81},
url = {http://www.ipb.uni-bonn.de/pdfs/chen2021auro.pdf}
}
Based on the OverlapTransformer, we propose a novel transformer exploiting sequential LiDAR data to achieve better LiDAR place recognition (code).
We develop a novel lightweight neural network called OverlapTransformer exploiting transformer and achieve fast execution with less than 4 ms per frame in estimating LiDAR scans similarity.
- We thank Junyi Ma to create the PyTorch version of OverlapNet (delta-head-only), which can be found in the pytorch version branch.
- Added a method to the Infer class for inference with multiple frames versus multiple frames.
- Updated TensorFlow version in dependencies.
- Fixed bugs in generating ground truth overlap and yaw.
- Added an application and a link to our overlap-based MCL implementation.
Open source initial submission
We are using standalone keras with a tensorflow backend as a library for neural networks.
In order to do training and testing on a whole dataset, you need a nvidia GPU. The demos still are fast enough when using the neural network on CPU.
To use a GPU, first you need to install the nvidia driver and CUDA, so have fun!
-
CUDA Installation guide: link
-
System dependencies:
sudo apt-get update sudo apt-get install -y python3-pip python3-tk sudo -H pip3 install --upgrade pip
-
Python dependencies (may also work with different versions than mentioned in the requirements file)
sudo -H pip3 install -r requirements.txt
This repository contains the neural network for doing a detection of loop closing candidates.
For a complete pipline for online LiDAR preprocessing, one could find the fast implementation in our SuMa++.
In this repository, we provide demos to show the functionaly. Additional to that, we explain how to train a model.
To try demo 1, you could directly run the script with one command line:
python3 demo/demo1_gen_data.py
The generated data are stored in /data/preprocess_data
, and you will get a visualization like this:
For generating range and normal images, we have also a much faster implementation in C (with python interface) available
in our repo https://github.com/PRBonn/overlap_localization (look at src/prepare_training
).
To run demo 2, you need to first download the pre-trained model.
Then, you should
- either copy it into the default location folder
data
or - you need to modify the
pretrained_weightsfilename
in the config file/config/network.yml
accordingly.
Run the second demo script with one command line:
python3 demo/demo2_infer.py
You will get a visualization like this:
To run demo 3, you need to first download several data:
- pre-trained model,
- KITTI odometry data, where we also provide the covariance information generated from the SLAM,
- pre-processed data.
If you follow the recommended data structure below, you extract the downloaded data into the folder data
.
Otherwise, you need to specify the paths of data in both config/network.yml
and config/demo.yml
accordingly,
and then run the third demo script with one command line:
python3 demo/demo3_lcd.py
You will get an animation like this:
To run demo 4, you need only the raw KITTI odometry data. We are using the same setup as in demo 3.
Run the fourth demo script with one command line:
python3 demo/demo4_gen_gt_files.py
You will generated the ground truth data in data/preprocess_data_demo/ground_truth
and get a plot like this:
The colors represent the ground truth overlap value of each frame with respect to the given current frame which is located at (0.0, 0.0).
For a quick test of the training and testing procedures, you could use our preprocessed data as used in demo3.
We only provide the geometric-based preprocessed data. But it will also be possible to generate other inputs (semantics, intensity) by yourself.
A simple example to generate different types of data from LiDAR scan is given in demo1.
For 3D LiDAR semantic segmentation, we provide a fast c++ inferring library rangenetlib.
For training a new model with OverlapNet, you need to first generate preprocessed data and ground truth overlap and yaw angle which you could find examples in demo1 and demo4.
The recommended data structure is as follows:
data
├── 07
│ ├── calib.txt
│ ├── covariance.txt
│ ├── poses.txt
│ ├── depth
│ │ ├── 000000.npy
│ │ ├── 000001.npy
│ │ └── ...
│ ├── normal
│ │ ├── 000000.npy
│ │ ├── 000001.npy
│ │ └── ...
│ ├── velodyne
│ │ ├── 000000.bin
│ │ ├── 000001.bin
│ │ └── ...
│ └── ground_truth
│ ├── ground_truth_overlap_yaw.npz
│ ├── test_set.npz
│ └── train_set.npz
└── model_geo.weight
The code for training can be found in src/two_heads/training.py
.
If you download our preprocessed data, please put the data into the folder data
.
If you want to use another directory, please change the parameter data_root_folder
in
the configuration file network.yml
.
Notice that default weight file is set in the configuration file with
parameter pretrained_weightsfilename
.
If you want to train a completely new model from scratch, leave this parameter empty.
Otherwise you will fine-tune the provided model.
Then you can start the training with
python3 src/two_heads/training.py config/network.yml
All configuration data is in the yml file. You will find path definitions and training parameters there. The main path settings are:
experiments_path
: the folder where all the training data and results (log files, tensorboard logs, network weights) will be saved. Default is/tmp
. Change this according to your needsdata_root_folder
: the dataset folder. Is should contain the sequence folders of the dataset e.g. as00
,01
, ..., For the provided preproecessed data, it should be07
.
We provide tensorboard logs in experiment_path/testname/tblog
for visualizing training and validation details.
Once a model has been trained (thus a file .weight
with the network weights is available), the performance of the network
can be evaluated. Therefore you can start the testing script in the same manner as the training with the testing script:
python3 src/two_heads/testing.py config/network.yml
The configuration file should have the following additional settings:
pretrained_weightsfilename
: the weight filename mentioned as parametertesting_seqs
: sequences to test on, e.g.00 01
. (Please comment outtraining_seqs
.) The pairs where the tests are computed are in the fileground_truth/ground_truth_overlap_yaw.npz
. If one still uses the parametertraining_seqs
, the validation is done on the test sets of the sequences (ground_truth/validation_set.npz
) which contain only a small amount of data used for validation during training.
Note that: the provided pre-trained model and preprocessed ground truth are with the constraint that the current frame only finds loop closures in the previous frames.
This repo contains the code for our IROS2020 paper: Learning an Overlap-based Observation Model for 3D LiDAR Localization.
It uses the OverlapNet to train an observation model for Monte Carlo Localization and achieves global localization with 3D LiDAR scans.
Copyright 2020, Xieyuanli Chen, Thomas Läbe, Cyrill Stachniss, Photogrammetry and Robotics Lab, University of Bonn.
This project is free software made available under the MIT License. For details see the LICENSE file.