We present Mono-STAR, the first real-time 3D reconstruction system that simultaneously supports semantic fusion, fast motion tracking, non-rigid object deformation, and topological change under a unified framework. The proposed system solves a new optimization problem that incorporates optical-flow-based 2D constraints to deal with fast motion, and a novel semantic-aware deformation graph (SAD-graph) for handling topology change. We test the proposed system under various challenging scenes and demonstrate that it significantly outperforms existing state-of-the-art systems.
Overview of the proposed system. The system runs in two parallel threads, one for measurement and one for geometry. In each time-step
Color | Graph | Semantic |
---|---|---|
geometry_color.mp4 |
geometry_graph.mp4 |
geometry_seg.mp4 |
Color | Graph | Semantic |
---|---|---|
geometry_color.mp4 |
geometry_graph.mp4 |
geometry_seg.mp4 |
Color | Geometry | Semantic |
---|---|---|
geometry_color.mp4 |
geometry_normal.mp4 |
geometry_seg.mp4 |
Color | Geometry | Semantic |
---|---|---|
geometry_color.mp4 |
geometry_normal.mp4 |
geometry_seg.mp4 |
Color | Geometry | Semantic |
---|---|---|
geometry_color.mp4 |
geometry_normal.mp4 |
geometry_seg.mp4 |
Color | Geometry | Graph |
---|---|---|
geometry_color.mp4 |
geometry_normal.mp4 |
geometry_graph.mp4 |
Color | Geometry | Graph |
---|---|---|
geometry_color.mp4 |
geometry_normal.mp4 |
geometry_graph.mp4 |
We presented Mono-STAR, a single-view solution for the semantic-aware STAR problem. Mono-STAR uses a novel semantic-aware and adaptive deformation graph for simultaneous tracking and reconstruction, and can handle topology changes as well as semantic fusion. Experiments show that Mono-STAR achieves promising results in non-rigid object reconstruction, while resisting to semantic segmentation errors, and capturing fast motions on various challenging scenes. We believe that this system can inspire and boost more future research on imitation learning, dexterous manipulation, and many other relevant robotics problems.
For source code, please check https://github.com/changhaonan/StarHub. Code is only checked under Ubuntu20.04.
@article{chang2023mono,
title={Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction},
author={Chang, Haonan and Ramesh, Dhruv Metha and Geng, Shijie and Gan, Yuqiu and Boularias, Abdeslam},
journal={arXiv preprint arXiv:2301.13244},
year={2023}
}