Real-Time Dense Monocular SLAM With Online Adapted Depth Prediction Network

被引:34
|
作者
Luo, Hongcheng [1 ]
Gao, Yang [1 ]
Wu, Yuhao [1 ]
Liao, Chunyuan [2 ]
Yang, Xin [1 ]
Cheng, Kwang-Ting [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
[2] HiScene Informat Technol Co Ltd, Pudong 201210, Peoples R China
[3] Hong Kong Univ Sci & Technol, Sch Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular SLAM; dense mapping; convolutional neural network; fusion; online tuning; ACCURATE;
D O I
10.1109/TMM.2018.2859034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Considerable advances have been achieved in estimating the depth map from a single image via convolutional neural networks (CNNs) during the past few years. Combining depth prediction from CNNs with conventional monocular simultaneous localization and mapping (SLAM) is promising for accurate and dense monocular reconstruction, in particular addressing the two long-standing challenges in conventional monocular SLAM: low map completeness and scale ambiguity. However, depth estimated by pretrained CNNs usually fails to achieve sufficient accuracy for environments of different types from the training data, which are common for certain applications such as obstacle avoidance of drones in unknown scenes. Additionally, inaccurate depth prediction of CNN could yield large tracking errors in monocular SLAM. In this paper, we present a real-time dense monocular SLAM system, which effectively fuses direct monocular SLAM with an online-adapted depth prediction network for achieving accurate depth prediction of scenes of different types from the training data and providing absolute scale information for tracking and mapping. Specifically, on one hand, tracking pose (i.e., translation and rotation) from direct SLAM is used for selecting a small set of highly effective and reliable training images, which acts as ground truth for tuning the depth prediction network on-the-fly toward better generalization ability for scenes of different types. A stage-wise Stochastic Gradient Descent algorithm with a selective update strategy is introduced for efficient convergence of the tuning process. On the other hand, the dense map produced by the adapted network is applied to address scale ambiguity of direct monocular SLAM which in turn improves the accuracy of both tracking and overall reconstruction. The system with assistance of both CPUs and GPUs, can achieve real-time performance with progressively improved reconstruction accuracy. Experimental results on public datasets and live application to obstacle avoidance of drones demonstrate that our method outperforms the state-of-the-art methods with greater map completeness and accuracy, and a smaller tracking error.
引用
收藏
页码:470 / 483
页数:14
相关论文
共 50 条
  • [31] Real-time surface of revolution reconstruction on dense SLAM
    Yang, Liming
    Uchiyama, Hideaki
    Normand, Jean-Marie
    Moreau, Guillaume
    Nagahara, Hajime
    Taniguchi, Rin-ichiro
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 28 - 36
  • [32] Real-time dense map fusion for stereo SLAM
    Pire, Taihu
    Baravalle, Rodrigo
    D'Alessandro, Ariel
    Civera, Javier
    ROBOTICA, 2018, 36 (10) : 1510 - 1526
  • [33] A real-time semi-dense depth-guided depth completion network
    JieJie Xu
    Yisheng Zhu
    Wenqing Wang
    Guangcan Liu
    The Visual Computer, 2024, 40 : 87 - 97
  • [34] Real-time Monocular Dense Mapping for Augmented Reality
    Xue, Tangli
    Luo, Hongcheng
    Cheng, Danpeng
    Yuan, Zikang
    Yang, Xin
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 510 - 518
  • [35] A real-time semi-dense depth-guided depth completion network
    Xu, JieJie
    Zhu, Yisheng
    Wang, Wenqing
    Liu, Guangcan
    VISUAL COMPUTER, 2024, 40 (01): : 87 - 97
  • [36] HFNet-SLAM: An Accurate and Real-Time Monocular SLAM System with Deep Features
    Liu, Liming
    Aitken, Jonathan M.
    SENSORS, 2023, 23 (04)
  • [37] Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network
    Yang, Xin
    Chen, Jingyu
    Wang, Zhiwei
    Zhang, Qiaozhe
    Liu, Wenyu
    Liao, Chunyuan
    Cheng, Kwang-Ting
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 896 - 904
  • [38] Parsimonious Real Time Monocular SLAM
    Bresson, Guillaume
    Feraud, Thomas
    Aufrere, Romuald
    Checchin, Paul
    Chapuis, Roland
    2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2012, : 511 - 516
  • [39] RD-SLAM: Real-Time Dense SLAM Using Gaussian Splatting
    Guo, Chaoyang
    Gao, Chunyan
    Bai, Yiyang
    Lv, Xiaoling
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [40] Real-time Monocular Depth Estimation with Extremely Light-Weight Neural Network
    Chiu, Mian-Jhong
    Chiu, Wei-Chen
    Chen, Hua-Tsung
    Chuang, Jen-Hui
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7050 - 7057