Real-Time Dense Monocular SLAM With Online Adapted Depth Prediction Network

被引:34
|
作者
Luo, Hongcheng [1 ]
Gao, Yang [1 ]
Wu, Yuhao [1 ]
Liao, Chunyuan [2 ]
Yang, Xin [1 ]
Cheng, Kwang-Ting [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
[2] HiScene Informat Technol Co Ltd, Pudong 201210, Peoples R China
[3] Hong Kong Univ Sci & Technol, Sch Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Monocular SLAM; dense mapping; convolutional neural network; fusion; online tuning; ACCURATE;
D O I
10.1109/TMM.2018.2859034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Considerable advances have been achieved in estimating the depth map from a single image via convolutional neural networks (CNNs) during the past few years. Combining depth prediction from CNNs with conventional monocular simultaneous localization and mapping (SLAM) is promising for accurate and dense monocular reconstruction, in particular addressing the two long-standing challenges in conventional monocular SLAM: low map completeness and scale ambiguity. However, depth estimated by pretrained CNNs usually fails to achieve sufficient accuracy for environments of different types from the training data, which are common for certain applications such as obstacle avoidance of drones in unknown scenes. Additionally, inaccurate depth prediction of CNN could yield large tracking errors in monocular SLAM. In this paper, we present a real-time dense monocular SLAM system, which effectively fuses direct monocular SLAM with an online-adapted depth prediction network for achieving accurate depth prediction of scenes of different types from the training data and providing absolute scale information for tracking and mapping. Specifically, on one hand, tracking pose (i.e., translation and rotation) from direct SLAM is used for selecting a small set of highly effective and reliable training images, which acts as ground truth for tuning the depth prediction network on-the-fly toward better generalization ability for scenes of different types. A stage-wise Stochastic Gradient Descent algorithm with a selective update strategy is introduced for efficient convergence of the tuning process. On the other hand, the dense map produced by the adapted network is applied to address scale ambiguity of direct monocular SLAM which in turn improves the accuracy of both tracking and overall reconstruction. The system with assistance of both CPUs and GPUs, can achieve real-time performance with progressively improved reconstruction accuracy. Experimental results on public datasets and live application to obstacle avoidance of drones demonstrate that our method outperforms the state-of-the-art methods with greater map completeness and accuracy, and a smaller tracking error.
引用
收藏
页码:470 / 483
页数:14
相关论文
共 50 条
  • [41] Real-Time Monocular Object-Model Aware Sparse SLAM
    Hosseinzadeh, Mehdi
    Li, Kejie
    Latif, Yasir
    Reid, Ian
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7123 - 7129
  • [42] EpiDepth: A Real-Time Monocular Dense-Depth Estimation Pipeline Using Generic Image Rectification
    Camaioni, Raub
    Luke, Robert H.
    Buck, Andrew
    Anderson, Derek T.
    GEOSPATIAL INFORMATICS XII, 2022, 12099
  • [43] DRM-SLAM: Towards dense reconstruction of monocular SLAM with scene depth fusion
    Ye, Xinchen
    Ji, Xiang
    Sun, Baoli
    Chen, Shenglun
    Wang, Zhihui
    Li, Haojie
    NEUROCOMPUTING, 2020, 396 (396) : 76 - 91
  • [44] ORBFusion: Real-time and Accurate dense SLAM at large scale
    Dai, Juting
    Tang, Xinyi
    Oppermann, Leif
    ADJUNCT PROCEEDINGS OF THE 2017 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT), 2017, : 124 - 129
  • [45] ElasticFusion: Real-time dense SLAM and light source estimation
    Whelan, Thomas
    Salas-Moreno, Renato F.
    Glocker, Ben
    Davison, Andrew J.
    Leutenegger, Stefan
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2016, 35 (14): : 1697 - 1716
  • [46] Real-Time Dense Visual SLAM with Neural Factor Representation
    Wei, Weifeng
    Wang, Jie
    Xie, Xiaolong
    Liu, Jie
    Su, Pengxiang
    ELECTRONICS, 2024, 13 (16)
  • [47] Quadtree-accelerated Real-time Monocular Dense Mapping
    Wang, Kaixuan
    Ding, Wenchao
    Shen, Shaojie
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 7817 - 7824
  • [48] STDC-SLAM: A Real-Time Semantic SLAM Detect Object by Short-Term Dense Concatenate Network
    Hu, Zhangfang
    Chen, Jian
    Luo, Yuan
    Zhang, Yi
    IEEE ACCESS, 2022, 10 : 129419 - 129428
  • [49] Towards Real-Time Monocular Depth Estimation For Mobile Systems
    Deldjoo, Yashar
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    Pernisco, Gaetano
    Reno, Vito
    Stella, Ettore
    MULTIMODAL SENSING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS II, 2021, 11785
  • [50] Real-Time Depth Estimation from a Monocular Moving Camera
    Handa, Aniket
    Sharma, Prateek
    CONTEMPORARY COMPUTING, 2012, 306 : 494 - 495