An Unsupervised Approach for Simultaneous Visual Odometry and Single Image Depth Estimation

被引:0
|
作者
Lu, Yawen [1 ]
Lu, Guoyu [1 ]
机构
[1] Rochester Inst Technol, Intelligent Vis & Sensing Lab, Rochester, NY 14623 USA
基金
美国国家科学基金会;
关键词
STEREO;
D O I
10.1109/IJCNN55064.2022.9892294
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual odometry (VO) and single image depth estimation are critical for robot vision, 3D reconstruction, and camera pose estimation that can be applied to autonomous driving, map building, augmented reality and many other applications. Various supervised learning models have been proposed to train the VO or single image depth estimation framework for each targeted scene to improve the performance recently. However, little effort has been made to learn these separate tasks together without requiring the collection of a significant number of labels. This paper proposes a novel unsupervised learning approach to simultaneously perceive VO and single image depth estimation. In our framework, either of these tasks can benefit from each other through simultaneously learning these two tasks. We correlate these two tasks by enforcing depth consistency between VO and single image depth estimation. Based on the single image depth estimation, we can resolve the most common and challenging scaling issue of monocular VO. Meanwhile, through training from a sequence of images, VO can enhance the single image depth estimation accuracy. The effectiveness of our proposed method is demonstrated through extensive experiments compared with current state-of-the-art methods on the benchmark datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] VIMO: Simultaneous Visual Inertial Model-based Odometry and Force Estimation
    Nisar, Barza
    Foehn, Philipp
    Falanga, Davide
    Scaramuzza, Davide
    ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
  • [42] VIMO: Simultaneous Visual Inertial Model-Based Odometry and Force Estimation
    Nisar, Barza
    Foehn, Philipp
    Falanga, Davide
    Scaramuzza, Davide
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2785 - 2792
  • [43] SINGLE IMAGE DEPTH ESTIMATION FROM IMAGE DESCRIPTORS
    Lin, Yu-Hsun
    Cheng, Wen-Huang
    Miao, Hsin
    Ku, Tsung-Hao
    Hsieh, Yung-Huan
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 809 - 812
  • [44] Unsupervised single image-based depth estimation powered by coplanarity-driven disparity derivation
    Yao, Xiaoling
    Hu, Lihua
    Ma, Yang
    Zhang, Jifu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [45] Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue
    Garg, Ravi
    VijayKumar, B. G.
    Carneiro, Gustavo
    Reid, Ian
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 740 - 756
  • [46] Self-supervised deep monocular visual odometry and depth estimation with observation variation
    Zhao, Wentao
    Wang, Yanbo
    Wang, Zehao
    Li, Rui
    Xiao, Peng
    Wang, Jingchuan
    Guo, Rui
    DISPLAYS, 2023, 80
  • [47] Collaborative Learning of Depth Estimation, Visual Odometry and Camera Relocalization from Monocular Videos
    Zhao, Haimei
    Bian, Wei
    Yuan, Bo
    Tao, Dacheng
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 488 - 494
  • [48] Unsupervised visual odometry method for greenhouse mobile robots
    Wu X.
    Zhou Y.
    Liu J.
    Liu Z.
    Wang C.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (10): : 163 - 174
  • [49] Eliminating Scale Ambiguity of Unsupervised Monocular Visual Odometry
    Zhongyi Wang
    Mengjiao Shen
    Qijun Chen
    Neural Processing Letters, 2023, 55 : 9743 - 9764
  • [50] Eliminating Scale Ambiguity of Unsupervised Monocular Visual Odometry
    Wang, Zhongyi
    Shen, Mengjiao
    Chen, Qijun
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 9743 - 9764