Semantic and Optical Flow Guided Self-supervised Monocular Depth and Ego-Motion Estimation

Cited by: 0
Authors
Fang, Jiaojiao [1 ]
Liu, Guizhong [1 ]
Affiliations
[1] Xi An Jiao Tong Univ, Xian 710049, Peoples R China
Keywords
Self-supervised learning; Monocular depth estimation; Camera pose estimation; Stereo vision;
DOI
10.1007/978-3-030-87361-5_38
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Self-supervised depth and camera pose estimation methods were proposed to address the difficulty of acquiring densely labeled ground-truth data, and they have advanced considerably. Because stereo vision can constrain the predicted depth to real-world scale, this paper studies the use of both left-right pairs and adjacent frames of stereo sequences for self-supervised, semantic- and optical-flow-guided monocular depth and camera pose estimation without ground-truth pose information. In particular, we explore (i) a cascaded structure of depth-pose and optical flow estimation that initializes the optical flow well, (ii) a cycle learning strategy that further constrains depth-pose learning through cross-task consistency, and (iii) a weighted semantic-guided smoothness loss that better matches the true structure of a depth map. Our method produces favorable results against state-of-the-art methods on several benchmarks. We also demonstrate its generalization ability in cross-dataset evaluation.
Pages: 465-477
Number of pages: 13
Related papers
50 in total
  • [1] Self-Supervised monocular depth and ego-Motion estimation in endoscopy: Appearance flow to the rescue
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Zhu, Wentao
    Wu, Xingming
    Sun, Dianmin
    Zhang, Baochang
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 77
  • [2] Self-supervised monocular depth and ego-motion estimation for CT-bronchoscopy fusion
    Chang, Qi
    Higgins, William E.
    [J]. IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, MEDICAL IMAGING 2024, 2024, 12928
  • [3] Self-Supervised Attention Learning for Depth and Ego-motion Estimation
    Sadek, Assem
    Chidlovskii, Boris
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10054 - 10060
  • [4] Self-Supervised Depth and Ego-Motion Estimation for Monocular Thermal Video Using Multi-Spectral Consistency Loss
    Shin, Ukcheol
    Lee, Kyunghyun
    Lee, Seokju
    Kweon, In So
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1103 - 1110
  • [5] WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters
    Lou, Ange
    Noble, Jack
    [J]. IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, MEDICAL IMAGING 2024, 2024, 12928
  • [6] Beyond Photometric Loss for Self-Supervised Ego-Motion Estimation
    Shen, Tianwei
    Luo, Zixin
    Zhou, Lei
    Deng, Hanyu
    Zhang, Runze
    Fang, Tian
    Quan, Long
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6359 - 6365
  • [7] Two Stream Networks for Self-Supervised Ego-Motion Estimation
    Ambrus, Rares
    Guizilini, Vitor
    Li, Jie
    Pillai, Sudeep
    Gaidon, Adrien
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [8] Joint self-supervised learning of interest point, descriptor, depth, and ego-motion from monocular video
    Wang, Zhongyi
    Shen, Mengjiao
    Chen, Qijun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (32) : 77529 - 77547
  • [9] Semantically guided self-supervised monocular depth estimation
    Lu, Xiao
    Sun, Haoran
    Wang, Xiuling
    Zhang, Zhiguo
    Wang, Haixia
    [J]. IET IMAGE PROCESSING, 2022, 16 (05) : 1293 - 1304
  • [10] Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion
    Vasiljevic, Igor
    Guizilini, Vitor
    Ambrus, Rares
    Pillai, Sudeep
    Burgard, Wolfram
    Shakhnarovich, Greg
    Gaidon, Adrien
    [J]. 2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 1 - 11