Self-Supervised Attention Learning for Depth and Ego-motion Estimation

被引:4
|
作者
Sadek, Assent [1 ]
Chidlovskii, Boris [1 ]
机构
[1] Naver Labs Europe, Chemin Maupertuis 6, F-38240 Meylan, France
关键词
D O I
10.1109/IROS45743.2020.9340820
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of depth and ego-motion estimation from image sequences. Recent advances in the domain propose to train a deep learning model for both tasks using image reconstruction in a self-supervised manner. We revise the assumptions and the limitations of the current approaches and propose two improvements to boost the performance of the depth and ego-motion estimation. We first use Lie group properties to enforce the geometric consistency between images in the sequence and their reconstructions. We then propose a mechanism to pay attention to image regions where the image reconstruction gets corrupted. We show how to integrate the attention mechanism in the form of attention gates in the pipeline and use attention coefficients as a mask. We evaluate the new architecture on the KITTI datasets and compare it to the previous techniques. We show that our approach improves the state-of-the-art results for ego-motion estimation and achieve comparable results for depth estimation.
引用
收藏
页码:10054 / 10060
页数:7
相关论文
共 50 条
  • [1] Neural Ray Surfaces for Self-Supervised Learning of Depth and Ego-motion
    Vasiljevic, Igor
    Guizilini, Vitor
    Ambrus, Rares
    Pillai, Sudeep
    Burgard, Wolfram
    Shakhnarovich, Greg
    Gaidon, Adrien
    [J]. 2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 1 - 11
  • [2] CeMNet: Self-supervised learning for accurate continuous ego-motion estimation
    Lee, Minhaeng
    Fowlkes, Charless C.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 354 - 363
  • [3] Semantic and Optical Flow Guided Self-supervised Monocular Depth and Ego-Motion Estimation
    Fang, Jiaojiao
    Liu, Guizhong
    [J]. IMAGE AND GRAPHICS (ICIG 2021), PT III, 2021, 12890 : 465 - 477
  • [4] Self-supervised monocular depth and ego-motion estimation for CT-bronchoscopy fusion
    Chang, Qi
    Higgins, William E.
    [J]. IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, MEDICAL IMAGING 2024, 2024, 12928
  • [5] Self-Supervised monocular depth and ego-Motion estimation in endoscopy: Appearance flow to the rescue
    Shao, Shuwei
    Pei, Zhongcai
    Chen, Weihai
    Zhu, Wentao
    Wu, Xingming
    Sun, Dianmin
    Zhang, Baochang
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 77
  • [6] Beyond Photometric Loss for Self-Supervised Ego-Motion Estimation
    Shen, Tianwei
    Luo, Zixin
    Zhou, Lei
    Deng, Hanyu
    Zhang, Runze
    Fang, Tian
    Quan, Long
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6359 - 6365
  • [7] Two Stream Networks for Self-Supervised Ego-Motion Estimation
    Ambrus, Rares
    Guizilini, Vitor
    Li, Jie
    Pillai, Sudeep
    Gaidon, Adrien
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [8] Self-Supervised Ego-Motion Estimation Based on Multi-Layer Fusion of RGB and Inferred Depth
    Jiang, Zijie
    Taira, Hajime
    Miyashita, Naoyuki
    Okutomi, Masatoshi
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7605 - 7611
  • [9] Maximizing Self-Supervision From Thermal Image for Effective Self-Supervised Learning of Depth and Ego-Motion
    Shin, Ukcheol
    Lee, Kyunghyun
    Lee, Byeong-Uk
    Kweon, In So
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7771 - 7778
  • [10] Self-Supervised Learning of Depth and Ego-Motion for 3D Perception in Human Computer Interaction
    Qiao, Shanbao
    Xiong, Neal N.
    Gao, Yongbin
    Fang, Zhijun
    Yu, Wenjun
    Zhang, Juan
    Jiang, Xiaoyan
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (02)