Graph attention network-optimized dynamic monocular visual odometry

Cited by: 2
Authors
Hongru, Zhao [1]
Xiuquan, Qiao [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
Keywords
Monocular visual odometry; Multi-task learning; Multi-view geometry; Dynamic objects removal; Graph attention network; Semantic segmentation
DOI
10.1007/s10489-023-04687-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Monocular visual odometry (VO) is often formulated as a sequential dynamics problem that relies on the scene rigidity assumption. One of the main challenges is rejecting moving objects while estimating camera pose in dynamic environments. Existing methods either treat the visual cues across the whole image equally or eliminate fixed semantic categories using heuristics or attention mechanisms. However, they fail to handle unknown dynamic objects that are not labeled in the network's training sets. To solve these issues, we propose a novel framework, graph attention network (GAT)-optimized dynamic monocular visual odometry (GDM-VO), which removes dynamic objects explicitly using semantic segmentation and multi-view geometry. First, we employ a multi-task learning network to perform semantic segmentation and depth estimation. Then, we reject a priori known moving objects through semantic information and unknown moving objects through multi-view geometry. Furthermore, to the best of our knowledge, we are the first to leverage a GAT to adaptively capture long-range temporal dependencies from consecutive image sequences, whereas existing sequential modeling approaches need to select information manually. Extensive experiments on the KITTI and TUM datasets demonstrate the superior performance of GDM-VO over existing state-of-the-art classical and learning-based monocular VO methods.
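The two-stage rejection described in the abstract (semantic labels for known movers, multi-view geometry for unknown ones) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `static_mask`, the relative depth tolerance `rel_tol`, the class names, and the specific depth-consistency test are all assumptions chosen for illustration, since the abstract does not state the actual criterion.

```python
from typing import List, Set


def static_mask(
    labels: List[List[str]],
    depth_pred: List[List[float]],
    depth_proj: List[List[float]],
    dynamic_classes: Set[str],
    rel_tol: float = 0.1,  # hypothetical threshold, not taken from the paper
) -> List[List[bool]]:
    """Return a per-pixel mask where True marks pixels kept as static.

    labels      : semantic class per pixel (e.g. from a segmentation head)
    depth_pred  : depth estimated for the current frame
    depth_proj  : depth of the same pixels projected from a neighboring view
    """
    mask = []
    for lab_row, pred_row, proj_row in zip(labels, depth_pred, depth_proj):
        row = []
        for lab, d_pred, d_proj in zip(lab_row, pred_row, proj_row):
            # Stage 1: classes known a priori to move are rejected outright.
            semantic_dynamic = lab in dynamic_classes
            # Stage 2: any pixel whose two depth values disagree is treated
            # as moving, whatever its class (assumed consistency test).
            geometric_dynamic = abs(d_pred - d_proj) > rel_tol * d_pred
            row.append(not (semantic_dynamic or geometric_dynamic))
        mask.append(row)
    return mask


# Toy 2x2 image: "car" is a known mover; "box" is unlabeled as dynamic
# but its depth is inconsistent across views.
labels = [["road", "car"], ["building", "box"]]
depth_pred = [[10.0, 8.0], [20.0, 5.0]]
depth_proj = [[10.2, 8.1], [19.5, 7.0]]
mask = static_mask(labels, depth_pred, depth_proj, {"car", "person"})
print(mask)
```

In this toy input the "car" pixel is dropped by the semantic stage and the "box" pixel by the geometric stage, which is the case a fixed list of semantic categories would miss.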
Pages: 23067-23082
Number of pages: 16