Graph attention network-optimized dynamic monocular visual odometry

Cited by: 2
Authors
Hongru, Zhao [1]
Xiuquan, Qiao [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
Keywords
Monocular visual odometry; Multi-task learning; Multi-view geometry; Dynamic objects removal; Graph attention network; Semantic segmentation
DOI
10.1007/s10489-023-04687-1
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Monocular visual odometry (VO) is often formulated as a sequential dynamics problem that relies on the scene rigidity assumption. One of its main challenges is rejecting moving objects while estimating camera pose in dynamic environments. Existing methods either weight visual cues across the whole image equally or eliminate fixed semantic categories using heuristics or attention mechanisms. However, they fail to handle unknown dynamic objects that are not labeled in the network's training set. To address these issues, we propose a novel framework, graph attention network (GAT)-optimized dynamic monocular visual odometry (GDM-VO), which removes dynamic objects explicitly using semantic segmentation and multi-view geometry. First, we employ a multi-task learning network to perform semantic segmentation and depth estimation. Then, we reject a priori known moving objects through semantic information and unknown moving objects through multi-view geometry. Furthermore, to the best of our knowledge, we are the first to leverage a GAT to adaptively capture long-range temporal dependencies from consecutive image sequences, whereas existing sequential modeling approaches require information to be selected manually. Extensive experiments on the KITTI and TUM datasets demonstrate the superior performance of GDM-VO over existing state-of-the-art classical and learning-based monocular VO methods.
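The abstract does not say how GDM-VO builds its graph or attention layers, so the following is only a minimal sketch of a generic single-head graph attention layer applied to per-frame features. It assumes each frame is a node, that frames within a fixed temporal window are connected, and that the features are random placeholders. The names `gat_layer`, `temporal_adjacency`, and `window` are hypothetical and do not come from the paper, and the output nonlinearity used in standard GAT layers is omitted for brevity.

```python
import numpy as np

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def temporal_adjacency(n_frames, window):
    """Connect each frame to frames at most `window` steps away (self-loops included).
    This graph construction is an assumption, not taken from the paper."""
    idx = np.arange(n_frames)
    return np.abs(idx[:, None] - idx[None, :]) <= window

def gat_layer(h, adj, W, a, slope=0.2):
    """Generic single-head graph attention layer (illustrative only).

    h:   (N, F) node features, one row per frame
    adj: (N, N) boolean adjacency; every row must contain at least one True
    W:   (F, F_out) shared linear projection
    a:   (2 * F_out,) attention vector
    Returns the aggregated features (N, F_out) and attention weights (N, N).
    """
    z = h @ W                                   # project node features
    f_out = z.shape[1]
    # a^T [z_i || z_j] splits into a source term plus a destination term
    src = z @ a[:f_out]
    dst = z @ a[f_out:]
    e = leaky_relu(src[:, None] + dst[None, :], slope)
    e = np.where(adj, e, -np.inf)               # mask out non-neighbours
    e = e - e.max(axis=1, keepdims=True)        # stabilise the softmax
    w = np.exp(e)
    alpha = w / w.sum(axis=1, keepdims=True)    # softmax over each node's neighbours
    return alpha @ z, alpha

# Usage with random placeholder features for six consecutive frames
rng = np.random.default_rng(0)
n_frames, feat_dim, out_dim = 6, 8, 4
h = rng.normal(size=(n_frames, feat_dim))
W = rng.normal(size=(feat_dim, out_dim))
a = rng.normal(size=2 * out_dim)
adj = temporal_adjacency(n_frames, window=2)
h_out, alpha = gat_layer(h, adj, W, a)
```

Each row of `alpha` is a learned-style weighting over the frames a given frame attends to, which is one way a model could select temporal context adaptively rather than through a hand-picked rule.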
Pages: 23067-23082
Number of pages: 16