Graph attention network-optimized dynamic monocular visual odometry

被引:2
|
作者
Hongru, Zhao [1 ]
Xiuquan, Qiao [1 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
关键词
Monocular visual odometry; Multi-task learning; Multi-view geometry; Dynamic objects removal; Graph attention network; SEMANTIC SEGMENTATION;
D O I
10.1007/s10489-023-04687-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monocular Visual Odometry (VO) is often formulated as a sequential dynamics problem that relies on scene rigidity assumption. One of the main challenges is rejecting moving objects and estimating camera pose in dynamic environments. Existing methods either take the visual cues in the whole image equally or eliminate the fixed semantic categories by heuristics or attention mechanisms. However, they fail to tackle unknown dynamic objects which are not labeled in the training sets of the network. To solve these issues, we propose a novel framework, named graph attention network (GAT)-optimized dynamic monocular visual odometry (GDM-VO), to remove dynamic objects explicitly with semantic segmentation and multi-view geometry in this paper. Firstly, we employ a multi-task learning network to perform semantic segmentation and depth estimation. Then, we reject priori known and unknown objective moving objects through semantic information and multi-view geometry, respectively. Furthermore, to our best knowledge, we are the first to leverage GAT to capture long-range temporal dependencies from consecutive image sequences adaptively, while existing sequential modeling approaches need to select information manually. Extensive experiments on the KITTI and TUM datasets demonstrate the superior performance of GDM-VO overs existing state-of-the-art classical and learning-based monocular VO.
引用
收藏
页码:23067 / 23082
页数:16
相关论文
共 50 条
  • [1] Graph attention network-optimized dynamic monocular visual odometry
    Zhao Hongru
    Qiao Xiuquan
    Applied Intelligence, 2023, 53 : 23067 - 23082
  • [2] Pose Graph for Improved Monocular Visual Odometry
    Kieman, Pawel
    Narkiewicz, Janusz
    2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 549 - 553
  • [3] Monocular visual odometry in dynamic environments by utilizing motion segmentation and attention mechanism
    Chen, Shuo
    Lu, Lixin
    Chen, Dongxing
    Chen, Dongdong
    Kong, Dongdong
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
  • [4] Pose Graph Optimization for Unsupervised Monocular Visual Odometry
    Li, Yang
    Ushiku, Yoshitaka
    Harada, Tatsuya
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5439 - 5445
  • [5] Unsupervised Monocular Visual-inertial Odometry Network
    Wei, Peng
    Hua, Guoliang
    Huang, Weibo
    Meng, Fanyang
    Liu, Hong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2347 - 2354
  • [6] Dynamic Attention-based Visual Odometry
    Kuo, Xin-Yu
    Liu, Chien
    Lin, Kai-Chen
    Luo, Evan
    Chen, Yu-Wen
    Lee, Chun-Yi
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5753 - 5760
  • [7] Dynamic Attention-based Visual Odometry
    Kuo, Xin-Yu
    Liu, Chien
    Lin, Kai-Chen
    Lee, Chun-Yi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 160 - 169
  • [8] Unsupervised Scale Network for Monocular Relative Depth and Visual Odometry
    Wang, Zhongyi
    Chen, Qijun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [9] A review of monocular visual odometry
    He, Ming
    Zhu, Chaozheng
    Huang, Qian
    Ren, Baosen
    Liu, Jintao
    VISUAL COMPUTER, 2020, 36 (05): : 1053 - 1065
  • [10] A review of monocular visual odometry
    Ming He
    Chaozheng Zhu
    Qian Huang
    Baosen Ren
    Jintao Liu
    The Visual Computer, 2020, 36 : 1053 - 1065