Graph attention network-optimized dynamic monocular visual odometry

被引：2

作者：

Hongru, Zhao ^{[1
]}

Xiuquan, Qiao ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 20期

关键词：

Monocular visual odometry; Multi-task learning; Multi-view geometry; Dynamic objects removal; Graph attention network; SEMANTIC SEGMENTATION;

D O I：

10.1007/s10489-023-04687-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Monocular Visual Odometry (VO) is often formulated as a sequential dynamics problem that relies on scene rigidity assumption. One of the main challenges is rejecting moving objects and estimating camera pose in dynamic environments. Existing methods either take the visual cues in the whole image equally or eliminate the fixed semantic categories by heuristics or attention mechanisms. However, they fail to tackle unknown dynamic objects which are not labeled in the training sets of the network. To solve these issues, we propose a novel framework, named graph attention network (GAT)-optimized dynamic monocular visual odometry (GDM-VO), to remove dynamic objects explicitly with semantic segmentation and multi-view geometry in this paper. Firstly, we employ a multi-task learning network to perform semantic segmentation and depth estimation. Then, we reject priori known and unknown objective moving objects through semantic information and multi-view geometry, respectively. Furthermore, to our best knowledge, we are the first to leverage GAT to capture long-range temporal dependencies from consecutive image sequences adaptively, while existing sequential modeling approaches need to select information manually. Extensive experiments on the KITTI and TUM datasets demonstrate the superior performance of GDM-VO overs existing state-of-the-art classical and learning-based monocular VO.

引用

页码：23067 / 23082

页数：16

共 50 条

[1] Graph attention network-optimized dynamic monocular visual odometry
Zhao Hongru
Qiao Xiuquan
Applied Intelligence, 2023, 53 : 23067 - 23082
[2] Pose Graph for Improved Monocular Visual Odometry
Kieman, Pawel
Narkiewicz, Janusz
2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 549 - 553
[3] Monocular visual odometry in dynamic environments by utilizing motion segmentation and attention mechanism
Chen, Shuo
Lu, Lixin
Chen, Dongxing
Chen, Dongdong
Kong, Dongdong
MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
[4] Pose Graph Optimization for Unsupervised Monocular Visual Odometry
Li, Yang
Ushiku, Yoshitaka
Harada, Tatsuya
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5439 - 5445
[5] Unsupervised Monocular Visual-inertial Odometry Network
Wei, Peng
Hua, Guoliang
Huang, Weibo
Meng, Fanyang
Liu, Hong
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2347 - 2354
[6] Dynamic Attention-based Visual Odometry
Kuo, Xin-Yu
Liu, Chien
Lin, Kai-Chen
Luo, Evan
Chen, Yu-Wen
Lee, Chun-Yi
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5753 - 5760
[7] Dynamic Attention-based Visual Odometry
Kuo, Xin-Yu
Liu, Chien
Lin, Kai-Chen
Lee, Chun-Yi
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 160 - 169
[8] Unsupervised Scale Network for Monocular Relative Depth and Visual Odometry
Wang, Zhongyi
Chen, Qijun
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
[9] A review of monocular visual odometry
He, Ming
Zhu, Chaozheng
Huang, Qian
Ren, Baosen
Liu, Jintao
VISUAL COMPUTER, 2020, 36 (05): : 1053 - 1065
[10] A review of monocular visual odometry
Ming He
Chaozheng Zhu
Qian Huang
Baosen Ren
Jintao Liu
The Visual Computer, 2020, 36 : 1053 - 1065

← 1 2 3 4 5 →