Fine-MVO: Toward Fine-Grained Feature Enhancement for Self-Supervised Monocular Visual Odometry in Dynamic Environments

Authors
Wei, Wenhui [1, 2]
Ping, Yang [3]
Li, Jiadong [2]
Liu, Xin [2]
Zhou, Yangfan [2, 4]
Affiliations
[1] Univ Sci & Technol China, Sch Nanotech & Nanobion, Hefei 230026, Peoples R China
[2] Chinese Acad Sci, Suzhou Inst Nanotech & Nanobion SINANO, Suzhou 215123, Peoples R China
[3] Acad Mil Sci, Beijing 100091, Peoples R China
[4] Guangdong Inst Semicond Micronano Mfg Technol, Foshan 528000, Peoples R China
Keywords
Semantics; Training; Pose estimation; Task analysis; Visual odometry; Robustness; Multitasking; Monocular visual odometry; feature enhancement; self-supervised learning; dynamic environments; multi-task learning
DOI
10.1109/TITS.2024.3404924
Chinese Library Classification (CLC) number
TU [Architectural Science]
Discipline classification code
0813
Abstract
Self-supervised monocular visual odometry has the crucial advantage of not depending on labels and has shown strong performance in autonomous driving and robotics. However, recent methods suffer from limited feature representations because they depend on coarse semantic masks to handle dynamic objects, which reduces accuracy in dynamic environments. In contrast to these coarse-grained methods, we present Fine-MVO, a novel self-supervised monocular visual odometry method that addresses dynamic objects using implicit fine-grained feature representations, thus achieving excellent accuracy and robustness in dynamic environments. First, Fine-MVO provides an efficient cross-feature augmentation module and a novel loss weight balance strategy to effectively leverage fine-grained features with implicit semantic information, which greatly improves depth estimation accuracy, especially on object boundaries. Second, we design a novel pose-feature enhancement module and an effective two-stage training policy that enable the pose network to focus on robust static regions and temporal information, thereby enhancing pose estimation performance in dynamic and long-term environments. Extensive experimental results demonstrate the accuracy and generalization of Fine-MVO. Specifically, Fine-MVO achieves a 36.80% improvement in pose accuracy over the state-of-the-art method on the KITTI dataset, even surpassing geometry-based visual odometry methods that use loop closure. Furthermore, Fine-MVO generalizes well to the outdoor dataset AirDOS-Shibuya, attaining an improvement of 28.22% over the current advanced method. Fine-MVO also shows strong generalization on the indoor dataset TUM-RGBD.
Pages: 14