Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization

被引:0
|
作者
MA Shuo [1 ]
GAO Yongbin [1 ]
TIAN Fangzheng [1 ]
LU Junxin [1 ]
HUANG Bo [1 ]
GU Jia [1 ]
ZHOU Yilong [1 ]
机构
[1] College of Electronic and Electrical Engineering,Shanghai University of Engineering Science
基金
中国国家自然科学基金;
关键词
D O I
10.19823/j.cnki.1007-1202.2021.0021
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
To address the problem that traditional keypoint detection methods are susceptible to complex backgrounds and local similarity of images resulting in inaccurate descriptor matching and bias in visual localization, keypoints and descriptors based on cross-modality fusion are proposed and applied to the study of camera motion estimation. A convolutional neural network is used to detect the positions of keypoints and generate the corresponding descriptors, and the pyramid convolution is used to extract multi-scale features in the network. The problem of local similarity of images is solved by capturing local and global feature information and fusing the geometric position information of keypoints to generate descriptors. According to our experiments, the repeatability of our method is improved by 3.7%, and the homography estimation is improved by 1.6%. To demonstrate the practicability of the method, the visual odometry part of simultaneous localization and mapping is constructed and our method is 35% higher positioning accuracy than the traditional method.
引用
收藏
页码:128 / 136
页数:9
相关论文
共 50 条
  • [31] Cross-Modality Binary Code Learning via Fusion Similarity Hashing
    Liu, Hong
    Ji, Rongrong
    Wu, Yongjian
    Huang, Feiyue
    Zhang, Baochang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6345 - 6353
  • [32] A novel infrared and visible image fusion network based on cross-modality reinforcement and multi-attention fusion strategy
    Qi, Biao
    Zhang, Yu
    Nie, Ting
    Yu, Da
    Lv, Hengyi
    Li, Guoning
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [33] Cross-Modality Transfer Learning for Image-Text Information Management
    Niu, Shuteng
    Jiang, Yushan
    Chen, Bowen
    Wang, Jian
    Liu, Yongxin
    Song, Houbing
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (01)
  • [35] Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding
    Lv, Zezhong
    Su, Bing
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1487 - 1492
  • [36] Global and Part Feature Fusion for Cross-Modality Person Re-Identification
    Wang, Xianju
    Cordova, Ronald S.
    IEEE ACCESS, 2022, 10 : 122038 - 122046
  • [37] Cascaded Cross-Modality Fusion Network for 3D Object Detection
    Chen, Zhiyu
    Lin, Qiong
    Sun, Jing
    Feng, Yujian
    Liu, Shangdong
    Liu, Qiang
    Ji, Yimu
    Xu, He
    SENSORS, 2020, 20 (24) : 1 - 14
  • [38] CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation
    Zhao, Lingjun
    Song, Jingyu
    Skinner, Katherine A.
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15470 - 15480
  • [39] Biologically motivated cross-modality sensory fusion system for automatic target recognition
    Huntsberger, T
    NEURAL NETWORKS, 1995, 8 (7-8) : 1215 - 1226
  • [40] Cross-modality fusion with EEG and text for enhanced emotion detection in English writing
    Wang, Jing
    Zhang, Ci
    FRONTIERS IN NEUROROBOTICS, 2025, 18