Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization

被引：0

作者：

MA Shuo ^{[1
]}

GAO Yongbin ^{[1
]}

TIAN Fangzheng ^{[1
]}

LU Junxin ^{[1
]}

HUANG Bo ^{[1
]}

GU Jia ^{[1
]}

ZHOU Yilong ^{[1
]}

机构：

[1] College of Electronic and Electrical Engineering,Shanghai University of Engineering Science

来源：

WuhanUniversityJournalofNaturalSciences | 2021年 / 26卷 / 02期

基金：

中国国家自然科学基金;

关键词：

D O I：

10.19823/j.cnki.1007-1202.2021.0021

中图分类号：

TP391.41 [];

学科分类号：

080203 ;

摘要：

To address the problem that traditional keypoint detection methods are susceptible to complex backgrounds and local similarity of images resulting in inaccurate descriptor matching and bias in visual localization, keypoints and descriptors based on cross-modality fusion are proposed and applied to the study of camera motion estimation. A convolutional neural network is used to detect the positions of keypoints and generate the corresponding descriptors, and the pyramid convolution is used to extract multi-scale features in the network. The problem of local similarity of images is solved by capturing local and global feature information and fusing the geometric position information of keypoints to generate descriptors. According to our experiments, the repeatability of our method is improved by 3.7%, and the homography estimation is improved by 1.6%. To demonstrate the practicability of the method, the visual odometry part of simultaneous localization and mapping is constructed and our method is 35% higher positioning accuracy than the traditional method.

引用

页码：128 / 136

页数：9

共 50 条

[31] Cross-Modality Binary Code Learning via Fusion Similarity Hashing
Liu, Hong
Ji, Rongrong
Wu, Yongjian
Huang, Feiyue
Zhang, Baochang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6345 - 6353
[32] A novel infrared and visible image fusion network based on cross-modality reinforcement and multi-attention fusion strategy
Qi, Biao
Zhang, Yu
Nie, Ting
Yu, Da
Lv, Hengyi
Li, Guoning
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
[33] Cross-Modality Transfer Learning for Image-Text Information Management
Niu, Shuteng
Jiang, Yushan
Chen, Bowen
Wang, Jian
Liu, Yongxin
Song, Houbing
ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (01)
[34] STIMULUS INFORMATION AND SEQUENTIAL DEPENDENCIES IN MAGNITUDE ESTIMATION AND CROSS-MODALITY MATCHING
WARD, LM
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1979, 5 (03) : 444 - 459
[35] Temporal-enhanced Cross-modality Fusion Network for Video Sentence Grounding
Lv, Zezhong
Su, Bing
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1487 - 1492
[36] Global and Part Feature Fusion for Cross-Modality Person Re-Identification
Wang, Xianju
Cordova, Ronald S.
IEEE ACCESS, 2022, 10 : 122038 - 122046
[37] Cascaded Cross-Modality Fusion Network for 3D Object Detection
Chen, Zhiyu
Lin, Qiong
Sun, Jing
Feng, Yujian
Liu, Shangdong
Liu, Qiang
Ji, Yimu
Xu, He
SENSORS, 2020, 20 (24) : 1 - 14
[38] CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation
Zhao, Lingjun
Song, Jingyu
Skinner, Katherine A.
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15470 - 15480
[39] Biologically motivated cross-modality sensory fusion system for automatic target recognition
Huntsberger, T
NEURAL NETWORKS, 1995, 8 (7-8) : 1215 - 1226
[40] Cross-modality fusion with EEG and text for enhanced emotion detection in English writing
Wang, Jing
Zhang, Ci
FRONTIERS IN NEUROROBOTICS, 2025, 18

← 1 2 3 4 5 →