Keypoints and Descriptors Based on Cross-Modality Information Fusion for Camera Localization

被引:0
|
作者
MA Shuo [1 ]
GAO Yongbin [1 ]
TIAN Fangzheng [1 ]
LU Junxin [1 ]
HUANG Bo [1 ]
GU Jia [1 ]
ZHOU Yilong [1 ]
机构
[1] College of Electronic and Electrical Engineering,Shanghai University of Engineering Science
基金
中国国家自然科学基金;
关键词
D O I
10.19823/j.cnki.1007-1202.2021.0021
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
To address the problem that traditional keypoint detection methods are susceptible to complex backgrounds and local similarity of images resulting in inaccurate descriptor matching and bias in visual localization, keypoints and descriptors based on cross-modality fusion are proposed and applied to the study of camera motion estimation. A convolutional neural network is used to detect the positions of keypoints and generate the corresponding descriptors, and the pyramid convolution is used to extract multi-scale features in the network. The problem of local similarity of images is solved by capturing local and global feature information and fusing the geometric position information of keypoints to generate descriptors. According to our experiments, the repeatability of our method is improved by 3.7%, and the homography estimation is improved by 1.6%. To demonstrate the practicability of the method, the visual odometry part of simultaneous localization and mapping is constructed and our method is 35% higher positioning accuracy than the traditional method.
引用
收藏
页码:128 / 136
页数:9
相关论文
共 50 条
  • [1] Cross-modality complementary information fusion for multispectral pedestrian detection
    Yan, Chaoqi
    Zhang, Hong
    Li, Xuliang
    Yang, Yifan
    Yuan, Ding
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (14): : 10361 - 10386
  • [2] Cross-modality complementary information fusion for multispectral pedestrian detection
    Chaoqi Yan
    Hong Zhang
    Xuliang Li
    Yifan Yang
    Ding Yuan
    Neural Computing and Applications, 2023, 35 : 10361 - 10386
  • [3] POLO: Learning Explicit Cross-Modality Fusion for Temporal Action Localization
    Wang, Binglu
    Yang, Le
    Zhao, Yongqiang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 503 - 507
  • [4] Image manipulation localization via dynamic cross-modality fusion and progressive integration
    Jin, Xiao
    Yu, Wen
    Shi, Wei
    NEUROCOMPUTING, 2024, 610
  • [5] CROSS-MODALITY TRANSFER OF SPATIAL INFORMATION
    FISHBEIN, HD
    DECKER, J
    WILCOX, P
    BRITISH JOURNAL OF PSYCHOLOGY, 1977, 68 (NOV) : 503 - 508
  • [6] Camera-LiDAR Cross-Modality Fusion Water Segmentation for Unmanned Surface Vehicles
    Gao, Jiantao
    Zhang, Jingting
    Liu, Chang
    Li, Xiaomao
    Peng, Yan
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (06)
  • [7] Self-attention Cross-modality Fusion Network for Cross-modality Person Re-identification
    Du P.
    Song Y.-H.
    Zhang X.-Y.
    Zidonghua Xuebao/Acta Automatica Sinica, 2022, 48 (06): : 1457 - 1468
  • [8] Camera-LiDAR Cross-Modality Gait Recognition
    Guo, Wenxuan
    Liang, Yingping
    Pan, Zhiyu
    Xi, Ziheng
    Feng, Jianjiang
    Zhou, Jie
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 439 - 455
  • [9] Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion
    Tang, Tianqi
    Du, Heming
    Yu, Xin
    Yang, Yi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5422 - 5430
  • [10] Cross-modality feature fusion for night pedestrian detection
    Feng, Yong
    Luo, Enbo
    Lu, Hai
    Zhai, SuWei
    FRONTIERS IN PHYSICS, 2024, 12