Multimodal Remote Sensing Image Matching via Learning Features and Attention Mechanism

被引:3
|
作者
Zhang, Yongxian [1 ]
Lan, Chaozhen [2 ]
Zhang, Haiming [1 ]
Ma, Guorui [1 ]
Li, Heng [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] PLA Strateg Support Force Informat Engn Univ, Inst Geospatial Informat, Zhengzhou 450001, Peoples R China
关键词
Attention mechanism; feature matching; learning feature; multimodal image; transformer; TRANSFORMER; FRAMEWORK; NETWORK; CNN;
D O I
10.1109/TGRS.2023.3348980
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Matching multimodal remote sensing images (RSIs) remains an ongoing challenge due to the significant nonlinear radiometric differences and geometric distortions, resulting in matches exhibiting one-to-many matches or mismatches. To tackle this challenge, we propose a novel approach for multimodal RSI matching called modality-independent consistency matching (MICM), which leverages the capabilities of deep convolutional neural networks and the transformer attention mechanism to improve the matching performance. The proposed MICM method consists of three key steps. First, a Unet-like feature extraction backbone network is employed to learn multiscale invariant features from multimodal RSIs, enabling the extraction of rich and evenly distributed feature keypoints. Second, a hybrid approach combining local learning features with the transformer attention mechanism is introduced to aggregate learning features, facilitating both detailed capture and long-range modeling to enhance the representation ability of the features. Third, a feature consistency correlation strategy is adopted to maximize the number of correct corresponding feature points, ensuring reliable matching performance. The performance of the proposed method has been extensively evaluated on both the same scene and different scene multimodal RSIs, which are captured from various imaging modes, wavebands, and platforms. The results show the superior matching performance of the proposed MICM method compared to commonly used and state-of-the-art handcrafted- and learning-based methods when evaluated on both the same scene and different scene datasets. The proposed method serves as a valuable reference for addressing common challenges in multimodal RSI matching.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [1] Multimodal Remote Sensing Image Matching Combining Learning Features and Delaunay Triangulation
    Zhang, Yongxian
    Liu, Yuxuan
    Zhang, Haiming
    Ma, Guorui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Self-Similarity Features for Multimodal Remote Sensing Image Matching
    Xiong, Xin
    Jin, Guowang
    Xu, Qing
    Zhang, Hongmin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 12440 - 12454
  • [3] Multiscale Template Matching for Multimodal Remote Sensing Image
    Gao, Tian
    Lan, Chaozhen
    Huang, Wenjun
    Wang, Longhao
    Wei, Zijun
    Yao, Fushan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10132 - 10147
  • [4] Combining Phase Congruency and Self-Similarity Features for Multimodal Remote Sensing Image Matching
    Fan, Jianwei
    Xiong, Qing
    Ye, Yuanxin
    Li, Jian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [5] Combining Phase Congruency and Self-Similarity Features for Multimodal Remote Sensing Image Matching
    Fan, Jianwei
    Xiong, Qing
    Ye, Yuanxin
    Li, Jian
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [6] Fast and Robust Matching for Multimodal Remote Sensing Image Registration
    Ye, Yuanxin
    Bruzzone, Lorenzo
    Shan, Jie
    Bovolo, Francesca
    Zhu, Qing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 9059 - 9070
  • [7] Joint Self-Attention for Remote Sensing Image Matching
    Li, Liangzhi
    Han, Ling
    Cao, Hongye
    Hu, Huijuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [8] SFA-Net: A SAM-guided focused attention network for multimodal remote sensing image matching
    Gao, Tian
    Lan, Chaozhen
    Huang, Wenjun
    Wang, Sheng
    ISPRS Journal of Photogrammetry and Remote Sensing, 2025, 223 : 188 - 206
  • [9] Remote Sensing Multimodal Image Matching Based on Structure Feature and Learnable Matching Network
    Han, Songlai
    Liu, Xuesong
    Dong, Jing
    Liu, Haiqiao
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [10] Multimodal Remote Sensing Image Registration Based on Image Transfer and Local Features
    Zhang, Jun
    Ma, Wenping
    Wu, Yue
    Jiao, Licheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (08) : 1210 - 1214