GRiD: Guided Refinement for Detector-Free Multimodal Image Matching

被引:0
|
作者
Liu, Yuyan [1 ]
He, Wei [1 ]
Zhang, Hongyan [1 ,2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Image matching; Transformers; Optical imaging; Detectors; Semantics; Image edge detection; Adaptive optics; Robustness; Remote sensing; detector-free; guided refinement; multimodal images; REGISTRATION; TRANSFORMER; MODEL;
D O I
10.1109/TIP.2024.3472491
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal image matching is essential in image stitching, image fusion, change detection, and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and geometric distortions in multimodal images severely limit the accuracy of multimodal image matching, posing significant challenges to existing methods. Additionally, detector-based methods are prone to feature point offset issues in regions with substantial modal differences, which also hinder the subsequent fine registration and fusion of images. To address these challenges, we propose a guided refinement for detector-free multimodal image matching (GRiD) method, which weakens feature point offset issues by establishing pixel-level correspondences and utilizes reference points to guide and correct matches affected by NRD and geometric distortions. Specifically, we first introduce a detector-free framework to alleviate the feature point offset problem by directly finding corresponding pixels between images. Subsequently, to tackle NRD and geometric distortion in multimodal images, we design a guided correction module that establishes robust reference points (RPs) to guide the search for corresponding pixels in regions with significant modality differences. Moreover, to enhance RPs reliability, we incorporate a phase congruency module during the RPs confirmation stage to concentrate RPs around image edge structures. Finally, we perform finer localization on highly correlated corresponding pixels to obtain the optimized matches. We conduct extensive experiments on four multimodal image datasets to validate the effectiveness of the proposed approach. Experimental results demonstrate that our method can achieve sufficient and robust matches across various modality images and effectively suppress the feature point offset problem.
引用
收藏
页码:5892 / 5906
页数:15
相关论文
共 50 条
  • [31] Attention-based multimodal image matching
    Moreshet, Aviad
    Keller, Yosi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [32] Local similarity measures for multimodal image matching
    Rogelj, P
    Kovacic, S
    IWISPA 2000: PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2000, : 81 - 86
  • [33] Dense Image-Matching via Optical Flow Field Estimation and Fast-Guided Filter Refinement
    Yuan, Wei
    Yuan, Xiuxiao
    Xu, Shu
    Gong, Jianya
    Shibasaki, Ryosuke
    REMOTE SENSING, 2019, 11 (20)
  • [34] Image matching using enclosed region detector
    Zhang, Wei
    Wu, Q. M. Jonathan
    Wang, Guanghui
    You, Xinge
    Wang, Yongfang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 21 (04) : 271 - 282
  • [35] Optical multimodal probe for image guided surgery
    Yoon, Yeoreum
    Jang, Won Hyuk
    Kim, Kihean
    2014 11TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2014, : 551 - 553
  • [36] SFA-Net: A SAM-guided focused attention network for multimodal remote sensing image matching
    Gao, Tian
    Lan, Chaozhen
    Huang, Wenjun
    Wang, Sheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 223 : 188 - 206
  • [37] Multiscale Template Matching for Multimodal Remote Sensing Image
    Gao, Tian
    Lan, Chaozhen
    Huang, Wenjun
    Wang, Longhao
    Wei, Zijun
    Yao, Fushan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10132 - 10147
  • [38] Multimodal unbiased image matching via mutual information
    Yanovska, Igor
    Thompson, Paul M.
    Osher, Stanley J.
    Leow, Alex D.
    COMPUTATIONAL IMAGING VI, 2008, 6814
  • [39] Multimodal Convolutional Neural Networks for Matching Image and Sentence
    Ma, Lin
    Lu, Zhengdong
    Shang, Lifeng
    Li, Hang
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2623 - 2631
  • [40] Guided aggregation and disparity refinement for real-time stereo matching
    Yang, Jinlong
    Wu, Cheng
    Wang, Gang
    Chen, Dong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4467 - 4477