GRiD: Guided Refinement for Detector-Free Multimodal Image Matching

Cited by: 0

Authors:

Liu, Yuyan [1]

He, Wei [1]

Zhang, Hongyan [1,2]

Affiliations:

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430072, Peoples R China

[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Image matching; Transformers; Optical imaging; Detectors; Semantics; Image edge detection; Adaptive optics; Robustness; Remote sensing; detector-free; guided refinement; multimodal images; REGISTRATION; TRANSFORMER; MODEL;

DOI:

10.1109/TIP.2024.3472491

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multimodal image matching is essential in image stitching, image fusion, change detection, and land cover mapping. However, the severe nonlinear radiometric distortion (NRD) and geometric distortions in multimodal images severely limit the accuracy of multimodal image matching, posing significant challenges to existing methods. Additionally, detector-based methods are prone to feature point offset issues in regions with substantial modal differences, which also hinder the subsequent fine registration and fusion of images. To address these challenges, we propose a guided refinement for detector-free multimodal image matching (GRiD) method, which weakens feature point offset issues by establishing pixel-level correspondences and utilizes reference points to guide and correct matches affected by NRD and geometric distortions. Specifically, we first introduce a detector-free framework to alleviate the feature point offset problem by directly finding corresponding pixels between images. Subsequently, to tackle NRD and geometric distortion in multimodal images, we design a guided correction module that establishes robust reference points (RPs) to guide the search for corresponding pixels in regions with significant modality differences. Moreover, to enhance RPs reliability, we incorporate a phase congruency module during the RPs confirmation stage to concentrate RPs around image edge structures. Finally, we perform finer localization on highly correlated corresponding pixels to obtain the optimized matches. We conduct extensive experiments on four multimodal image datasets to validate the effectiveness of the proposed approach. Experimental results demonstrate that our method can achieve sufficient and robust matches across various modality images and effectively suppress the feature point offset problem.

Pages: 5892-5906

Page count: 15

50 items in total

[31] Attention-based multimodal image matching
Moreshet, Aviad
Keller, Yosi
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
[32] Local similarity measures for multimodal image matching
Rogelj, P
Kovacic, S
IWISPA 2000: PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2000, : 81 - 86
[33] Dense Image-Matching via Optical Flow Field Estimation and Fast-Guided Filter Refinement
Yuan, Wei
Yuan, Xiuxiao
Xu, Shu
Gong, Jianya
Shibasaki, Ryosuke
REMOTE SENSING, 2019, 11 (20)
[34] Image matching using enclosed region detector
Zhang, Wei
Wu, Q. M. Jonathan
Wang, Guanghui
You, Xinge
Wang, Yongfang
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 21 (04) : 271 - 282
[35] Optical multimodal probe for image guided surgery
Yoon, Yeoreum
Jang, Won Hyuk
Kim, Kihean
2014 11TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2014, : 551 - 553
[36] SFA-Net: A SAM-guided focused attention network for multimodal remote sensing image matching
Gao, Tian
Lan, Chaozhen
Huang, Wenjun
Wang, Sheng
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 223 : 188 - 206
[37] Multiscale Template Matching for Multimodal Remote Sensing Image
Gao, Tian
Lan, Chaozhen
Huang, Wenjun
Wang, Longhao
Wei, Zijun
Yao, Fushan
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10132 - 10147
[38] Multimodal unbiased image matching via mutual information
Yanovsky, Igor
Thompson, Paul M.
Osher, Stanley J.
Leow, Alex D.
COMPUTATIONAL IMAGING VI, 2008, 6814
[39] Multimodal Convolutional Neural Networks for Matching Image and Sentence
Ma, Lin
Lu, Zhengdong
Shang, Lifeng
Li, Hang
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2623 - 2631
[40] Guided aggregation and disparity refinement for real-time stereo matching
Yang, Jinlong
Wu, Cheng
Wang, Gang
Chen, Dong
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4467 - 4477
