ContextMatcher: Detector-Free Feature Matching With Cross-Modality Context

被引:0
|
作者
Li, Dongyue [1 ]
Du, Songlin [2 ,3 ]
机构
[1] Southeast Univ, Sch Software, Nanjing 210096, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[3] Southeast Univ, Shenzhen Res Inst, Shenzhen 518063, Peoples R China
基金
中国国家自然科学基金;
关键词
Local feature matching; transformer; feature extraction; feature representation; convolutional neural network; neighborhood consensus; SCALE;
D O I
10.1109/TCSVT.2024.3383334
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing feature matching methods tend to extract feature descriptors by relying on the visual appearance, leading to false matches which are obviously false from the geometric perspective. This paper proposes ContextMatcher, which goes beyond the visual appearance representation by introducing the geometric context to guild the feature matching. Specifically, our ContextMatcher includes visual descriptors generation, the neighborhood consensus module, and the geometric context encoder. To learn visual descriptors, Transformers situated in different branches are leveraged to obtain feature descriptors. In one branch, convolutions are integrated into self-attention layers elegantly to compensate for the lack of the local structure information. In another branch, a cross-scale Transformer is proposed through injecting heterogeneous receptive field sizes into tokens. To leverage and aggregate the geometric contextual information, a neighborhood consensus mechanism is proposed by re-ranking initial pixel-level matches to make a constraint of geometric consensus on neighborhood feature descriptors. Moreover, local feature descriptors are boosted through combining with the geometric properties of keypoints for refining matches to the sub-pixel level. Extensive experiments on relative pose estimations and image matching show that our proposed method outperforms existing state-of-the-art methods by a large margin.
引用
收藏
页码:7922 / 7934
页数:13
相关论文
共 50 条
  • [21] INVERSE CROSS-MODALITY MATCHING AND PSYCHOPHYSICAL LAW
    DAWSON, WE
    LILIENTHAL, MG
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1976, 8 (04) : 249 - 249
  • [22] Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering
    Cao, Jianjian
    Qin, Xiameng
    Zhao, Sanyuan
    Shen, Jianbing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022,
  • [23] GRiD: Guided Refinement for Detector-Free Multimodal Image Matching
    Liu, Yuyan
    He, Wei
    Zhang, Hongyan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5892 - 5906
  • [24] ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer
    Chen, Hongkai
    Luo, Zixin
    Zhou, Lei
    Tian, Yurun
    Zhen, Mingmin
    Fang, Tian
    McKinnon, David
    Tsin, Yanghai
    Quan, Long
    COMPUTER VISION - ECCV 2022, PT XXXII, 2022, 13692 : 20 - 36
  • [25] CROSS-MODALITY MATCHING FUNCTIONS GENERATED BY MAGNITUDE ESTIMATION
    STEVENS, JC
    MARKS, LE
    PERCEPTION & PSYCHOPHYSICS, 1980, 27 (05): : 379 - 389
  • [26] CROSS-MODALITY MATCHING IN MEMORY PSYCHOPHYSICS - BRIGHTNESS AND LOUDNESS
    HUBBARD, TL
    PERCEPTUAL AND MOTOR SKILLS, 1993, 76 (03) : 1339 - 1357
  • [27] STIMULUS INFORMATION AND SEQUENTIAL DEPENDENCIES IN CROSS-MODALITY MATCHING
    WARD, LM
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1977, 10 (04) : 275 - 275
  • [28] PATTERN-RECOGNITION IN CROSS-MODALITY LETTER MATCHING
    TEMPANY, C
    BULLETIN OF THE BRITISH PSYCHOLOGICAL SOCIETY, 1981, 34 (JAN): : 38 - 38
  • [29] CROSS-MODALITY MATCHING IN STUDY OF ABNORMAL LOUDNESS FUNCTIONS
    THALMANN, R
    LARYNGOSCOPE, 1965, 75 (11): : 1708 - &
  • [30] Cross-modality feature fusion for night pedestrian detection
    Feng, Yong
    Luo, Enbo
    Lu, Hai
    Zhai, SuWei
    FRONTIERS IN PHYSICS, 2024, 12