Dynamic attention-based detector and descriptor with effective and derivable loss for image matching

被引:2
|
作者
Yang, Hua [1 ]
Jiang, Yuyang [1 ]
Huang, Kaiji [1 ]
Yin, Zhouping [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan, Peoples R China
关键词
local feature; detector; descriptor; image matching; visual location; 3D reconstruction; LINE;
D O I
10.1117/1.JEI.32.2.023022
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Jointly learned detectors and descriptors are becoming increasingly popular because they can simplify the matching process and obtain more correspondences than traditional tools. However, most methods yield low keypoint detection accuracy due to the large receptive field of the detection score map. In addition, existing methods lack efficient detector loss functions because the coordinates of keypoints are discrete and nonderivable. To mitigate these two problems, we propose a method called dynamic attention-based detector and descriptor with effective and derivable loss (DA-Net). For the first problem, a dynamic attention convolution-based feature extraction module is proposed to select the most suitable parameters for different samples. In addition, a multilayer feature self-difference detection (MFSD) module is proposed to detect keypoints with high accuracy. In the MFSD module, multilayer feature maps are used to calculate their feature self-difference maps, and they are fused to obtain a detection score map. For the second problem, an approximate keypoint distance loss function is proposed by approximately regressing the coordinates of the local maximum as keypoint coordinates, allowing the calculations involving keypoint coordinates to backpropagate. Moreover, two descriptor loss functions are proposed to learn reliable descriptors. A series of experiments based on widely used datasets show that DA-Net outperforms other learned detection and description methods.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Attention-based multimodal image matching
    Moreshet, Aviad
    Keller, Yosi
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [2] Attention-Based Dynamic Subspace Learners for Medical Image Analysis
    Adiga, Sukesh, V
    Dolz, Jose
    Lombaert, Herve
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (09) : 4599 - 4610
  • [3] Single Shot Attention-Based Face Detector
    Zhuang, Chubin
    Zhang, Shifeng
    Zhu, Xiangyu
    Lei, Zhen
    Li, Stan Z.
    [J]. BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 285 - 293
  • [4] Attention-Based Real Image Restoration
    Anwar, Saeed
    Barnes, Nick
    Petersson, Lars
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021,
  • [5] Visual Attention-Based Image Watermarking
    Bhowmik, Deepayan
    Oakes, Matthew
    Abhayaratne, Charith
    [J]. IEEE ACCESS, 2016, 4 : 8002 - 8018
  • [6] Dynamic Attention-based Visual Odometry
    Kuo, Xin-Yu
    Liu, Chien
    Lin, Kai-Chen
    Luo, Evan
    Chen, Yu-Wen
    Lee, Chun-Yi
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5753 - 5760
  • [7] Dynamic Attention-based Visual Odometry
    Kuo, Xin-Yu
    Liu, Chien
    Lin, Kai-Chen
    Lee, Chun-Yi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 160 - 169
  • [8] Semi-supervised Keypoint Detector and Descriptor for Retinal Image Matching
    Liu, Jiazhen
    Li, Xirong
    Wei, Qijie
    Xu, Jie
    Ding, Dayong
    [J]. COMPUTER VISION, ECCV 2022, PT XXI, 2022, 13681 : 593 - 609
  • [9] Image matching with an improved descriptor based on SIFT
    Hu, Xuemei
    Ding, Yan
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON ELECTRONICS AND INFORMATION ENGINEERING, 2017, 10322
  • [10] Image Matching Based on LBP and SIFT Descriptor
    Kabbai, Leila
    Azaza, Aymen
    Abdellaoui, Mehrez
    Douik, Ali
    [J]. 2015 IEEE 12TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2015,