Co-Attention for Conditioned Image Matching

被引:31
|
作者
Wiles, Olivia [1 ]
Ehrhardt, Sebastien [1 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, VGG, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
SCALE;
D O I
10.1109/CVPR46437.2021.01566
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new approach to determine correspondences between image pairs in the wild under large changes in illumination, viewpoint, context, and material. While other approaches find correspondences between pairs of images by treating the images independently, we instead condition on both images to implicitly take account of the differences between them. To achieve this, we introduce (i) a spatial attention mechanism (a co-attention module, CoAM) for conditioning the learned features on both images, and (ii) a distinctiveness score used to choose the best matches at test time. CoAM can be added to standard architectures and trained using self-supervision or supervised data, and achieves a significant performance improvement under hard conditions, e.g. large viewpoint changes. We demonstrate that models using CoAM achieve state of the art or competitive results on a wide range of tasks: local matching, camera localization, 3D reconstruction, and image stylization.
引用
收藏
页码:15915 / 15924
页数:10
相关论文
共 50 条
  • [1] SAR and Optical Image Registration Based on Deep Learning with Co-Attention Matching Module
    Chen, Jiaxing
    Xie, Hongtu
    Zhang, Lin
    Hu, Jun
    Jiang, Hejun
    Wang, Guoqian
    [J]. REMOTE SENSING, 2023, 15 (15)
  • [2] Sentence Matching with Deep Self-attention and Co-attention Features
    Wang, Zhipeng
    Yan, Danfeng
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2021, PT II, 2021, 12816 : 550 - 561
  • [3] Hermitian Co-Attention Networks for Text Matching in Asymmetrical Domains
    Tay, Yi
    Anh Tuan Luu
    Hui, Siu Cheung
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4425 - 4431
  • [4] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
    Huang, Lufei
    Sun, Fengming
    Yuan, Xia
    [J]. PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
  • [5] Multi-decoder Based Co-attention for Image Captioning
    Sun, Zhen
    Lin, Xin
    Wang, Zhaohui
    Ji, Yi
    Liu, Chunping
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 200 - 210
  • [6] Co-attention enabled content-based image retrieval
    Hu, Zechao
    Bors, Adrian G.
    [J]. NEURAL NETWORKS, 2023, 164 : 245 - 263
  • [7] Bi-Directional Co-Attention Network for Image Captioning
    Jiang, Weitao
    Wang, Weixuan
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (04)
  • [8] Streamer temporal action detection in live video by co-attention boundary matching
    Chenhao Li
    Chen He
    Hui Zhang
    Jiacheng Yao
    Jing Zhang
    Li Zhuo
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 3071 - 3088
  • [9] Streamer temporal action detection in live video by co-attention boundary matching
    Li, Chenhao
    He, Chen
    Zhang, Hui
    Yao, Jiacheng
    Zhang, Jing
    Zhuo, Li
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (10) : 3071 - 3088
  • [10] Identity-Aware Textual-Visual Matching with Latent Co-attention
    Li, Shuang
    Xiao, Tong
    Li, Hongsheng
    Yang, Wei
    Wang, Xiaogang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1908 - 1917