Multi-Scale Context Attention Network for Image Retrieval

Cited by: 12
Authors
Lou, Yihang [1]
Bai, Yan [1]
Wang, Shiqi [2]
Duan, Ling-Yu [1]
Affiliations
[1] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
[2] City Univ Hong Kong, Comp Sci, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image Retrieval; Multi-Scale Context; Attention Network; FEATURES
DOI
10.1145/3240508.3240602
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Recent attempts at Convolutional Neural Network (CNN) based image retrieval usually adopt the output of a specific convolutional or fully connected layer as the feature representation. Though superior representation capability has yielded better retrieval performance, scale variation and clutter distraction remain two challenging problems in CNN based image retrieval. In this work, we propose a Multi-Scale Context Attention Network (MSCAN) to generate global descriptors, which is able to selectively focus on informative regions with the assistance of multi-scale context information. We model the multi-scale context information by an improved Long Short-Term Memory (LSTM) network across different layers. As such, the proposed global descriptor is equipped with scale-aware attention capability. Experimental results show that our proposed method can effectively capture the informative regions in images and retain reliable attention responses when encountering scale variation and clutter distraction. Moreover, we compare the performance of the proposed scheme with state-of-the-art global descriptors, and extensive results verify that the proposed MSCAN achieves superior performance on several image retrieval benchmarks.
Pages: 1128-1136
Page count: 9