Self-Attention based fine-grained cross-media hybrid network

被引:4
|
作者
Shan, Wei [1 ,2 ]
Huang, Dan [3 ]
Wang, Jiangtao [2 ]
Zou, Feng [2 ]
Li, Suwen [2 ]
机构
[1] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing, Peoples R China
[2] Huaibei Normal Univ, Coll Phys & Elect Informat, Huaibei, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci, Beijing, Peoples R China
关键词
Fine-Grained; Cross-Media; Retrieval; Attention;
D O I
10.1016/j.patcog.2022.108748
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the heterogeneity gap, the data representations of different types of media are inconsistent. It is challenging to measure the fine-grained gap between different media. To this end, we propose a self-attention-based hybrid network to learn the common representations of different media data. Specifically, we first utilize a local self-attention layer to learn the common attention space between different media data. Then we propose a similarity concatenation method to understand the content relationship between features. To further improve the robustness of the model, we also learn a local position encoding to capture the spatial relationships between features. Therefore, our proposed approach can effectively reduce the gap between different feature distributions on cross-media retrieval tasks. Extensive experiments and ablation studies demonstrate that our proposed method achieves state-of-the-art performance. The source code and models are publicly available at: https://github.com/NUST-Machine-Intelligence-Laboratory/SAFGCMHN. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network
    Liang, Meiyu
    Du, Junping
    Liu, Wu
    Xue, Zhe
    Geng, Yue
    Yang, Congxian
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1313 - 1321
  • [2] A fine-grained classification method based on self-attention Siamese network
    He Can
    Yuan Guowu
    Wu Hao
    [J]. 2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 148 - 154
  • [3] Spatial self-attention network with self-attention distillation for fine-grained image recognitionx2729;
    Baffour, Adu Asare
    Qin, Zhen
    Wang, Yong
    Qin, Zhiguang
    Choo, Kim-Kwang Raymond
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 81
  • [4] Cross-Media Fine-Grained Representation Learning Based on Multi-modal Graph and Adversarial Hash Attention Network
    Liang M.
    Wang X.
    Du J.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (03): : 195 - 206
  • [5] Cross-media Deep Fine-grained Correlation Learning
    Zhuo Y.-K.
    Qi J.-W.
    Peng Y.-X.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2019, 30 (04): : 884 - 895
  • [6] A New Benchmark and Approach for Fine-grained Cross-media Retrieval
    He, Xiangteng
    Peng, Yuxin
    Xie, Liu
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1740 - 1748
  • [7] Fine-Grained Deep Knowledge-Aware Network for News Recommendation with Self-Attention
    Gao, Jie
    Xin, Xin
    Liu, Junshuai
    Wang, Rui
    Lu, Jing
    Li, Biao
    Fan, Xin
    Guo, Ping
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 81 - 88
  • [8] Multigranularity Self-Attention Network for Fine-Grained Ship Detection in Remote Sensing Images
    Ouyang, Lihan
    Fang, Leyuan
    Ji, Xinyu
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9722 - 9732
  • [9] Cross-media retrieval by exploiting fine-grained correlation at entity level
    Huang, Lei
    Peng, Yuxin
    [J]. NEUROCOMPUTING, 2017, 236 : 123 - 133
  • [10] Fine-Grained Image Classification Based on Cross-Attention Network
    Zheng, Zhiwen
    Zhou, Juxiang
    Gan, Jianhou
    Luo, Sen
    Gao, Wei
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2022, 18 (01)