Double Attention for Multi-Label Image Classification

被引:9
|
作者
Zhao, Haiying [1 ]
Zhou, Wei [2 ]
Hou, Xiaogang [1 ]
Zhu, Hui [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Natl Pilot Software Engn Sch, Sch Comp Sci, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Digital Media & Design Art, Beijing 100876, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Correlation; Feature extraction; Task analysis; Image classification; Semantics; Spatial resolution; Predictive models; Multi-label classification; multi-scale features; attention mechanism; label correlation;
D O I
10.1109/ACCESS.2020.3044446
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label image classification is an essential task in image processing. How to improve the correlation between labels by learning multi-scale features from images is a very challenging problem. We propose a Double Attention Network (DAN) to improve the correlation between image feature regions and labels, as well as between labels and labels. Firstly, the dynamic learning strategy is used to extract the multi-scale features of the image to solve the problem of inconsistent scale of objects in the image. Secondly, in order to improve the correlation between the image feature regions and the labels, we use the spatial attention module to focus on the important regions of the image to learn their salient features, while we use the channel attention module to model the correlation between the channels to improve the correlation between the labels. Finally, the output features of two attention modules are fused as one multi-label image classification model. Experiments on MS-COCO 2014 dataset, Pascal VOC 2007 dataset and NUS-WIDE dataset demonstrate that our model is significantly better than the state-of-the-art models. Besides, visualization analyses show that our model has a strong ability for image salient feature learning and label correlation capturing.
引用
收藏
页码:225539 / 225550
页数:12
相关论文
共 50 条
  • [1] Double Attention Based on Graph Attention Network for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [2] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [3] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    [J]. IEEE ACCESS, 2019, 7 : 98005 - 98013
  • [4] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [5] DATran: Dual Attention Transformer for Multi-Label Image Classification
    Zhou, Wei
    Zheng, Zhijie
    Su, Tao
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 342 - 356
  • [6] Pose Guided Attention for Multi-label Fashion Image Classification
    Ferreira, Beatriz Quintino
    Costeira, Joao P.
    Sousa, Ricardo G.
    Gui, Liang-Yan
    Gomes, Joao P.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3125 - 3128
  • [7] Visual Attention Consistency under Image Transforms for Multi-Label Image Classification
    Guo, Hao
    Zheng, Kang
    Fan, Xiaochuan
    Yu, Hongkai
    Wang, Song
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 729 - 739
  • [8] Attention-Augmented Memory Network for Image Multi-Label Classification
    Zhou, Wei
    Hou, Yanke
    Chen, Dihu
    Hu, Haifeng
    Su, Tao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [9] Multi-label Image Classification via Coarse-to-Fine Attention*
    Lyu, Fan
    Li, Linyan
    Victor, S. Sheng
    Fu, Qiming
    Hu, Fuyuan
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (06) : 1118 - 1126
  • [10] Multi-label Image Classification via Coarse-to-Fine Attention
    LYU Fan
    LI Linyan
    Victor S.Sheng
    FU Qiming
    HU Fuyuan
    [J]. Chinese Journal of Electronics, 2019, 28 (06) : 1118 - 1126