Double Attention for Multi-Label Image Classification

被引:9
|
作者
Zhao, Haiying [1 ]
Zhou, Wei [2 ]
Hou, Xiaogang [1 ]
Zhu, Hui [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Natl Pilot Software Engn Sch, Sch Comp Sci, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Digital Media & Design Art, Beijing 100876, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Correlation; Feature extraction; Task analysis; Image classification; Semantics; Spatial resolution; Predictive models; Multi-label classification; multi-scale features; attention mechanism; label correlation;
D O I
10.1109/ACCESS.2020.3044446
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label image classification is an essential task in image processing. How to improve the correlation between labels by learning multi-scale features from images is a very challenging problem. We propose a Double Attention Network (DAN) to improve the correlation between image feature regions and labels, as well as between labels and labels. Firstly, the dynamic learning strategy is used to extract the multi-scale features of the image to solve the problem of inconsistent scale of objects in the image. Secondly, in order to improve the correlation between the image feature regions and the labels, we use the spatial attention module to focus on the important regions of the image to learn their salient features, while we use the channel attention module to model the correlation between the channels to improve the correlation between the labels. Finally, the output features of two attention modules are fused as one multi-label image classification model. Experiments on MS-COCO 2014 dataset, Pascal VOC 2007 dataset and NUS-WIDE dataset demonstrate that our model is significantly better than the state-of-the-art models. Besides, visualization analyses show that our model has a strong ability for image salient feature learning and label correlation capturing.
引用
收藏
页码:225539 / 225550
页数:12
相关论文
共 50 条
  • [21] A NOVEL MULTI-ATTENTION DRIVEN SYSTEM FOR MULTI-LABEL REMOTE SENSING IMAGE CLASSIFICATION
    Sumbul, Gencer
    Demir, Begum
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5726 - 5729
  • [22] Image to Text Translation by Multi-Label Classification
    Nasierding, Gulisong
    Kouzani, Abbas Z.
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2010, 6216 : 247 - +
  • [23] MULTIMODAL LEARNING FOR MULTI-LABEL IMAGE CLASSIFICATION
    Pang, Yanwei
    Ma, Zhao
    Yuan, Yuan
    Li, Xuelong
    Wang, Kongqiao
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1797 - 1800
  • [24] Multi-label Active Learning for Image Classification
    Wu, Jian
    Sheng, Victor S.
    Zhang, Jing
    Zhao, Pengpeng
    Cui, Zhiming
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5227 - 5231
  • [25] Multi-label legal text classification with BiLSTM and attention
    Enamoto, Liriam
    Santos, Andre R. A. S.
    Maia, Ricardo
    Weigang, Li
    Rocha Filho, Geraldo P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (04) : 369 - 378
  • [26] Label-Guided Cross-Modal Attention Network for Multi-Label Aerial Image Classification
    Chen, Ying
    Zhang, Ding
    Han, Tao
    Meng, Xiaoliang
    Gao, Mianxin
    Wang, Teng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [27] Multi-label Iterated Learning for Image Classification with Label Ambiguity
    Rajeswar, Sai
    Rodriguez, Pau
    Singhal, Soumye
    Vazquez, David
    Courville, Aaron
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4773 - 4783
  • [28] LABEL RELATION INFERENCE FOR MULTI-LABEL AERIAL IMAGE CLASSIFICATION
    Hua, Yuansheng
    Mou, Lichao
    Zhu, Xiao Xiang
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5244 - 5247
  • [29] Multi-Label Active Learning with Label Correlation for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Pengpeng
    Cui, Zhiming
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3437 - 3441
  • [30] Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks
    Lyu, Fan
    Wu, Qi
    Hu, Fuyuan
    Wu, Qingyao
    Tan, Mingkui
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) : 1971 - 1981