Visual Attention Consistency under Image Transforms for Multi-Label Image Classification

被引:164
|
作者
Guo, Hao [2 ]
Zheng, Kang [2 ]
Fan, Xiaochuan [2 ]
Yu, Hongkai [3 ]
Wang, Song [1 ,2 ]
机构
[1] Tianjin Univ, Tianjin, Peoples R China
[2] Univ South Carolina, Columbia, SC 29208 USA
[3] Univ Texas Rio Grande Valley, Edinburg, TX USA
关键词
SELECTIVE ATTENTION;
D O I
10.1109/CVPR.2019.00082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human visual perception shows good consistency for many multi-label image classification tasks under certain spatial transforms, such as scaling, rotation, flipping and translation. This has motivated the data augmentation strategy widely used in CNN classifier training - transformed images are included for training by assuming the same class labels as their original images. In this paper, we further propose the assumption of perceptual consistency of visual attention regions for classification under such transforms, i.e., the attention region for a classification follows the same transform if the input image is spatially transformed. While the attention regions of CNN classifiers can be derived as an attention heatmap in middle layers of the network, we find that their consistency under many transforms are not preserved. To address this problem, we propose a two-branch network with an original image and its transformed image as inputs and introduce a new attention consistency loss that measures the attention heatmap consistency between two branches. This new loss is then combined with multi-label image classification loss for network training. Experiments on three datasets verify the superiority of the proposed network by achieving new state-of-the-art classification performance.
引用
收藏
页码:729 / 739
页数:11
相关论文
共 50 条
  • [1] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [2] Double Attention for Multi-Label Image Classification
    Zhao, Haiying
    Zhou, Wei
    Hou, Xiaogang
    Zhu, Hui
    [J]. IEEE ACCESS, 2020, 8 : 225539 - 225550
  • [3] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    [J]. IEEE ACCESS, 2019, 7 : 98005 - 98013
  • [4] Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks
    Lyu, Fan
    Wu, Qi
    Hu, Fuyuan
    Wu, Qingyao
    Tan, Mingkui
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) : 1971 - 1981
  • [5] DATran: Dual Attention Transformer for Multi-Label Image Classification
    Zhou, Wei
    Zheng, Zhijie
    Su, Tao
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 342 - 356
  • [6] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [7] Pose Guided Attention for Multi-label Fashion Image Classification
    Ferreira, Beatriz Quintino
    Costeira, Joao P.
    Sousa, Ricardo G.
    Gui, Liang-Yan
    Gomes, Joao P.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3125 - 3128
  • [8] Double Attention Based on Graph Attention Network for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [9] Attention-Augmented Memory Network for Image Multi-Label Classification
    Zhou, Wei
    Hou, Yanke
    Chen, Dihu
    Hu, Haifeng
    Su, Tao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [10] Multi-label Image Classification via Coarse-to-Fine Attention*
    Lyu, Fan
    Li, Linyan
    Victor, S. Sheng
    Fu, Qiming
    Hu, Fuyuan
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (06) : 1118 - 1126