Multi-label Image Classification via Coarse-to-Fine Attention*

被引:6
|
作者
Lyu, Fan [1 ,2 ]
Li, Linyan [3 ]
Victor, S. Sheng [4 ]
Fu, Qiming [1 ]
Hu, Fuyuan [1 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[3] Suzhou Inst Trade & Commerce, Suzhou 215009, Peoples R China
[4] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
中国国家自然科学基金;
关键词
feature extraction; image classification; image recognition; image representation; learning (artificial intelligence); neural nets; object detection; visual databases; attention mechanism; conventional attention-based methods; positive labels; negative labels; image classification method; popular multilabel image datasets; multilabel image classification; coarse-to-fine attention; Multi-label classification; Convolutional neural network; Recurrent neural network; Attention;
D O I
10.1049/cje.2019.07.015
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Great efforts have been made by using deep neural networks to recognize multi-label images. Since multi-label image classification is very complicated, many studies seek to use the attention mechanism as a kind of guidance. Conventional attention-based methods always analyzed images directly and aggressively, which is difficult to well understand complicated scenes. We propose a global/local attention method that can recognize a multi-label image from coarse to fine by mimicking how human-beings observe images. Our global/local attention method first concentrates on the whole image, and then focuses on its local specific objects. We also propose a joint max-margin objective function, which enforces that the minimum score of positive labels should be larger than the maximum score of negative labels horizontally and vertically. This function further improve our multi-label image classification method. We evaluate the effectiveness of our method on two popular multi-label image datasets (i.e., Pascal VOC and MS-COCO). Our experimental results show that our method outperforms state-of-the-art methods.
引用
收藏
页码:1118 / 1126
页数:9
相关论文
共 50 条
  • [1] Multi-label Image Classification via Coarse-to-Fine Attention
    LYU Fan
    LI Linyan
    Victor S.Sheng
    FU Qiming
    HU Fuyuan
    [J]. Chinese Journal of Electronics, 2019, 28 (06) : 1118 - 1126
  • [2] Coarse to Fine: Multi-label Image Classification with Global/Local Attention
    Lyu, Fan
    Hu, Fuyuan
    Sheng, Victor S.
    Wu, Zhengtian
    Fu, Qiming
    Fu, Baochuan
    [J]. 2018 IEEE INTERNATIONAL SMART CITIES CONFERENCE (ISC2), 2018,
  • [3] Double Attention for Multi-Label Image Classification
    Zhao, Haiying
    Zhou, Wei
    Hou, Xiaogang
    Zhu, Hui
    [J]. IEEE ACCESS, 2020, 8 : 225539 - 225550
  • [4] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [5] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    [J]. IEEE ACCESS, 2019, 7 : 98005 - 98013
  • [6] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [7] DATran: Dual Attention Transformer for Multi-Label Image Classification
    Zhou, Wei
    Zheng, Zhijie
    Su, Tao
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 342 - 356
  • [8] Pose Guided Attention for Multi-label Fashion Image Classification
    Ferreira, Beatriz Quintino
    Costeira, Joao P.
    Sousa, Ricardo G.
    Gui, Liang-Yan
    Gomes, Joao P.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3125 - 3128
  • [9] Image-to-Markup Generation with Coarse-to-Fine Attention
    Deng, Yuntian
    Kanervisto, Anssi
    Ling, Jeffrey
    Rush, Alexander M.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [10] Double Attention Based on Graph Attention Network for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)