Coarse to Fine: Multi-label Image Classification with Global/Local Attention

被引:0
|
作者
Lyu, Fan [1 ]
Hu, Fuyuan [1 ]
Sheng, Victor S. [2 ]
Wu, Zhengtian [1 ]
Fu, Qiming [3 ]
Fu, Baochuan [1 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou, Peoples R China
[2] Univ Cent Arkansas, Comp Sci Dept, Conway, AR USA
[3] Jiangsu Prov Key Lab Intelligent Bldg Energy Effi, Suzhou, Peoples R China
关键词
Multi-label image classification; Scene recognition; Deep learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our daily life, the scenes around us are always with multiple labels especially in a smart city, i.e., recognizing the information of city operation to response and control. Great efforts have been made by using Deep Neural Networks to recognize multi-label images. Since multi-label image classification is very complicated, people seek to use the attention mechanism to guide the classification process. However, conventional attention-based methods always analyzed images directly and aggressively. It is difficult for them to well understand complicated scenes. In this paper, we propose a global/local attention method that can recognize an image from coarse to fine by mimicking how humanbeings observe images. Specifically, our global/local attention method first concentrates on the whole image, and then focuses on local specific objects in the image. We also propose a joint max-margin objective function, which enforces that the minimum score of positive labels should be larger than the maximum score of negative labels horizontally and vertically. This function can further improve our multi-label image classification method. We evaluate the effectiveness of our method on two popular multilabel image datasets (i.e., Pascal VOC and MS-COCO). Our experimental results show that our method outperforms state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Multi-label Image Classification via Coarse-to-Fine Attention*
    Lyu, Fan
    Li, Linyan
    Victor, S. Sheng
    Fu, Qiming
    Hu, Fuyuan
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (06) : 1118 - 1126
  • [2] Multi-label Image Classification via Coarse-to-Fine Attention
    LYU Fan
    LI Linyan
    Victor S.Sheng
    FU Qiming
    HU Fuyuan
    [J]. Chinese Journal of Electronics, 2019, 28 (06) : 1118 - 1126
  • [3] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [4] Double Attention for Multi-Label Image Classification
    Zhao, Haiying
    Zhou, Wei
    Hou, Xiaogang
    Zhu, Hui
    [J]. IEEE ACCESS, 2020, 8 : 225539 - 225550
  • [5] Self-Supervised Multi-Label Classification with Global Context and Local Attention
    Chen, Chun-Yen
    Yeh, Mei-Chen
    [J]. PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 934 - 942
  • [6] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    [J]. IEEE ACCESS, 2019, 7 : 98005 - 98013
  • [7] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [8] DATran: Dual Attention Transformer for Multi-Label Image Classification
    Zhou, Wei
    Zheng, Zhijie
    Su, Tao
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 342 - 356
  • [9] Pose Guided Attention for Multi-label Fashion Image Classification
    Ferreira, Beatriz Quintino
    Costeira, Joao P.
    Sousa, Ricardo G.
    Gui, Liang-Yan
    Gomes, Joao P.
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3125 - 3128
  • [10] Robust Multi-Label Classification with Enhanced Global and Local Label Correlation
    Zhao, Tianna
    Zhang, Yuanjian
    Pedrycz, Witold
    [J]. MATHEMATICS, 2022, 10 (11)