Coarse to Fine: Multi-label Image Classification with Global/Local Attention

被引:0
|
作者
Lyu, Fan [1 ]
Hu, Fuyuan [1 ]
Sheng, Victor S. [2 ]
Wu, Zhengtian [1 ]
Fu, Qiming [3 ]
Fu, Baochuan [1 ]
机构
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou, Peoples R China
[2] Univ Cent Arkansas, Comp Sci Dept, Conway, AR USA
[3] Jiangsu Prov Key Lab Intelligent Bldg Energy Effi, Suzhou, Peoples R China
关键词
Multi-label image classification; Scene recognition; Deep learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our daily life, the scenes around us are always with multiple labels especially in a smart city, i.e., recognizing the information of city operation to response and control. Great efforts have been made by using Deep Neural Networks to recognize multi-label images. Since multi-label image classification is very complicated, people seek to use the attention mechanism to guide the classification process. However, conventional attention-based methods always analyzed images directly and aggressively. It is difficult for them to well understand complicated scenes. In this paper, we propose a global/local attention method that can recognize an image from coarse to fine by mimicking how humanbeings observe images. Specifically, our global/local attention method first concentrates on the whole image, and then focuses on local specific objects in the image. We also propose a joint max-margin objective function, which enforces that the minimum score of positive labels should be larger than the maximum score of negative labels horizontally and vertically. This function can further improve our multi-label image classification method. We evaluate the effectiveness of our method on two popular multilabel image datasets (i.e., Pascal VOC and MS-COCO). Our experimental results show that our method outperforms state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Multi-label Iterated Learning for Image Classification with Label Ambiguity
    Rajeswar, Sai
    Rodriguez, Pau
    Singhal, Soumye
    Vazquez, David
    Courville, Aaron
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4773 - 4783
  • [42] LABEL RELATION INFERENCE FOR MULTI-LABEL AERIAL IMAGE CLASSIFICATION
    Hua, Yuansheng
    Mou, Lichao
    Zhu, Xiao Xiang
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 5244 - 5247
  • [43] Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks
    Lyu, Fan
    Wu, Qi
    Hu, Fuyuan
    Wu, Qingyao
    Tan, Mingkui
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (08) : 1971 - 1981
  • [44] Multi-label classification of legislative contents with hierarchical label attention networks
    Danielle Caled
    Mário J. Silva
    Bruno Martins
    Miguel Won
    [J]. International Journal on Digital Libraries, 2022, 23 : 77 - 90
  • [45] Multi-label classification of legislative contents with hierarchical label attention networks
    Caled, Danielle
    Silva, Mario J.
    Martins, Bruno
    Won, Miguel
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2022, 23 (01) : 77 - 90
  • [46] Multi-label feature selection with global and local label correlation
    Faraji, Mohammad
    Seyedi, Seyed Amjad
    Tab, Fardin Akhlaghian
    Mahmoodi, Reza
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [47] Global and local multi-view multi-label learning
    Zhu, Changming
    Miao, Duoqian
    Wang, Zhe
    Zhou, Rigui
    Wei, Lai
    Zhang, Xiafen
    [J]. NEUROCOMPUTING, 2020, 371 : 67 - 77
  • [48] Sparse Local Embeddings for Extreme Multi-label Classification
    Bhatia, Kush
    Jain, Himanshu
    Kar, Purushottam
    Varma, Manik
    Jain, Prateek
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [49] Learning Local Instance Constraint for Multi-label Classification
    Luo, Shang
    Wu, Xiaofeng
    Wang, Bin
    Zhang, Liming
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 284 - 294
  • [50] Mining Knowledge within Categories in Global and Local Fashion for Multi-Label Text Classification
    Bi, Sheng
    Shi, Peng
    Du, Yuntao
    Jin, Bin
    Yu, Lingshuang
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,