Learning label correlations for multi-label image recognition with graph networks

被引:22
|
作者
Li, Qing [1 ,2 ]
Peng, Xiaojiang [2 ]
Qiao, Yu [2 ,3 ]
Peng, Qiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
D O I
10.1016/j.patrec.2020.07.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:378 / 384
页数:7
相关论文
共 50 条
  • [21] Semantic representation and dependency learning for multi-label image recognition
    Pu, Tao
    Sun, Mingzhan
    Wu, Hefeng
    Chen, Tianshui
    Tian, Ling
    Lin, Liang
    NEUROCOMPUTING, 2023, 526 : 121 - 130
  • [22] Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition
    Chen, Tianshui
    Wang, Zhouxia
    Li, Guanbin
    Lin, Liang
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6730 - 6737
  • [23] Multi-Label Learning for Activity Recognition
    Kumar, R.
    Qamar, I.
    Virdi, J. S.
    Krishnan, N. C.
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS IE 2015, 2015, : 152 - 155
  • [24] Dual-channel graph contrastive learning for multi-label classification with label-specific features and label correlations
    Zhu X.
    Zhu T.
    Li J.
    Wang J.
    Neural Computing and Applications, 2024, 36 (23) : 14483 - 14502
  • [25] Partial Multi-label Learning with Instance Correlations
    Gao, Guangliang
    Zhan, Zhiwei
    Sun, Jiachen
    Sun, Aiqin
    Lan, Haoliang
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL II, CENET 2023, 2024, 1126 : 432 - 438
  • [26] Image emotion multi-label classification based on multi-graph learning
    Wang, Meixia
    Zhao, Yuhai
    Wang, Yejiang
    Xu, Tongze
    Sun, Yiming
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [27] Multi-label Iterated Learning for Image Classification with Label Ambiguity
    Rajeswar, Sai
    Rodriguez, Pau
    Singhal, Soumye
    Vazquez, David
    Courville, Aaron
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4773 - 4783
  • [28] Image multi-label learning algorithm based on label correlation
    Huang, Mengyue
    Zhao, Ping
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 606 - 609
  • [29] Partial multi-label learning based on sparse asymmetric label correlations
    Zhao, Peng
    Zhao, Shiyi
    Zhao, Xuyang
    Liu, Huiting
    Ji, Xia
    Knowledge-Based Systems, 2022, 245
  • [30] Improving multi-label learning by modeling Local label and feature correlations
    Cheng, Qianqian
    Huang, Jun
    Zhang, Huiyi
    Chen, Sibao
    Zheng, Xiao
    INTELLIGENT DATA ANALYSIS, 2023, 27 (02) : 379 - 398