Learning label correlations for multi-label image recognition with graph networks

被引:22
|
作者
Li, Qing [1 ,2 ]
Peng, Xiaojiang [2 ]
Qiao, Yu [2 ,3 ]
Peng, Qiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
D O I
10.1016/j.patrec.2020.07.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:378 / 384
页数:7
相关论文
共 50 条
  • [31] Partial multi-label learning based on sparse asymmetric label correlations
    Zhao, Peng
    Zhao, Shiyi
    Zhao, Xuyang
    Liu, Huiting
    Ji, Xia
    KNOWLEDGE-BASED SYSTEMS, 2022, 245
  • [32] Multi-Label Active Learning with Label Correlation for Image Classification
    Ye, Chen
    Wu, Jian
    Sheng, Victor S.
    Zhao, Pengpeng
    Cui, Zhiming
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3437 - 3441
  • [33] Multi-label weak-label learning via semantic reconstruction and label correlations
    Zhao, Dawei
    Li, Hong
    Lu, Yixiang
    Sun, Dong
    Zhu, De
    Gao, Qingwei
    INFORMATION SCIENCES, 2023, 623 : 379 - 401
  • [34] Multi-label zero-shot learning with graph convolutional networks
    Ou, Guangjin
    Yu, Guoxian
    Domeniconi, Carlotta
    Lu, Xuequan
    Zhang, Xiangliang
    NEURAL NETWORKS, 2020, 132 (132) : 333 - 341
  • [35] Multi-label learning based on iterative label propagation over graph
    Fu, Bin
    Wang, Zhihai
    Xu, Guandong
    Cao, Longbing
    PATTERN RECOGNITION LETTERS, 2014, 42 : 85 - 90
  • [36] Calibrated Multi-label Classification with Label Correlations
    Zhi-Fen He
    Ming Yang
    Hui-Dong Liu
    Lei Wang
    Neural Processing Letters, 2019, 50 : 1361 - 1380
  • [37] Calibrated Multi-label Classification with Label Correlations
    He, Zhi-Fen
    Yang, Ming
    Liu, Hui-Dong
    Wang, Lei
    NEURAL PROCESSING LETTERS, 2019, 50 (02) : 1361 - 1380
  • [38] Multi-label classification by exploiting label correlations
    Yu, Ying
    Pedrycz, Witold
    Miao, Duoqian
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (06) : 2989 - 3004
  • [39] Label Co-Occurrence Learning With Graph Convolutional Networks for Multi-Label Chest X-Ray Image Classification
    Chen, Bingzhi
    Li, Jinxing
    Lu, Guangming
    Yu, Hongbing
    Zhang, David
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (08) : 2292 - 2302
  • [40] Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition
    Chen, Zhao-Min
    Cui, Quan
    Wei, Xiu-Shen
    Jin, Xin
    Guo, Yanwen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1827 - 1840