Learning label correlations for multi-label image recognition with graph networks

被引:22
|
作者
Li, Qing [1 ,2 ]
Peng, Xiaojiang [2 ]
Qiao, Yu [2 ,3 ]
Peng, Qiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
D O I
10.1016/j.patrec.2020.07.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:378 / 384
页数:7
相关论文
共 50 条
  • [1] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [2] Learning Graph Convolutional Networks for Multi-Label Recognition and Applications
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6969 - 6983
  • [3] Label graph learning for multi-label image recognition with cross-modal fusion
    Yanzhao Xie
    Yangtao Wang
    Yu Liu
    Ke Zhou
    Multimedia Tools and Applications, 2022, 81 : 25363 - 25381
  • [4] Label graph learning for multi-label image recognition with cross-modal fusion
    Xie, Yanzhao
    Wang, Yangtao
    Liu, Yu
    Zhou, Ke
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (18) : 25363 - 25381
  • [5] Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition
    Chen, Tianshui
    Xu, Muxin
    Hui, Xiaolu
    Wu, Hefeng
    Lin, Liang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 522 - 531
  • [6] Scene-Aware Label Graph Learning for Multi-Label Image Classification
    Zhu, Xuelin
    Liu, Jian
    Liu, Weijia
    Ge, Jiawei
    Liu, Bo
    Cao, Jiuxin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1473 - 1482
  • [7] Label-aware graph representation learning for multi-label image classification
    Chen, Yilu
    Zou, Changzhong
    Chen, Jianli
    NEUROCOMPUTING, 2022, 492 : 50 - 61
  • [8] Joint learning of multi-label classification and label correlations
    He, Zhi-Fen
    Yang, Ming
    Liu, Hui-Dong
    Ruan Jian Xue Bao/Journal of Software, 2014, 25 (09): : 1967 - 1981
  • [9] Multi-Label learning by exploiting label correlations with LDA
    Peng, Yue
    Chen, Gang
    Xu, Ming
    Wang, Chongjun
    Xie, Junyuan
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 168 - 174
  • [10] Learning discriminative representations for multi-label image recognition
    Hassanin, Mohammed
    Radwan, Ibrahim
    Khan, Salman
    Tahtali, Murat
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83