Multi-layered semantic representation network for multi-label image classification

被引:0
|
作者
Xiwen Qu
Hao Che
Jun Huang
Linchuan Xu
Xiao Zheng
机构
[1] Anhui University of Technology,School of Computer Science and Technology
[2] Hefei Comprehensive National Science Center,Institute of Artificial Intelligence
[3] Australian National University,Department of Computing
[4] The Hong Kong Polytechnic University,undefined
关键词
Multi-label image classification; Convolutional neural network; Label embeddings; Multi-layered attention;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-label image classification is a fundamental and practical task, which aims to assign multiple possible labels to an image. In recent years, many deep convolutional neural network (CNN) based approaches have been proposed which model label correlations to discover semantics of labels and learn semantic representations of images. This paper advances this research direction by improving both the modeling of label correlations and the learning of semantic representations. On the one hand, besides the local semantics of each label, we propose to further explore global semantics shared by multiple labels. On the other hand, existing approaches mainly learn the semantic representations at the last convolutional layer of a CNN. But it has been noted that the image representations of different layers of CNN capture different levels or scales of features and have different discriminative abilities. We thus propose to learn semantic representations at multiple convolutional layers. To this end, this paper designs a Multi-layered Semantic Representation Network (MSRN) which discovers both local and global semantics of labels through modeling label correlations and utilizes the label semantics to guide the semantic representations learning at multiple layers through an attention mechanism. Extensive experiments on five benchmark datasets including VOC2007, VOC2012, MS-COCO, NUS-WIDE, and Apparel show a competitive performance of the proposed MSRN against state-of-the-art models.
引用
下载
收藏
页码:3427 / 3435
页数:8
相关论文
共 50 条
  • [41] MULTIMODAL LEARNING FOR MULTI-LABEL IMAGE CLASSIFICATION
    Pang, Yanwei
    Ma, Zhao
    Yuan, Yuan
    Li, Xuelong
    Wang, Kongqiao
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1797 - 1800
  • [42] Image to Text Translation by Multi-Label Classification
    Nasierding, Gulisong
    Kouzani, Abbas Z.
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2010, 6216 : 247 - +
  • [43] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [44] Causal multi-label learning for image classification
    Tian, Yingjie
    Bai, Kunlong
    Yu, Xiaotong
    Zhu, Siyu
    NEURAL NETWORKS, 2023, 167 : 626 - 637
  • [45] Multi-label Active Learning for Image Classification
    Wu, Jian
    Sheng, Victor S.
    Zhang, Jing
    Zhao, Pengpeng
    Cui, Zhiming
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5227 - 5231
  • [46] Representation Learning on Multi-layered Heterogeneous Network
    Zhang, Delvin Ce
    Lauw, Hady W.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 399 - 416
  • [47] Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
    Pu, Tao
    Chen, Tianshui
    Wu, Hefeng
    Lin, Liang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2091 - 2098
  • [48] Mmnet: A Multi-Method Network For Multi-Label Classification
    Zhi, Cheng
    2020 5TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2020), 2020, : 441 - 445
  • [49] Aligning Image Semantics and Label Concepts for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [50] Neural Tensor Network for Multi-Label Classification
    Hong, Wenxing
    Xu, Wenjing
    Qi, Jianwei
    Weng, Yang
    IEEE ACCESS, 2019, 7 : 96936 - 96941