Multi-layered semantic representation network for multi-label image classification

被引:0
|
作者
Xiwen Qu
Hao Che
Jun Huang
Linchuan Xu
Xiao Zheng
机构
[1] Anhui University of Technology,School of Computer Science and Technology
[2] Hefei Comprehensive National Science Center,Institute of Artificial Intelligence
[3] Australian National University,Department of Computing
[4] The Hong Kong Polytechnic University,undefined
关键词
Multi-label image classification; Convolutional neural network; Label embeddings; Multi-layered attention;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-label image classification is a fundamental and practical task, which aims to assign multiple possible labels to an image. In recent years, many deep convolutional neural network (CNN) based approaches have been proposed which model label correlations to discover semantics of labels and learn semantic representations of images. This paper advances this research direction by improving both the modeling of label correlations and the learning of semantic representations. On the one hand, besides the local semantics of each label, we propose to further explore global semantics shared by multiple labels. On the other hand, existing approaches mainly learn the semantic representations at the last convolutional layer of a CNN. But it has been noted that the image representations of different layers of CNN capture different levels or scales of features and have different discriminative abilities. We thus propose to learn semantic representations at multiple convolutional layers. To this end, this paper designs a Multi-layered Semantic Representation Network (MSRN) which discovers both local and global semantics of labels through modeling label correlations and utilizes the label semantics to guide the semantic representations learning at multiple layers through an attention mechanism. Extensive experiments on five benchmark datasets including VOC2007, VOC2012, MS-COCO, NUS-WIDE, and Apparel show a competitive performance of the proposed MSRN against state-of-the-art models.
引用
下载
收藏
页码:3427 / 3435
页数:8
相关论文
共 50 条
  • [1] Multi-layered semantic representation network for multi-label image classification
    Qu, Xiwen
    Che, Hao
    Huang, Jun
    Xu, Linchuan
    Zheng, Xiao
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (10) : 3427 - 3435
  • [2] Multi-label image classification with multi-layered multi-perspective dynamic semantic representation
    Kuang, Wenlan
    Li, Zhixin
    MACHINE LEARNING, 2024, 113 (06) : 3443 - 3461
  • [3] Multi-label image classification with multi-layered multi-perspective dynamic semantic representation
    Wenlan Kuang
    Zhixin Li
    Machine Learning, 2024, 113 : 3443 - 3461
  • [4] Semantic-Guided Representation Enhancement for Multi-Label Image Classification
    Zhu X.
    Li J.
    Cao J.
    Tang D.
    Liu J.
    Liu B.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 1 - 1
  • [5] Semantic Supplementary Network With Prior Information for Multi-Label Image Classification
    Wang, Zhe
    Fang, Zhongli
    Li, Dongdong
    Yang, Hai
    Du, Wenli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 1848 - 1859
  • [6] A semantic guidance-based fusion network for multi-label image classification
    Wang, Jiuhang
    Tang, Hongying
    Luo, Shanshan
    Yang, Liqi
    Liu, Shusheng
    Hong, Aoping
    Li, Baoqing
    PATTERN RECOGNITION LETTERS, 2024, 185 : 254 - 261
  • [7] Semantic representation and dependency learning for multi-label image recognition
    Pu, Tao
    Sun, Mingzhan
    Wu, Hefeng
    Chen, Tianshui
    Tian, Ling
    Lin, Liang
    NEUROCOMPUTING, 2023, 526 : 121 - 130
  • [8] Multi-label Image Classification with Multi-scale Global-Local Semantic Graph Network
    Kuang, Wenlan
    Zhu, Qiangxi
    Li, Zhixin
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, ECML PKDD 2023, PT III, 2023, 14171 : 53 - 69
  • [9] Multi-label image classification with recurrently learning semantic dependencies
    Long Chen
    Ronggui Wang
    Juan Yang
    Lixia Xue
    Min Hu
    The Visual Computer, 2019, 35 : 1361 - 1371
  • [10] Multi-label image classification with recurrently learning semantic dependencies
    Chen, Long
    Wang, Ronggui
    Yang, Juan
    Xue, Lixia
    Hu, Min
    VISUAL COMPUTER, 2019, 35 (10): : 1361 - 1371