Multi-layered semantic representation network for multi-label image classification

被引:0
|
作者
Xiwen Qu
Hao Che
Jun Huang
Linchuan Xu
Xiao Zheng
机构
[1] Anhui University of Technology,School of Computer Science and Technology
[2] Hefei Comprehensive National Science Center,Institute of Artificial Intelligence
[3] Australian National University,Department of Computing
[4] The Hong Kong Polytechnic University,undefined
关键词
Multi-label image classification; Convolutional neural network; Label embeddings; Multi-layered attention;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-label image classification is a fundamental and practical task, which aims to assign multiple possible labels to an image. In recent years, many deep convolutional neural network (CNN) based approaches have been proposed which model label correlations to discover semantics of labels and learn semantic representations of images. This paper advances this research direction by improving both the modeling of label correlations and the learning of semantic representations. On the one hand, besides the local semantics of each label, we propose to further explore global semantics shared by multiple labels. On the other hand, existing approaches mainly learn the semantic representations at the last convolutional layer of a CNN. But it has been noted that the image representations of different layers of CNN capture different levels or scales of features and have different discriminative abilities. We thus propose to learn semantic representations at multiple convolutional layers. To this end, this paper designs a Multi-layered Semantic Representation Network (MSRN) which discovers both local and global semantics of labels through modeling label correlations and utilizes the label semantics to guide the semantic representations learning at multiple layers through an attention mechanism. Extensive experiments on five benchmark datasets including VOC2007, VOC2012, MS-COCO, NUS-WIDE, and Apparel show a competitive performance of the proposed MSRN against state-of-the-art models.
引用
下载
收藏
页码:3427 / 3435
页数:8
相关论文
共 50 条
  • [21] Feature learning network with transformer for multi-label image classification
    Zhou, Wei
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    Zheng, Zhijie
    PATTERN RECOGNITION, 2023, 136
  • [22] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    Neurocomputing, 2022, 491 : 14 - 23
  • [23] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    NEUROCOMPUTING, 2022, 491 : 14 - 23
  • [24] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [25] Multi-Label Classification Based on Low Rank Representation for Image Annotation
    Tan, Qiaoyu
    Liu, Yezi
    Chen, Xia
    Yu, Guoxian
    REMOTE SENSING, 2017, 9 (02)
  • [26] A multi-label text classification method via dynamic semantic representation model and deep neural network
    Tianshi Wang
    Li Liu
    Naiwen Liu
    Huaxiang Zhang
    Long Zhang
    Shanshan Feng
    Applied Intelligence, 2020, 50 : 2339 - 2351
  • [27] A multi-label text classification method via dynamic semantic representation model and deep neural network
    Wang, Tianshi
    Liu, Li
    Liu, Naiwen
    Zhang, Huaxiang
    Zhang, Long
    Feng, Shanshan
    APPLIED INTELLIGENCE, 2020, 50 (08) : 2339 - 2351
  • [28] Supervised representation learning for multi-label classification
    Ming Huang
    Fuzhen Zhuang
    Xiao Zhang
    Xiang Ao
    Zhengyu Niu
    Min-Ling Zhang
    Qing He
    Machine Learning, 2019, 108 : 747 - 763
  • [29] Supervised representation learning for multi-label classification
    Huang, Ming
    Zhuang, Fuzhen
    Zhang, Xiao
    Ao, Xiang
    Niu, Zhengyu
    Zhang, Min-Ling
    He, Qing
    MACHINE LEARNING, 2019, 108 (05) : 747 - 763
  • [30] Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition
    Chen, Tianshui
    Xu, Muxin
    Hui, Xiaolu
    Wu, Hefeng
    Lin, Liang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 522 - 531