A semantic guidance-based fusion network for multi-label image classification

Authors
Wang, Jiuhang [1 ,2 ]
Tang, Hongying [1 ]
Luo, Shanshan [1 ]
Yang, Liqi [1 ,2 ]
Liu, Shusheng [1 ,2 ]
Hong, Aoping [1 ,2 ]
Li, Baoqing [1 ]
Affiliations
[1] Shanghai Institute Microsyst & Informat Technol, Sci & Technol Microsyst Lab, 1455 Pingcheng Rd, Shanghai 201800, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun, 1 Yanqihu East Rd, Beijing 100049, Peoples R China
Keywords
Image spatial correlation; Label semantic correlation; Layered semantic guidance fusion; Multi-label image classification;
DOI
10.1016/j.patrec.2024.08.020
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multi-label image classification (MLIC), the fundamental task of assigning multiple labels to each image, has seen notable progress in recent years. Because objects tend to appear together in the physical world, modeling object correlations is crucial for improving classification accuracy. This involves accounting for both spatial image feature correlation and label semantic correlation. However, existing methods struggle to establish these correlations because of complex spatial location and label semantic relationships. Moreover, to fuse image feature correlation with label semantic correlation, existing methods typically learn a semantic representation in the final CNN layer, even though different CNN layers capture features at different scales and possess distinct discriminative abilities. To address these issues, we introduce the Semantic Guidance-Based Fusion Network (SGFN) for MLIC. To model spatial image feature correlation, we use the TResNet architecture as the backbone network and employ a Feature Aggregation Module to capture global spatial correlation. For label semantic correlation, we establish both local and global semantic correlations. We further enrich the model's features by learning semantic representations across multiple convolutional layers. Our method outperforms current state-of-the-art techniques on the PASCAL VOC (2007, 2012) and MS-COCO datasets.
Pages: 254-261
Page count: 8