Multi-label Image Classification with Multi-scale Global-Local Semantic Graph Network

被引:2
|
作者
Kuang, Wenlan [1 ,2 ]
Zhu, Qiangxi [1 ,2 ]
Li, Zhixin [1 ,2 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image classification; Multi-scale feature; Attention mechanisms; Semantic relationship graph; CNN;
D O I
10.1007/978-3-031-43418-1_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of deep learning techniques, multi-label image classification tasks have achieved good performance. Recently, graph convolutional network has been proved to be an effective way to explore the labels dependencies. However, due to the complexity of label semantic relations, the static dependencies obtained by existing methods cannot consider the overall characteristics of an image and accurately locate the target region. Therefore, we propose the Multi-scale Global-local Semantic Graph Network (MGSGN) for multi-label image classification, which mainly includes three important parts. First, the multi-scale feature reconstruction aggregates complementary information at different levels in CNN through cross-layer attention, which can effectively identify target categories of different sizes. We then design a channel dual-branch cross-attention module to explore the correlation between global information and local features in multi-scale features, which using the way of adaptive cross-fusion to locate the target area more accurately. Moreover, we propose the multi-perspective weighted cosine measure in multi-perspective dynamic semantic representation module to construct content-based label dependencies for each image to dynamically construct a semantic relationship graph. Extensive experiments on the two public datasets have verified that the classification performance of our model is better than many state-of-the-art methods.
引用
收藏
页码:53 / 69
页数:17
相关论文
共 50 条
  • [21] A semantic guidance-based fusion network for multi-label image classification
    Wang, Jiuhang
    Tang, Hongying
    Luo, Shanshan
    Yang, Liqi
    Liu, Shusheng
    Hong, Aoping
    Li, Baoqing
    PATTERN RECOGNITION LETTERS, 2024, 185 : 254 - 261
  • [22] Double Attention Based on Graph Attention Network for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [23] Active learning in multi-label image classification with graph convolutional network embedding
    Xie, Xiurui
    Tian, Maojun
    Luo, Guangchun
    Liu, Guisong
    Wu, Yizhe
    Qin, Ke
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 56 - 65
  • [24] Multi-label text classification based on semantic-sensitive graph convolutional network
    Zeng, Delong
    Zha, Enze
    Kuang, Jiayi
    Shen, Ying
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [25] Multi-label image classification with recurrently learning semantic dependencies
    Long Chen
    Ronggui Wang
    Juan Yang
    Lixia Xue
    Min Hu
    The Visual Computer, 2019, 35 : 1361 - 1371
  • [26] Multi-label image classification with recurrently learning semantic dependencies
    Chen, Long
    Wang, Ronggui
    Yang, Juan
    Xue, Lixia
    Hu, Min
    VISUAL COMPUTER, 2019, 35 (10): : 1361 - 1371
  • [27] Deep Semantic Dictionary Learning for Multi-label Image Classification
    Zhou, Fengtao
    Huang, Sheng
    Xing, Yun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3572 - 3580
  • [28] Multi-Label Classification with Label Graph Superimposing
    Wang, Ya
    He, Dongliang
    Li, Fu
    Long, Xiang
    Zhou, Zhichao
    Ma, Jinwen
    Wen, Shilei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12265 - 12272
  • [29] Three-way graph convolutional network for multi-label classification in multi-label information system
    Yu, Bin
    Xie, Hengjie
    Fu, Yu
    Xu, Zeshui
    APPLIED SOFT COMPUTING, 2024, 161
  • [30] Image emotion multi-label classification based on multi-graph learning
    Wang, Meixia
    Zhao, Yuhai
    Wang, Yejiang
    Xu, Tongze
    Sun, Yiming
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231