Multi-label Image Classification with Multi-scale Global-Local Semantic Graph Network

Cited by: 2
Authors
Kuang, Wenlan [1, 2]
Zhu, Qiangxi [1, 2]
Li, Zhixin [1, 2]
Affiliations
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-label image classification; Multi-scale feature; Attention mechanisms; Semantic relationship graph; CNN;
DOI
10.1007/978-3-031-43418-1_4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the development of deep learning techniques, multi-label image classification has achieved good performance. Recently, graph convolutional networks have proven effective for exploring label dependencies. However, because label semantic relations are complex, the static dependencies obtained by existing methods cannot account for the overall characteristics of an image or accurately locate the target region. We therefore propose the Multi-scale Global-local Semantic Graph Network (MGSGN) for multi-label image classification, which has three main parts. First, multi-scale feature reconstruction aggregates complementary information from different CNN levels through cross-layer attention, which helps identify target categories of different sizes. Second, a channel dual-branch cross-attention module explores the correlation between global information and local features in the multi-scale features, using adaptive cross-fusion to locate the target area more accurately. Third, within the multi-perspective dynamic semantic representation module, a multi-perspective weighted cosine measure constructs content-based label dependencies for each image, so that the semantic relationship graph is built dynamically. Extensive experiments on two public datasets show that the model classifies better than many state-of-the-art methods.
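The idea of building a per-image label graph from a weighted cosine measure can be illustrated with a minimal sketch. This is not the MGSGN implementation: the abstract gives no formula, so the function names (`weighted_cosine`, `dynamic_label_graph`, `propagate`), the use of non-negative weight vectors as "perspectives", the averaging across perspectives, and the row-normalized propagation step are all assumptions made only for illustration.

```python
import math


def weighted_cosine(u, v, w):
    """Cosine similarity of u and v with per-dimension non-negative weights w."""
    num = sum(wi * ui * vi for wi, ui, vi in zip(w, u, v))
    norm_u = math.sqrt(sum(wi * ui * ui for wi, ui in zip(w, u)))
    norm_v = math.sqrt(sum(wi * vi * vi for wi, vi in zip(w, v)))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat an all-zero vector as unrelated to everything
    return num / (norm_u * norm_v)


def dynamic_label_graph(label_feats, perspectives):
    """Build an adjacency matrix for one image.

    label_feats: one feature vector per label (hypothetical per-image features).
    perspectives: list of weight vectors; similarities are averaged over them
    (an assumed way to combine perspectives).
    """
    n = len(label_feats)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            sims = [weighted_cosine(label_feats[i], label_feats[j], w)
                    for w in perspectives]
            adj[i][j] = sum(sims) / len(sims)
    return adj


def propagate(adj, label_feats):
    """One row-normalized propagation step over the label graph."""
    dim = len(label_feats[0])
    out = []
    for row in adj:
        total = sum(abs(a) for a in row) or 1.0  # avoid division by zero
        out.append([sum(a * f[d] for a, f in zip(row, label_feats)) / total
                    for d in range(dim)])
    return out


if __name__ == "__main__":
    feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # toy label features
    persp = [[1.0, 1.0], [2.0, 1.0]]               # two toy perspectives
    graph = dynamic_label_graph(feats, persp)
    print(graph)
    print(propagate(graph, feats))
```

Because the adjacency matrix is recomputed from each image's own label features, the graph changes from image to image, in contrast to a static co-occurrence graph shared across the whole dataset.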
Pages: 53-69
Number of pages: 17