Multi-label Image Classification with Multi-scale Global-Local Semantic Graph Network

被引:2
|
作者
Kuang, Wenlan [1 ,2 ]
Zhu, Qiangxi [1 ,2 ]
Li, Zhixin [1 ,2 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image classification; Multi-scale feature; Attention mechanisms; Semantic relationship graph; CNN;
D O I
10.1007/978-3-031-43418-1_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of deep learning techniques, multi-label image classification tasks have achieved good performance. Recently, graph convolutional network has been proved to be an effective way to explore the labels dependencies. However, due to the complexity of label semantic relations, the static dependencies obtained by existing methods cannot consider the overall characteristics of an image and accurately locate the target region. Therefore, we propose the Multi-scale Global-local Semantic Graph Network (MGSGN) for multi-label image classification, which mainly includes three important parts. First, the multi-scale feature reconstruction aggregates complementary information at different levels in CNN through cross-layer attention, which can effectively identify target categories of different sizes. We then design a channel dual-branch cross-attention module to explore the correlation between global information and local features in multi-scale features, which using the way of adaptive cross-fusion to locate the target area more accurately. Moreover, we propose the multi-perspective weighted cosine measure in multi-perspective dynamic semantic representation module to construct content-based label dependencies for each image to dynamically construct a semantic relationship graph. Extensive experiments on the two public datasets have verified that the classification performance of our model is better than many state-of-the-art methods.
引用
收藏
页码:53 / 69
页数:17
相关论文
共 50 条
  • [31] A Capsule Network for Hierarchical Multi-label Image Classification
    Noor, Khondaker Tasrif
    Robles-Kelly, Antonio
    Kusy, Brano
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 163 - 172
  • [32] Multi-Label Image Classification by Feature Attention Network
    Yan, Zheng
    Liu, Weiwei
    Wen, Shiping
    Yang, Yin
    IEEE ACCESS, 2019, 7 : 98005 - 98013
  • [33] Modeling global and local label correlation with graph convolutional networks for multi-label chest X-ray image classification
    Li, Lanting
    Cao, Peng
    Yang, Jinzhu
    Zaiane, Osmar R.
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2022, 60 (09) : 2567 - 2588
  • [34] Modeling global and local label correlation with graph convolutional networks for multi-label chest X-ray image classification
    Lanting Li
    Peng Cao
    Jinzhu Yang
    Osmar R. Zaiane
    Medical & Biological Engineering & Computing, 2022, 60 : 2567 - 2588
  • [35] Modular Graph Transformer Networks for Multi-Label Image Classification
    Nguyen, Hoang D.
    Vu, Xuan-Son
    Le, Duc-Trong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9092 - 9100
  • [36] Multi-scale pedestrian detection with global-local attention and multi-scale receptive field context
    Xue, Pan
    Chen, Houjin
    Li, Yanfeng
    Li, Jupeng
    IET COMPUTER VISION, 2023, 17 (01) : 13 - 25
  • [37] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Sun, Dengdi
    Ma, Leilei
    Ding, Zhuanlian
    Luo, Bin
    COGNITIVE COMPUTATION, 2023, 15 (04) : 1308 - 1319
  • [38] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Dengdi Sun
    Leilei Ma
    Zhuanlian Ding
    Bin Luo
    Cognitive Computation, 2023, 15 : 1308 - 1319
  • [39] Pedestrian attribute classification with multi-scale and multi-label convolutional neural networks
    朱建清
    Zeng Huanqiang
    Zhang Yuzhao
    Zheng Lixin
    Cai Canhui
    HighTechnologyLetters, 2018, 24 (01) : 53 - 61
  • [40] AMC-Net: Asymmetric and multi-scale convolutional neural network for multi-label HPA classification
    Xiang, Shao
    Liang, Qiaokang
    Hu, Yucheng
    Tang, Pen
    Coppola, Gianmarc
    Zhan, Dan
    Sun, Wei
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 178 : 275 - 287