Double Attention Based on Graph Attention Network for Image Multi-Label Classification

被引:13
|
作者
Zhou, Wei [1 ]
Xia, Zhiwu [1 ]
Dou, Peng [1 ]
Su, Tao [1 ]
Hu, Haifeng [1 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; label correlation; channel attention mechanism; graph attention network; visual analysis; EFFICIENT;
D O I
10.1145/3519030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The task of image multi-label classification is to accurately recognize multiple objects in an input image. Most of the recent works need to leverage the label co-occurrence matrix counted from training data to construct the graph structure, which are inflexible and may degrade model generalizability. In addition, these methods fail to capture the semantic correlation between the channel feature maps to further improve model performance. To address these issues, we propose DA-GAT (a Double Attention framework based on the Graph Attention neTwork) to effectively learn the correlation between labels from training data. First, we devise a new channel attention mechanism to enhance the semantic correlation between channel feature maps, so as to implicitly capture the correlation between labels. Second, we propose a new label attention mechanism to avoid the adverse impact of a manually constructed label co-occurrence matrix. It only needs to leverage the label embedding as the input of network, then automatically constructs the label relation matrix to explicitly establish the correlation between labels. Finally, we effectively fuse the output of these two attention mechanisms to further improve model performance. Extensive experiments are conducted on three public multi-label classification benchmarks. Our DA-GAT model achieves mean average precision of 87.1%, 96.6%, and 64.3% on MS-COCO 2014, PASCAL VOC 2007, and NUS-WIDE, respectively, and obviously outperforms other existing state-of-the-art methods. In addition, visual analysis experiments demonstrate that each attention mechanism can capture the correlation between labels well and significantly promote the model performance.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] When graph convolution meets double attention: online privacy disclosure detection with multi-label text classification
    Zhanbo Liang
    Jie Guo
    Weidong Qiu
    Zheng Huang
    Shujun Li
    [J]. Data Mining and Knowledge Discovery, 2024, 38 : 1171 - 1192
  • [32] Real-Time Image Semantic Segmentation Based on Attention Mechanism and Multi-Label Classification
    Gao, Xiang
    Li, Chungeng
    An, Jubai
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (01): : 59 - 67
  • [33] DCA-GCN: a dual-branching channel attention and graph convolution network for multi-label remote sensing image classification
    Yang, Minhang
    Liu, Hui
    Gao, Liang
    Qian, Yurong
    Xiao, Zhengqing
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (04)
  • [34] Multi-module Fusion Relevance Attention Network for Multi-label Text Classification
    Yu, Xinmiao
    Li, Zhengpeng
    Wu, Jiansheng
    Liu, Mingao
    [J]. ENGINEERING LETTERS, 2022, 30 (04)
  • [35] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    [J]. NEUROCOMPUTING, 2022, 491 : 14 - 23
  • [36] Multi-label legal text classification with BiLSTM and attention
    Enamoto, Liriam
    Santos, Andre R. A. S.
    Maia, Ricardo
    Weigang, Li
    Rocha Filho, Geraldo P.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (04) : 369 - 378
  • [37] LA-HCN: Label-based Attention for Hierarchical Multi-label Text Classification Neural Network
    Zhang, Xinyi
    Xu, Jiahao
    Soh, Charlie
    Chen, Lihui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [38] Label Correlation Based Graph Convolutional Network for Multi-label Text Classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Manh-Tran Tien
    Van-Hau Nguyen
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [39] Graph convolutional networks with attention for multi-label weather recognition
    Kezhen Xie
    Zhiqiang Wei
    Lei Huang
    Qibing Qin
    Wenfeng Zhang
    [J]. Neural Computing and Applications, 2021, 33 : 11107 - 11123
  • [40] Multi-Label Patent Categorization with Non-Local Attention-Based Graph Convolutional Network
    Tang, Pingjie
    Jiang, Meng
    Xia, Bryan
    Pitera, Jed W.
    Welser, Jeffrey
    Chawla, Nitesh, V
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9024 - 9031