Multi-Label Image Classification by Feature Attention Network

被引:52
|
作者
Yan, Zheng [1 ]
Liu, Weiwei [1 ,2 ]
Wen, Shiping [2 ,3 ]
Yang, Yin [4 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Hubei, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[3] Huazhong Univ Sci & Technol, Res Inst Shenzhen, Wuhan, Hubei, Peoples R China
[4] Hamad Bin Khalifa Univ, Coll Sci Engn & Technol, Doha 5855, Qatar
关键词
Deep neural network; multi-label recognition; label correlation; attention; MEMRISTOR; PASSIVITY;
D O I
10.1109/ACCESS.2019.2929512
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning the correlation among labels is a standing-problem in the multi-label image recognition task. The label correlation is the key to solve the multi-label classification but it is too abstract to model. Most solutions try to learn image label dependencies to improve multi-label classification performance. However, they have ignored two more realistic problems: object scale inconsistent and label tail (category imbalance). These two problems will impact the bad influence on the classification model. To tackle these two problems and learn the label correlations, we propose feature attention network (FAN) which contains feature refinement network and correlation learning network. FAN builds top-down feature fusion mechanism to refine more important features and learn the correlations among convolutional features from FAN to indirect learn the label dependencies. Following our proposed solution, we achieve performed classification accuracy on MSCOCO 2014 and VOC 2007 dataset.
引用
收藏
页码:98005 / 98013
页数:9
相关论文
共 50 条
  • [1] Graph Attention Transformer Network for Multi-label Image Classification
    Yuan, Jin
    Chen, Shikai
    Zhang, Yao
    Shi, Zhongchao
    Geng, Xin
    Fan, Jianping
    Rui, Yong
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [2] Feature learning network with transformer for multi-label image classification
    Zhou, Wei
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    Zheng, Zhijie
    [J]. PATTERN RECOGNITION, 2023, 136
  • [3] Double Attention Based on Graph Attention Network for Image Multi-Label Classification
    Zhou, Wei
    Xia, Zhiwu
    Dou, Peng
    Su, Tao
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [4] Double Attention for Multi-Label Image Classification
    Zhao, Haiying
    Zhou, Wei
    Hou, Xiaogang
    Zhu, Hui
    [J]. IEEE ACCESS, 2020, 8 : 225539 - 225550
  • [5] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [6] Attention-Augmented Memory Network for Image Multi-Label Classification
    Zhou, Wei
    Hou, Yanke
    Chen, Dihu
    Hu, Haifeng
    Su, Tao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (03)
  • [7] Multi-Label Fundus Image Classification Using Attention Mechanisms and Feature Fusion
    Li, Zhenwei
    Xu, Mengying
    Yang, Xiaoli
    Han, Yanqi
    [J]. MICROMACHINES, 2022, 13 (06)
  • [8] Label-Guided Cross-Modal Attention Network for Multi-Label Aerial Image Classification
    Chen, Ying
    Zhang, Ding
    Han, Tao
    Meng, Xiaoliang
    Gao, Mianxin
    Wang, Teng
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [9] DATran: Dual Attention Transformer for Multi-Label Image Classification
    Zhou, Wei
    Zheng, Zhijie
    Su, Tao
    Hu, Haifeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 342 - 356
  • [10] A Capsule Network for Hierarchical Multi-label Image Classification
    Noor, Khondaker Tasrif
    Robles-Kelly, Antonio
    Kusy, Brano
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2022, 2022, 13813 : 163 - 172