Multi-layer Attention Aggregation in Deep Neural Network

被引:0
|
作者
Zhang, Zetan [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510006, Guangdong, Peoples R China
关键词
Attention block; Aggregation; Convolutional neural networks; Performance improvement; Image classification;
D O I
10.1109/itaic.2019.8785533
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks have achieved significant successes in image classification recently due to its high capacity in learning discriminative features. In this work, we propose Multi-layer attention aggregation(MAA) model, a convolutional architecture using attention mechanism and global aggregation module iteratively. By merging attention-aware features from every convolutional stage, MAA improves the performance of image classification. Specifically, the proposed MAA model can be applied to state-of-the-art convolutional architectures, such as ResNet, and improve its performance by increasing 30% computational cost. Furthermore, we also employ ArcFace loss in the training process to improve the performance of image classification. Applying the proposed method on ResNet, our MAA model achieves higher image classification performance including on standard benchmarks of Google-Landmarks dataset, CIFAR-10 and CIFAR-100 dataset. Note that, our method achieves 0.68% top-1 accuracy improvement on Google-Landmarks dataset, 2.27% top-1 accuracy improvement on CIFAR-100 and 1.14% top-1 accuracy improvement on CIFAR-10.
引用
收藏
页码:134 / 138
页数:5
相关论文
共 50 条
  • [21] A Fast Learning Algorithm for the Multi-layer Neural Network
    Bilski, Jaroslaw
    Kowalczyk, Bartosz
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 3 - 15
  • [22] Document resizing using a multi-layer neural network
    Ahmed, MN
    Cooper, BE
    Love, ST
    IS&T'S NIP17: INTERNATIONAL CONFERENCE ON DIGITAL PRINTING TECHNOLOGIES, 2001, : 792 - 796
  • [23] A Study on Single and Multi-layer Perceptron Neural Network
    Singh, Jaswinder
    Banerjee, Rajdeep
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 35 - 40
  • [24] Multi-Layer Fusion Neural Network for Deepfake Detection
    Zhao, Zheng
    Wang, Penghui
    Lu, Wei
    INTERNATIONAL JOURNAL OF DIGITAL CRIME AND FORENSICS, 2021, 13 (04) : 26 - 39
  • [25] An Effective Multi-Layer Attention Network for SAR Ship Detection
    Suo, Zhiling
    Zhao, Yongbo
    Hu, Yili
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (05)
  • [26] Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding
    Zhu, Hongning
    Lee, Kong Aik
    Li, Haizhou
    INTERSPEECH 2021, 2021, : 106 - 110
  • [27] MI-VFDNN: An Efficient Vertical Federated Deep Neural Network With Multi-Layer Interaction
    Sun, Xiao
    Yu, Haining
    Liu, Zhichao
    Jia, Xiaohua
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 7435 - 7448
  • [28] Policing function in ATM network using multi-layer neural network
    Fan, KK
    Jayasumana, AP
    21ST IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 1996, : 102 - 104
  • [29] Partitioning multi-layer edge network for neural network collaborative computing
    Li, Qiang
    Zhou, Ming-Tuo
    Ren, Tian-Feng
    Jiang, Cheng-Bin
    Chen, Yong
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2023, 2023 (01)
  • [30] Partitioning multi-layer edge network for neural network collaborative computing
    Qiang Li
    Ming-Tuo Zhou
    Tian-Feng Ren
    Cheng-Bin Jiang
    Yong Chen
    EURASIP Journal on Wireless Communications and Networking, 2023