EMNAPE: Efficient Multi-Dimensional Neural Architecture Pruning for EdgeAI

被引:0
|
作者
Kong, Hao [1 ,2 ]
Luo, Xiangzhong [1 ]
Huai, Shuo [1 ,2 ]
Liu, Di [3 ]
Subramaniam, Ravi [4 ]
Makaya, Christian [4 ]
Lin, Qian [4 ]
Liu, Weichen [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Nanyang Technol Univ, HP NTU Digital Mfg Corp Lab, Singapore, Singapore
[3] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
[4] HP Inc, Palo Alto, CA USA
关键词
D O I
10.23919/DATE56975.2023.10137122
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] An efficient architecture for multi-dimensional convolution
    Elnaggar, A
    Aboelaze, M
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 2000, 47 (12): : 1520 - 1523
  • [2] Towards Efficient Convolutional Neural Network for Embedded Hardware via Multi-Dimensional Pruning
    Kong, Hao
    Liu, Di
    Luo, Xiangzhong
    Huai, Shuo
    Subramaniam, Ravi
    Makaya, Christian
    Lin, Qian
    Liu, Weichen
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [3] Differentiable architecture search with multi-dimensional attention for spiking neural networks
    Man, Yilei
    Xie, Linhai
    Qiao, Shushan
    Zhou, Yumei
    Shang, Delong
    NEUROCOMPUTING, 2024, 601
  • [4] Design of an efficient multiplier-less architecture for multi-dimensional convolution
    Zhang, MZ
    Ngo, HT
    Asari, VK
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 65 - 78
  • [5] Multi-Dimensional Pruning: A Unified Framework for Model Compression
    Guo, Jinyang
    Ouyang, Wanli
    Xu, Dong
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1505 - 1514
  • [6] Multi-dimensional recurrent neural networks
    Graves, Alex
    Fernandez, Santiago
    Schmidhuber, Juergen
    ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 1, PROCEEDINGS, 2007, 4668 : 549 - +
  • [7] Neural representation of multi-dimensional stimuli
    Eurich, CW
    Wilke, SD
    Schwegler, H
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 115 - 121
  • [8] multiFIA - Multi-dimensional Future Internet Architecture
    Seok, Seung-Joon
    Muhammad, Afaq
    Kim, Kyungbaek
    Choi, Deokjai
    Han, Youn-Hee
    Seok, Woojin
    Song, Wang-Cheol
    2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, : 87 - 92
  • [9] 'ESSAY ON MULTI-DIMENSIONAL ARCHITECTURE' + A POETIC SELECTION
    GINS, M
    BOUNDARY 2-AN INTERNATIONAL JOURNAL OF LITERATURE AND CULTURE, 1986, 14 (1-2): : 95 - 98
  • [10] Multi-dimensional spatial pruning for remote sensing image scene classification
    Zhai, Dezhao
    Chen, Wei
    Miao, Baoming
    Liu, Fulong
    Han, Siqi
    Ding, Yinghao
    Yu, Ming
    Wu, Hang
    DIGITAL SIGNAL PROCESSING, 2025, 158