EMNAPE: Efficient Multi-Dimensional Neural Architecture Pruning for EdgeAI

被引:0
|
作者
Kong, Hao [1 ,2 ]
Luo, Xiangzhong [1 ]
Huai, Shuo [1 ,2 ]
Liu, Di [3 ]
Subramaniam, Ravi [4 ]
Makaya, Christian [4 ]
Lin, Qian [4 ]
Liu, Weichen [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Nanyang Technol Univ, HP NTU Digital Mfg Corp Lab, Singapore, Singapore
[3] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
[4] HP Inc, Palo Alto, CA USA
关键词
D O I
10.23919/DATE56975.2023.10137122
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a multi-dimensional pruning framework, EMNAPE, to jointly prune the three dimensions (depth, width, and resolution) of convolutional neural networks (CNNs) for better execution efficiency on embedded hardware. In EMNAPE, we introduce a two-stage evaluation strategy to evaluate the importance of each pruning unit and identify the computational redundancy in the three dimensions. Based on the evaluation strategy, we further present a heuristic pruning algorithm to progressively prune redundant units from the three dimensions for better accuracy and efficiency. Experiments demonstrate the superiority of EMNAPE over existing methods.
引用
收藏
页数:2
相关论文
共 50 条
  • [11] Multi-dimensional Fuzzy Interpolation Neural Network
    Li, Dayou
    Yue, Yong
    Maple, Carsten
    Schetinin, Vitaly
    Qiu, Hua
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 186 - +
  • [12] Efficient quantile retrieval on multi-dimensional data
    Yiu, Man Lung
    Mamoulis, Nikos
    Tao, Yufei
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 167 - 185
  • [13] Efficient Methods for Multi-Dimensional Array Redistribution
    Ching-Hsien Hsu
    Yeh-Ching Chung
    Chyi-Ren Dow
    The Journal of Supercomputing, 2000, 17 : 23 - 46
  • [14] Efficient methods for multi-dimensional array redistribution
    Chung, YC
    Hsu, CH
    1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 410 - 417
  • [15] Efficient methods for multi-dimensional array redistribution
    Hsu, CH
    Chung, YC
    Dow, CR
    JOURNAL OF SUPERCOMPUTING, 2000, 17 (01): : 23 - 46
  • [16] Efficient implementation of multi-dimensional array redistribution
    Guo, MY
    Yamashita, N
    Nakata, I
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (11) : 1195 - 1204
  • [17] Space Efficient Multi-dimensional Range Reporting
    Karpinski, Marek
    Nekrich, Yakov
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2009, 5609 : 215 - 224
  • [18] Modelling of Complete Robot Dynamics Based on a Multi-Dimensional, RBF-like Neural Architecture
    Markus Krabbes
    Christian Döschner
    Applied Intelligence, 2002, 17 : 61 - 73
  • [19] An Efficient Probabilistic Framework for Multi-Dimensional Classification
    Batal, Iyad
    Hong, Charmgil
    Hauskrecht, Milos
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2417 - 2422
  • [20] Accurate and efficient multi-dimensional TVD interpolation
    Kim, Sung-soo
    Kim, Kyu-Hong
    Kim, Chongam
    COMPUTATIONAL FLUID DYNAMICS 2004, PROCEEDINGS, 2006, : 785 - +