Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

Cited by: 0
Authors
Christian Heidorn
Muhammad Sabih
Nicolai Meyerhöfer
Christian Schinabeck
Jürgen Teich
Frank Hannig
Affiliations
[1] Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Department of Computer Science – Hardware/Software Co
[2] Fraunhofer Institute for Integrated Circuits IIS, Design
Keywords
Filter pruning; Evolutionary algorithm; Explainable AI;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the $\ell^1$-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
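The abstract describes a two-objective search (inference time and error rate) over per-layer pruning counts. As a rough illustration only, the sketch below shows the non-dominated (Pareto) selection step that multi-objective evolutionary algorithms typically rely on. It is not the authors' implementation: the `Candidate` layout, the function names, and every number are hypothetical, and the measurement of time and error on the target device is assumed to have happened elsewhere.

```python
from typing import List, Tuple

# Hypothetical candidate layout (an assumption, not taken from the paper):
# (filters pruned per layer, inference time in ms, error rate).
Candidate = Tuple[Tuple[int, ...], float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b in both objectives
    and strictly better in at least one (both are minimized)."""
    no_worse = a[1] <= b[1] and a[2] <= b[2]
    strictly_better = a[1] < b[1] or a[2] < b[2]
    return no_worse and strictly_better


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep only candidates that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


# Made-up measurements for five pruning configurations of a 3-layer CNN.
candidates: List[Candidate] = [
    ((0, 0, 0), 10.0, 0.060),      # unpruned baseline
    ((8, 16, 32), 7.5, 0.065),
    ((16, 32, 64), 6.0, 0.080),
    ((4, 8, 16), 9.0, 0.070),      # slower and less accurate than (8, 16, 32)
    ((32, 64, 128), 6.5, 0.120),   # slower and less accurate than (16, 32, 64)
]

front = pareto_front(candidates)
for pruned, time_ms, err in front:
    print(pruned, time_ms, err)
```

In a full evolutionary loop, the surviving configurations would be recombined and mutated to form the next generation; that part is omitted here because the abstract does not specify it.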
Pages: 40-58
Page count: 18
Related papers
50 in total
  • [1] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
    Heidorn, Christian
    Sabih, Muhammad
    Meyerhoefer, Nicolai
    Schinabeck, Christian
    Teich, Juergen
    Hannig, Frank
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 40 - 58
  • [2] HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration
    Yu, Fang
    Han, Chuanqi
    Wang, Pengcheng
    Huang, Ruoran
    Huang, Xi
    Cui, Li
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 255 - 262
  • [3] Hardware-Aware Evolutionary Filter Pruning
    Heidorn, Christian
    Meyerhoefer, Nicolai
    Schinabeck, Christian
    Hannig, Frank
    Teich, Juergen
    [J]. EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2022, 2022, 13511 : 283 - 299
  • [4] CHAMP: Coherent Hardware-Aware Magnitude Pruning of Integrated Photonic Neural Networks
    Banerjee, Sanmitra
    Nikdast, Mahdi
    Pasricha, Sudeep
    Chakrabarty, Krishnendu
    [J]. 2022 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2022,
  • [5] Hardware-Aware Softmax Approximation for Deep Neural Networks
    Geng, Xue
    Lin, Jie
    Zhao, Bin
    Kong, Anmin
    Aly, Mohamed M. Sabry
    Chandrasekhar, Vijay
    [J]. COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 107 - 122
  • [6] DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks
    Sabih, Muhammad
    Hannig, Frank
    Teich, Juergen
    [J]. PROCEEDINGS OF THE 2022 2ND EUROPEAN WORKSHOP ON MACHINE LEARNING AND SYSTEMS (EUROMLSYS '22), 2022, : 109 - 115
  • [7] Control-free and efficient integrated photonic neural networks via hardware-aware training and pruning
    Xu, Tengji
    Zhang, Weipeng
    Zhang, Jiawei
    Luo, Zeyu
    Xiao, Qiarong
    Wang, Benshan
    Luo, Mingcheng
    Xu, Xingyuan
    Shastri, Bhavin J.
    Prucnal, Paul R.
    Huang, Chaoran
    [J]. Optica, 2024, 11 (08): : 1039 - 1049
  • [8] Hardware-aware Model Architecture for Ternary Spiking Neural Networks
    Wu, Nai-Chun
    Chen, Tsu-Hsiang
    Huang, Chih-Tsun
    [J]. 2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,
  • [9] Hardware-aware Partitioning of Convolutional Neural Network Inference for Embedded AI Applications
    Kress, Fabian
    Hoefer, Julian
    Hotfilter, Tim
    Walter, Iris
    Sidorenko, Vladimir
    Harbaum, Tanja
    Becker, Juergen
    [J]. 18TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2022), 2022, : 133 - 140