Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

被引：0

作者：

Christian Heidorn

Muhammad Sabih

Nicolai Meyerhöfer

Christian Schinabeck

Jürgen Teich

Frank Hannig

机构：

[1] Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU),Department of Computer Science – Hardware/Software Co

[2] Fraunhofer Institute for Integrated Circuits IIS,Design

来源：

International Journal of Parallel Programming | 2024年 / 52卷

关键词：

Filter pruning; Evolutionary algorithm; Explainable AI;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the ℓ1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell ^1$$\end{document}-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.

引用

页码：40 / 58

页数：18

共 50 条

[1] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
Heidorn, Christian
Sabih, Muhammad
Meyerhoefer, Nicolai
Schinabeck, Christian
Teich, Juergen
Hannig, Frank
[J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 40 - 58
[2] HFP: Hardware-Aware Filter Pruning for Deep Convolutional Neural Networks Acceleration
Yu, Fang
Han, Chuanqi
Wang, Pengcheng
Huang, Ruoran
Huang, Xi
Cui, Li
[J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 255 - 262
[3] Hardware-Aware Evolutionary Filter Pruning
Heidorn, Christian
Meyerhoefer, Nicolai
Schinabeck, Christian
Hannig, Frank
Teich, Juergen
[J]. EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2022, 2022, 13511 : 283 - 299
[4] CHAMP: Coherent Hardware-Aware Magnitude Pruning of Integrated Photonic Neural Networks
Banerjee, Sanmitra
Nikdast, Mahdi
Pasricha, Sudeep
Chakrabarty, Krishnendu
[J]. 2022 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2022,
[5] Hardware-Aware Softmax Approximation for Deep Neural Networks
Geng, Xue
Lin, Jie
Zhao, Bin
Kong, Anmin
Aly, Mohamed M. Sabry
Chandrasekhar, Vijay
[J]. COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 107 - 122
[6] DyFiP: Explainable AI-based Dynamic Filter Pruning of Convolutional Neural Networks
Sabih, Muhammad
Hannig, Frank
Teich, Juergen
[J]. PROCEEDINGS OF THE 2022 2ND EUROPEAN WORKSHOP ON MACHINE LEARNING AND SYSTEMS (EUROMLSYS '22), 2022, : 109 - 115
[7] Control-free and efficient integrated photonic neural networks via hardware-aware training and pruning
Xu, Tengji
Zhang, Weipeng
Zhang, Jiawei
Luo, Zeyu
Xiao, Qiarong
Wang, Benshan
Luo, Mingcheng
Xu, Xingyuan
Shastri, Bhavin J.
Prucnal, Paul R.
Huang, Chaoran
[J]. Optica, 2024, 11 (08): : 1039 - 1049
[8] Hardware-aware Model Architecture for Ternary Spiking Neural Networks
Wu, Nai-Chun
Chen, Tsu-Hsiang
Huang, Chih-Tsun
[J]. 2023 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI-TSA/VLSI-DAT, 2023,
[9] Hardware-aware Partitioning of Convolutional Neural Network Inference for Embedded AI Applications
Kress, Fabian
Hoefer, Julian
Hotfilter, Tim
Walter, Iris
Sidorenko, Vladimir
Harbaum, Tanja
Becker, Juergen
[J]. 18TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2022), 2022, : 133 - 140
[10] Accelerator-Aware Pruning for Convolutional Neural Networks
Kang, Hyeong-Ju
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2093 - 2103

← 1 2 3 4 5 →