Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

被引：0

作者：

Christian Heidorn

Muhammad Sabih

Nicolai Meyerhöfer

Christian Schinabeck

Jürgen Teich

Frank Hannig

机构：

[1] Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU),Department of Computer Science – Hardware/Software Co

[2] Fraunhofer Institute for Integrated Circuits IIS,Design

来源：

International Journal of Parallel Programming | 2024年 / 52卷

关键词：

Filter pruning; Evolutionary algorithm; Explainable AI;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the ℓ1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell ^1$$\end{document}-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.

引用

下载

页码：40 / 58

页数：18

共 50 条

[31] Efficient Keyword Spotting through Hardware-Aware Conditional Execution of Deep Neural Networks
Giraldo, J. S. P.
O'Connor, Chris
Verhelst, Marian
2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
[32] HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
Xiao, Jinqi
Zhang, Chengming
Gong, Yu
Yin, Miao
Sui, Yang
Xiang, Lizhi
Tao, Dingwen
Yuan, Bo
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10464 - 10472
[33] Evolution of Hardware-Aware Neural Architecture Search on the Edge
Richey, Blake
Clay, Mitchell
Grecos, Christos
Shirvaikar, Mukul
REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2023, 2023, 12528
[34] Hardware-Aware Quantization for Multiplierless Neural Network Controllers
Habermann, Tobias
Kuehle, Jonas
Kumm, Martin
Volkova, Anastasia
2022 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS, 2022, : 541 - 545
[35] FRACTIONAL STEP DISCRIMINANT PRUNING: A FILTER PRUNING FRAMEWORK FOR DEEP CONVOLUTIONAL NEURAL NETWORKS
Gkalelis, Nikolaos
Mezaris, Vasileios
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
[36] Deep Quantization of Graph Neural Networks with Run-Time Hardware-Aware Training
Hansson, Olle
Grailoo, Mahdieh
Gustafsson, Oscar
Nunez-Yanez, Jose
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14553 LNCS : 33 - 47
[37] Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search
Xie, Guorui
Li, Qing
Shi, Zhenning
Fang, Hanbin
Ji, Shengpeng
Jiang, Yong
Yuan, Zhenhui
Ma, Lianbo
Xu, Mingwei
IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (02) : 481 - 494
[38] QuadraNet: Improving High-Order Neural Interaction Efficiency with Hardware-Aware Quadratic Neural Networks
Xu, Chenhui
Yu, Fuxun
Xu, Zirui
Liu, Chenchen
Xiong, Jinjun
Chen, Xiang
29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 19 - 25
[39] Hardware-aware neural architecture search for stochastic computing-based neural networks on tiny devices
Song, Yuhong
Sha, Edwin Hsing-Mean
Zhuge, Qingfeng
Xu, Rui
Xu, Xiaowei
Li, Bingzhe
Yang, Lei
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 135
[40] Filter Pruning for Efficient Transfer Learning in Deep Convolutional Neural Networks
Reinhold, Caique
Roisenberg, Mauro
ARTIFICIAL INTELLIGENCEAND SOFT COMPUTING, PT I, 2019, 11508 : 191 - 202

← 1 2 3 4 5 →