Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

被引:0
|
作者
Christian Heidorn
Muhammad Sabih
Nicolai Meyerhöfer
Christian Schinabeck
Jürgen Teich
Frank Hannig
机构
[1] Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU),Department of Computer Science – Hardware/Software Co
[2] Fraunhofer Institute for Integrated Circuits IIS,Design
关键词
Filter pruning; Evolutionary algorithm; Explainable AI;
D O I
暂无
中图分类号
学科分类号
摘要
Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the ℓ1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell ^1$$\end{document}-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
引用
下载
收藏
页码:40 / 58
页数:18
相关论文
共 50 条
  • [31] Efficient Keyword Spotting through Hardware-Aware Conditional Execution of Deep Neural Networks
    Giraldo, J. S. P.
    O'Connor, Chris
    Verhelst, Marian
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [32] HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
    Xiao, Jinqi
    Zhang, Chengming
    Gong, Yu
    Yin, Miao
    Sui, Yang
    Xiang, Lizhi
    Tao, Dingwen
    Yuan, Bo
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10464 - 10472
  • [33] Evolution of Hardware-Aware Neural Architecture Search on the Edge
    Richey, Blake
    Clay, Mitchell
    Grecos, Christos
    Shirvaikar, Mukul
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2023, 2023, 12528
  • [34] Hardware-Aware Quantization for Multiplierless Neural Network Controllers
    Habermann, Tobias
    Kuehle, Jonas
    Kumm, Martin
    Volkova, Anastasia
    2022 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS, 2022, : 541 - 545
  • [35] FRACTIONAL STEP DISCRIMINANT PRUNING: A FILTER PRUNING FRAMEWORK FOR DEEP CONVOLUTIONAL NEURAL NETWORKS
    Gkalelis, Nikolaos
    Mezaris, Vasileios
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [36] Deep Quantization of Graph Neural Networks with Run-Time Hardware-Aware Training
    Hansson, Olle
    Grailoo, Mahdieh
    Gustafsson, Oscar
    Nunez-Yanez, Jose
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14553 LNCS : 33 - 47
  • [37] Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search
    Xie, Guorui
    Li, Qing
    Shi, Zhenning
    Fang, Hanbin
    Ji, Shengpeng
    Jiang, Yong
    Yuan, Zhenhui
    Ma, Lianbo
    Xu, Mingwei
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (02) : 481 - 494
  • [38] QuadraNet: Improving High-Order Neural Interaction Efficiency with Hardware-Aware Quadratic Neural Networks
    Xu, Chenhui
    Yu, Fuxun
    Xu, Zirui
    Liu, Chenchen
    Xiong, Jinjun
    Chen, Xiang
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 19 - 25
  • [39] Hardware-aware neural architecture search for stochastic computing-based neural networks on tiny devices
    Song, Yuhong
    Sha, Edwin Hsing-Mean
    Zhuge, Qingfeng
    Xu, Rui
    Xu, Xiaowei
    Li, Bingzhe
    Yang, Lei
    JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 135
  • [40] Filter Pruning for Efficient Transfer Learning in Deep Convolutional Neural Networks
    Reinhold, Caique
    Roisenberg, Mauro
    ARTIFICIAL INTELLIGENCEAND SOFT COMPUTING, PT I, 2019, 11508 : 191 - 202