Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks

Cited by: 0
Authors
Christian Heidorn
Muhammad Sabih
Nicolai Meyerhöfer
Christian Schinabeck
Jürgen Teich
Frank Hannig
Affiliations
[1] Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Department of Computer Science – Hardware/Software Co
[2] Fraunhofer Institute for Integrated Circuits IIS, Design
Keywords
Filter pruning; Evolutionary algorithm; Explainable AI
DOI
Not available
Abstract
Filter pruning of convolutional neural networks (CNNs) is a common technique to effectively reduce the memory footprint, the number of arithmetic operations, and, consequently, inference time. Recent pruning approaches also consider the targeted device (i.e., graphics processing units) for CNN deployment to reduce the actual inference time. However, simple metrics, such as the $\ell^1$-norm, are used for deciding which filters to prune. In this work, we propose a hardware-aware technique to explore the vast multi-objective design space of possible filter pruning configurations. Our approach incorporates not only the targeted device but also techniques from explainable artificial intelligence for ranking and deciding which filters to prune. For each layer, the number of filters to be pruned is optimized with the objective of minimizing the inference time and the error rate of the CNN. Experimental results show that our approach can speed up inference time by 1.40× and 1.30× for VGG-16 on the CIFAR-10 dataset and ResNet-18 on the ILSVRC-2012 dataset, respectively, compared to the state-of-the-art ABCPruner.
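The abstract describes a search over per-layer pruning configurations with two objectives to minimize: inference time and error rate. As a rough illustration of what "multi-objective" means here, the following sketch keeps only the non-dominated (Pareto-optimal) configurations from a small population. It is not the authors' algorithm: the `Candidate` type, its field names, and all numbers are hypothetical, and the abstract does not say which evolutionary algorithm or selection scheme the paper uses.

```python
from typing import List, NamedTuple, Tuple


class Candidate(NamedTuple):
    """One hypothetical pruning configuration with its two objective values."""
    pruned_per_layer: Tuple[int, ...]  # filters removed in each layer (assumed encoding)
    inference_time_ms: float           # measured on the target device, lower is better
    error_rate: float                  # error of the pruned CNN, lower is better


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b in both objectives and strictly better in one."""
    no_worse = (a.inference_time_ms <= b.inference_time_ms
                and a.error_rate <= b.error_rate)
    strictly_better = (a.inference_time_ms < b.inference_time_ms
                       or a.error_rate < b.error_rate)
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep the candidates that no other candidate dominates (input order preserved)."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Made-up values for illustration only; these are not results from the paper.
population = [
    Candidate((0, 0, 0), 10.0, 0.060),     # unpruned: slowest, lowest error
    Candidate((8, 16, 32), 7.5, 0.065),
    Candidate((16, 32, 64), 6.0, 0.080),   # fastest
    Candidate((8, 8, 8), 8.0, 0.070),      # dominated by (8, 16, 32)
    Candidate((32, 64, 96), 6.5, 0.120),   # dominated by (16, 32, 64)
]

front = pareto_front(population)
for c in front:
    print(c.pruned_per_layer, c.inference_time_ms, c.error_rate)
```

An evolutionary search would typically generate new configurations by mutating and recombining the per-layer counts, evaluate both objectives for each, and repeat a selection step of this kind over many generations.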
Pages: 40-58
Page count: 18