A Novel Clustering-Based Filter Pruning Method for Efficient Deep Neural Networks

Cited by: 0
Authors
Wei, Xiaohui [1 ]
Shen, Xiaoxian [1 ]
Zhou, Changbao [1 ]
Yue, Hengshan [1 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Clustering-based; Filter pruning; Deep neural networks;
DOI
10.1007/978-3-030-60239-0_17
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Deep neural networks have achieved great success in various applications, but this success has come with a significant increase in computational operations and storage costs, which makes such models difficult to deploy on embedded systems. Model compression is therefore a popular way to reduce these overheads. In this paper, a new filter pruning method based on a clustering algorithm is proposed to compress network models. First, we cluster the filters by their features and select one filter from each cluster as its representative. Next, we rank all filters according to their impact on the result and select a configurable number of top features. Finally, we prune the redundant connections that were not selected. We empirically demonstrate the effectiveness of our approach on several network models, including VGG and ResNet. Experimental results show that on CIFAR-10, our method reduces inference costs by up to 44% for VGG-16 and by up to 50% for ResNet-32, while accuracy recovers to close to the original level.
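The three steps named in the abstract (cluster, rank, prune) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: each filter is represented by its flattened weights, plain k-means stands in for the unspecified clustering algorithm, the member closest to each centroid is taken as the representative, and the L1 norm stands in for the unspecified "impact on the result" ranking criterion. The names `sq_dist`, `kmeans`, and `select_filters` are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=20, seed=0):
    """Plain k-means; returns one cluster label per point and the centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c]))
                  for p in points]
        # Update step: move each non-empty cluster's centroid to its mean.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def select_filters(filters, k, keep):
    """Return (kept, pruned) filter indices for one layer.

    Assumption: representatives are ranked by L1 norm, a common stand-in;
    the abstract does not say which ranking criterion the paper uses.
    """
    labels, centroids = kmeans(filters, k)
    # Step 1: one representative per cluster (member closest to the centroid).
    reps = []
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        if members:
            reps.append(min(members, key=lambda i: sq_dist(filters[i], centroids[c])))
    # Step 2: rank the representatives and keep a configurable number of them.
    ranked = sorted(reps, key=lambda i: sum(abs(w) for w in filters[i]), reverse=True)
    kept = sorted(ranked[:keep])
    # Step 3: every filter that was not selected is marked for pruning.
    pruned = [i for i in range(len(filters)) if i not in kept]
    return kept, pruned


# Toy layer: six filters with two weights each, forming three visible groups.
filters = [[1.0, 1.0], [1.1, 0.9], [5.0, 5.0],
           [5.2, 4.9], [-3.0, 0.1], [-3.1, 0.0]]
kept, pruned = select_filters(filters, k=3, keep=2)
print("kept:", kept, "pruned:", pruned)
```

In a real network the pruned indices would then be used to drop the corresponding output channels and the matching input channels of the next layer, followed by fine-tuning to recover accuracy.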
Pages: 245-258
Page count: 14
Related papers
50 items in total
  • [31] TRP: Trained Rank Pruning for Efficient Deep Neural Networks
    Xu, Yuhui
    Li, Yuxi
    Zhang, Shuai
    Wen, Wei
    Wang, Botao
    Qi, Yingyong
    Chen, Yiran
    Lin, Weiyao
    Xiong, Hongkai
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 977 - 983
  • [32] Acceleration of Deep Convolutional Neural Networks Using Adaptive Filter Pruning
    Singh, Pravendra
    Verma, Vinay Kumar
    Rai, Piyush
    Namboodiri, Vinay P.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (04) : 838 - 847
  • [33] Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration
    He, Yang
    Ding, Yuhang
    Liu, Ping
    Zhu, Linchao
    Zhang, Hanwang
    Yang, Yi
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2006 - 2015
  • [34] A NOVEL LAYERWISE PRUNING METHOD FOR MODEL REDUCTION OF FULLY CONNECTED DEEP NEURAL NETWORKS
    Mauch, Lukas
    Yang, Bin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2382 - 2386
  • [35] Auto-Balanced Filter Pruning for Efficient Convolutional Neural Networks
    Ding, Xiaohan
    Ding, Guiguang
    Han, Jungong
    Tang, Sheng
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6797 - 6804
  • [36] Filter pruning with uniqueness mechanism in the frequency domain for efficient neural networks
    Zhang, Shuo
    Gao, Mingqi
    Ni, Qiang
    Han, Jungong
    [J]. NEUROCOMPUTING, 2023, 530 : 116 - 124
  • [37] ONLINE FILTER CLUSTERING AND PRUNING FOR EFFICIENT CONVNETS
    Zhou, Zhengguang
    Zhou, Wengang
    Hong, Richang
    Li, Houqiang
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 11 - 15
  • [38] Genius: Subteam Replacement with Clustering-based Graph Neural Networks
    Hu, Chuxuan
    Zhou, Qinghai
    Tong, Hanghang
    [J]. PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 10 - 18
  • [39] An Efficient End-to-End Channel Level Pruning Method for Deep Neural Networks Compression
    Zeng, Lei
    Chen, Shi
    Zeng, Sen
    [J]. PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 43 - 46
  • [40] A Flight Arrival Time Prediction Method Based on Cluster Clustering-Based Modular With Deep Neural Network
    Deng, Wu
    Li, Kunpeng
    Zhao, Huimin
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 6238 - 6247