Neural Network Compression and Acceleration by Federated Pruning

被引：4

作者：

Pei, Songwen ^{[1
,2
,3
]}

Wu, Yusheng ^{[1
]}

Qiu, Meikang ^{[4
]}

机构：

[1] Univ Shanghai Sci & Technol, Shanghai 200093, Peoples R China

[2] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing 100190, Peoples R China

[3] Fudan Univ, Shanghai Key Lab Data Sci, Shanghai 200433, Peoples R China

[4] Texas A&M Univ Commerce, Dept Comp Sci, Commerce, TX 75428 USA

来源：

ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II | 2020年 / 12453卷

基金：

中国国家自然科学基金;

关键词：

Model compression; Channel pruning; Federated pruning; Neural network; Pre-trained model; SYSTEM;

D O I：

10.1007/978-3-030-60239-0_12

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, channel pruning is one of the important methods for deep model compression. But the resulting model still has tremendous redundant feature maps. In this paper, we propose a novel method, namely federated pruning algorithm, to achieve narrower model with negligible performance degradation. Different from many existing approaches, the federated pruning algorithm removes all filters in the pre-trained model together with their connecting feature map by combining the weights with the importance of the channels, rather than pruning the network in terms of a single criterion. Finally, we fine-tune the resulting model to restore network performance. Extensive experiments demonstrate the effectiveness of federated pruning algorithm. VGG-19 network pruned by federated pruning algorithm on CIFAR-10 achieves 92.5% reduction in total parameters and 13.58x compression ratio with only 0.23% decrease in accuracy. Meanwhile, tested on SVHN, VGG-19 achieves 94.5% reduction in total parameters and 18.01x compression ratio with only 0.43% decrease in accuracy.

引用

页码：173 / 183

页数：11

共 50 条

[1] Neural network pruning and hardware acceleration
Jeong, Taehee
Ghasemi, Ehsam
Tuyls, Jorn
Delaye, Elliott
Sirasao, Ashish
2020 IEEE/ACM 13TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2020), 2020, : 440 - 445
[2] Federated Pruning: Improving Neural Network Efficiency with Federated Learning
Lin, Rongmei
Xiao, Yonghui
Yang, Tien-Ju
Zhao, Ding
Xiong, Li
Motta, Giovanni
Beaufays, Francoise
INTERSPEECH 2022, 2022, : 1701 - 1705
[3] Dirichlet Pruning for Neural Network Compression
Adamczewski, Kamil
Park, Mijung
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[4] Automated Pruning for Deep Neural Network Compression
Manessi, Franco
Rozza, Alessandro
Bianco, Simone
Napoletano, Paolo
Schettini, Raimondo
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 657 - 664
[5] ON THE ROLE OF STRUCTURED PRUNING FOR NEURAL NETWORK COMPRESSION
Bragagnolo, Andrea
Tartaglione, Enzo
Fiandrotti, Attilio
Grangetto, Marco
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3527 - 3531
[6] Quantisation and Pruning for Neural Network Compression and Regularisation
Paupamah, Kimessha
James, Steven
Klein, Richard
2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 295 - 300
[7] Pruning and quantization for deep neural network acceleration: A survey
Liang, Tailin
Glossner, John
Wang, Lei
Shi, Shaobo
Zhang, Xiaotong
NEUROCOMPUTING, 2021, 461 : 370 - 403
[8] Revisiting Random Channel Pruning for Neural Network Compression
Li, Yawei
Adamczewski, Kamil
Li, Wen
Gu, Shuhang
Timofte, Radu
Van Gool, Luc
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 191 - 201
[9] Convolutional neural network acceleration algorithm based on filters pruning
Li H.
Zhao W.-J.
Han B.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (10): : 1994 - 2002
[10] An FSCV Deep Neural Network: Development, Pruning, and Acceleration on an FPGA
Zhang, Zhichao
Oh, Yoonbae
Adams, Scott D.
Bennet, Kevin E.
Kouzani, Abbas Z.
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) : 2248 - 2259

← 1 2 3 4 5 →