Trained Rank Pruning for Efficient Deep Neural Networks

Cited by: 11
Authors
Xu, Yuhui [1]
Li, Yuxi [1]
Zhang, Shuai [2]
Wen, Wei [3]
Wang, Botao [2]
Dai, Wenrui [1]
Qi, Yingyong [2]
Chen, Yiran [3]
Lin, Weiyao [1]
Xiong, Hongkai [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Qualcomm AI Res, San Diego, CA 92121 USA
[3] Duke Univ, Durham, NC 27706 USA
Funding
National Natural Science Foundation of China
Keywords
low-rank; decomposition; acceleration; pruning;
DOI
10.1109/EMC2-NIPS53020.2019.00011
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
To accelerate DNN inference, low-rank approximation has been widely adopted because of its solid theoretical rationale and efficient implementations. Several previous works attempted to directly approximate a pre-trained model by low-rank decomposition; however, small approximation errors in the parameters can ripple into a large prediction loss. Clearly, separating low-rank approximation from training is suboptimal. Unlike previous works, this paper integrates low-rank approximation and regularization into the training process. We propose Trained Rank Pruning (TRP), which alternates between low-rank approximation and training. TRP maintains the capacity of the original network while imposing low-rank constraints during training. A nuclear-norm regularizer optimized by stochastic sub-gradient descent is used to further promote low rank in TRP. Networks trained with TRP have an inherently low-rank structure and can be approximated with negligible performance loss, eliminating the need for fine-tuning after low-rank approximation. The proposed method is comprehensively evaluated on CIFAR-10 and ImageNet, outperforming previous compression methods based on low-rank approximation.
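The two mechanisms the abstract names, alternating low-rank approximation with training and a nuclear-norm regularizer optimized by stochastic sub-gradient descent, can be illustrated with a short sketch. The PyTorch code below is a minimal rendering of that idea, not the authors' implementation: it restricts itself to nn.Linear weights for brevity (the paper targets convolutional networks, whose kernels would first be reshaped to 2-D matrices), and the names low_rank_project, nuclear_subgradient, trp_step, rank_ratio, trp_period, and nuclear_lambda are illustrative assumptions, not the paper's API.

import torch
import torch.nn as nn

def low_rank_project(weight, rank_ratio=0.25):
    # Truncated SVD: keep only the top-k singular triplets of a 2-D weight.
    U, S, Vh = torch.linalg.svd(weight, full_matrices=False)
    k = max(1, int(rank_ratio * S.numel()))
    return U[:, :k] @ torch.diag(S[:k]) @ Vh[:k, :]

def nuclear_subgradient(weight):
    # A sub-gradient of the nuclear norm ||W||_* at W = U diag(S) Vh is U @ Vh.
    U, _, Vh = torch.linalg.svd(weight, full_matrices=False)
    return U @ Vh

def trp_step(model, loss, optimizer, step, trp_period=20, nuclear_lambda=1e-4):
    # One training step in the spirit of TRP: ordinary backprop, a nuclear-norm
    # sub-gradient added to each 2-D weight gradient, and a periodic low-rank
    # projection that alternates with training.
    optimizer.zero_grad()
    loss.backward()
    for m in model.modules():
        if isinstance(m, nn.Linear) and m.weight.grad is not None:
            m.weight.grad += nuclear_lambda * nuclear_subgradient(m.weight.detach())
    optimizer.step()
    if step % trp_period == 0:  # alternate: snap weights back to low rank
        with torch.no_grad():
            for m in model.modules():
                if isinstance(m, nn.Linear):
                    m.weight.copy_(low_rank_project(m.weight))

In TRP proper, the projection is applied layer-wise to reshaped convolution kernels and the retained rank is chosen per layer; this sketch fixes a single rank ratio purely for readability.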
Pages: 14-17
Page count: 4