Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

被引：0

作者：

Wang, Yuejiao ^{[1
]}

Ma, Zhong ^{[1
]}

Yang, Chaojie ^{[1
]}

Yang, Yu ^{[1
]}

Wei, Lu ^{[1
]}

机构：

[1] Xian Microelect Technol Inst, Xian 710065, Peoples R China

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 01期

关键词：

Mixed precision quantization; quantization strategy optimal assignment; reinforcement learning; neural network; model deployment;

D O I：

10.32604/cmc.2024.047108

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The quantization algorithm compresses the original network by reducing the numerical bit width of the model, which improves the computation speed. Because different layers have different redundancy and sensitivity to data bit width. Reducing the data bit width will result in a loss of accuracy. Therefore, it is difficult to determine the optimal bit width for different parts of the network with guaranteed accuracy. Mixed precision quantization can effectively reduce the amount of computation while keeping the model accuracy basically unchanged. In this paper, a hardware-aware mixed precision quantization strategy optimal assignment algorithm adapted to low bit width is proposed, and reinforcement learning is used to automatically predict the mixed precision that meets the constraints of hardware resources. In the state-space design, the standard deviation of weights is used to measure the distribution difference of data, the execution speed feedback of simulated neural network accelerator inference is used as the environment to limit the action space of the agent, and the accuracy of the quantization model after retraining is used as the reward function to guide the agent to carry out deep reinforcement learning training. The experimental results show that the proposed method obtains a suitable model layer-by-layer quantization strategy under the condition that the computational resources are satisfied, and the model accuracy is effectively improved. The proposed method has strong intelligence and certain universality and has strong application potential in the field of mixed precision quantization and embedded neural network model deployment.

引用

页码：819 / 836

页数：18

共 50 条

[1] Optimal Defense Strategy Selection Algorithm Based on Reinforcement Learning and Opposition-Based Learning
Yue, Yiqun
Zhou, Yang
Xu, Lijuan
Zhao, Dawei
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
[2] A Precision Advertising Strategy Based on Deep Reinforcement Learning
Liang H.
[J]. Ingenierie des Systemes d'Information, 2020, 25 (03): : 397 - 403
[3] Data Quality-Aware Mixed-Precision Quantization via Hybrid Reinforcement Learning
Wang, Yingchun
Guo, Song
Guo, Jingcai
Zhang, Yuanhong
Zhang, Weizhan
Zheng, Qinghua
Zhang, Jie
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[4] The Optimal Path Finding Algorithm Based on Reinforcement Learning
Khekare, Ganesh
Verma, Pushpneel
Dhanre, Urvashi
Raut, Seema
Sheikh, Shahrukh
[J]. INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2020, 12 (04): : 1 - 18
[5] Gate Assignment Algorithm for Airport Peak Time Based on Reinforcement Learning
Zhu, Chenwei
Wei, Zhenchun
Lyu, Zengwei
Yuan, Xiaohui
Hang, Dawei
Feng, Lin
[J]. TRANSPORTATION RESEARCH RECORD, 2024, : 750 - 760
[6] Optimal mixed block withholding attacks based on reinforcement learning
Wang, Yilei
Yang, Guoyu
Li, Tao
Zhang, Lifeng
Wang, Yanli
Ke, Lishan
Dou, Yi
Li, Shouzhe
Yu, Xiaomei
[J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (12) : 2032 - 2048
[7] Optimal operation strategy of microgrid based on deep reinforcement learning
Zhao P.
Wu J.
Wang Y.
Zhang H.
[J]. Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2022, 42 (11): : 9 - 16
[8] Transit signal priority strategy based on reinforcement learning algorithm
[J]. Li, D.-M. (damingl@163.com), 2012, Northeast University (33):
[9] Credit of optimal state transition based reinforcement learning algorithm
Bai, TF
Wu, GF
[J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 62 - 66
[10] Differential evolution with mixed mutation strategy based on deep reinforcement learning
Tan, Zhiping
Li, Kangshun
[J]. APPLIED SOFT COMPUTING, 2021, 111

← 1 2 3 4 5 →