Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

被引:0
|
作者
Wang, Yuejiao [1 ]
Ma, Zhong [1 ]
Yang, Chaojie [1 ]
Yang, Yu [1 ]
Wei, Lu [1 ]
机构
[1] Xian Microelect Technol Inst, Xian 710065, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 01期
关键词
Mixed precision quantization; quantization strategy optimal assignment; reinforcement learning; neural network; model deployment;
D O I
10.32604/cmc.2024.047108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quantization algorithm compresses the original network by reducing the numerical bit width of the model, which improves the computation speed. Because different layers have different redundancy and sensitivity to data bit width. Reducing the data bit width will result in a loss of accuracy. Therefore, it is difficult to determine the optimal bit width for different parts of the network with guaranteed accuracy. Mixed precision quantization can effectively reduce the amount of computation while keeping the model accuracy basically unchanged. In this paper, a hardware-aware mixed precision quantization strategy optimal assignment algorithm adapted to low bit width is proposed, and reinforcement learning is used to automatically predict the mixed precision that meets the constraints of hardware resources. In the state-space design, the standard deviation of weights is used to measure the distribution difference of data, the execution speed feedback of simulated neural network accelerator inference is used as the environment to limit the action space of the agent, and the accuracy of the quantization model after retraining is used as the reward function to guide the agent to carry out deep reinforcement learning training. The experimental results show that the proposed method obtains a suitable model layer-by-layer quantization strategy under the condition that the computational resources are satisfied, and the model accuracy is effectively improved. The proposed method has strong intelligence and certain universality and has strong application potential in the field of mixed precision quantization and embedded neural network model deployment.
引用
收藏
页码:819 / 836
页数:18
相关论文
共 50 条
  • [1] Optimal Defense Strategy Selection Algorithm Based on Reinforcement Learning and Opposition-Based Learning
    Yue, Yiqun
    Zhou, Yang
    Xu, Lijuan
    Zhao, Dawei
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [2] A Precision Advertising Strategy Based on Deep Reinforcement Learning
    Liang H.
    [J]. Ingenierie des Systemes d'Information, 2020, 25 (03): : 397 - 403
  • [3] Data Quality-Aware Mixed-Precision Quantization via Hybrid Reinforcement Learning
    Wang, Yingchun
    Guo, Song
    Guo, Jingcai
    Zhang, Yuanhong
    Zhang, Weizhan
    Zheng, Qinghua
    Zhang, Jie
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [4] The Optimal Path Finding Algorithm Based on Reinforcement Learning
    Khekare, Ganesh
    Verma, Pushpneel
    Dhanre, Urvashi
    Raut, Seema
    Sheikh, Shahrukh
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2020, 12 (04): : 1 - 18
  • [5] Gate Assignment Algorithm for Airport Peak Time Based on Reinforcement Learning
    Zhu, Chenwei
    Wei, Zhenchun
    Lyu, Zengwei
    Yuan, Xiaohui
    Hang, Dawei
    Feng, Lin
    [J]. TRANSPORTATION RESEARCH RECORD, 2024, : 750 - 760
  • [6] Optimal mixed block withholding attacks based on reinforcement learning
    Wang, Yilei
    Yang, Guoyu
    Li, Tao
    Zhang, Lifeng
    Wang, Yanli
    Ke, Lishan
    Dou, Yi
    Li, Shouzhe
    Yu, Xiaomei
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2020, 35 (12) : 2032 - 2048
  • [7] Optimal operation strategy of microgrid based on deep reinforcement learning
    Zhao P.
    Wu J.
    Wang Y.
    Zhang H.
    [J]. Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2022, 42 (11): : 9 - 16
  • [8] Transit signal priority strategy based on reinforcement learning algorithm
    [J]. Li, D.-M. (damingl@163.com), 2012, Northeast University (33):
  • [9] Credit of optimal state transition based reinforcement learning algorithm
    Bai, TF
    Wu, GF
    [J]. PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 62 - 66
  • [10] Differential evolution with mixed mutation strategy based on deep reinforcement learning
    Tan, Zhiping
    Li, Kangshun
    [J]. APPLIED SOFT COMPUTING, 2021, 111