A Review of State-of-the-Art Mixed-Precision Neural Network Frameworks

被引:0
|
作者
Rakka M. [1 ]
Fouda M.E. [3 ]
Khargonekar P. [1 ]
Kurdahi F. [1 ]
机构
[1] Cyber-physical Systems, University of California-Irvine, Irvine, CA
[2] Rain Neuromorphics Inc., San Francisco
关键词
Artificial neural networks; Computational Complexity; Deep Neural Networks; Edge Inference; Hardware; Logic gates; Memory management; Mixed-Precision Neural Networks; Optimization; Quantization; Quantization (signal); Training;
D O I
10.1109/TPAMI.2024.3394390
中图分类号
学科分类号
摘要
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain. IEEE
引用
收藏
页码:1 / 20
页数:19
相关论文
共 50 条
  • [1] Mixed-precision Deep Neural Network Quantization With Multiple Compression Rates
    Wang, Xuanda
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 371 - 371
  • [2] EVOLUTIONARY QUANTIZATION OF NEURAL NETWORKS WITH MIXED-PRECISION
    Liu, Zhenhua
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2785 - 2789
  • [3] Application of artificial neural network in environmental engineering - a state-of-the-art review
    Chandanshive, Viren
    Shanbhag, Ashwini
    INTERNATIONAL JOURNAL OF ENVIRONMENT AND WASTE MANAGEMENT, 2024, 33 (04) : 499 - 510
  • [4] Campo: Cost-Aware Performance Optimization for Mixed-Precision Neural Network Training
    He, Xin
    Sun, Jianhua
    Chen, Hao
    Li, Dong
    PROCEEDINGS OF THE 2022 USENIX ANNUAL TECHNICAL CONFERENCE, 2022, : 505 - 518
  • [5] Hardware for Quantized Mixed-Precision Deep Neural Networks
    Rios, Andres
    Nava, Patricia
    PROCEEDINGS OF THE 2022 15TH IEEE DALLAS CIRCUITS AND SYSTEMS CONFERENCE (DCAS 2022), 2022,
  • [6] State-of-the-art technologies in precision agriculture: a systematic review
    Bhakta, Ishita
    Phadikar, Santanu
    Majumder, Koushik
    JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE, 2019, 99 (11) : 4878 - 4888
  • [7] Low-latency Buffering for Mixed-precision Neural Network Accelerator with MulTAP and FQPipe
    Li, Yike
    Wang, Zheng
    Ou, Wenhui
    Liang, Chen
    Zhou, Weiyu
    Yang, Yongkui
    Chen, Chao
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [8] DPQ: dynamic pseudo-mean mixed-precision quantization for pruned neural network
    Pei, Songwen
    Wang, Jiyao
    Zhang, Bingxue
    Qin, Wei
    Xue, Hai
    Ye, Xiaochun
    Chen, Mingsong
    MACHINE LEARNING, 2024, 113 (07) : 4099 - 4112
  • [9] Rethinking Differentiable Search for Mixed-Precision Neural Networks
    Cai, Zhaowei
    Vasconcelos, Nuno
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2346 - 2355
  • [10] Research and applications of artificial neural network in pavement engineering:A state-of-the-art review
    Xu Yang
    Jinchao Guan
    Ling Ding
    Zhanping You
    Vincent C.S.Lee
    Mohd Rosli Mohd Hasan
    Xiaoyun Cheng
    Journal of Traffic and Transportation Engineering(English Edition), 2021, 8 (06) : 1000 - 1021