A Review of State-of-the-Art Mixed-Precision Neural Network Frameworks

被引:0
|
作者
Rakka M. [1 ]
Fouda M.E. [3 ]
Khargonekar P. [1 ]
Kurdahi F. [1 ]
机构
[1] Cyber-physical Systems, University of California-Irvine, Irvine, CA
[2] Rain Neuromorphics Inc., San Francisco
关键词
Artificial neural networks; Computational Complexity; Deep Neural Networks; Edge Inference; Hardware; Logic gates; Memory management; Mixed-Precision Neural Networks; Optimization; Quantization; Quantization (signal); Training;
D O I
10.1109/TPAMI.2024.3394390
中图分类号
学科分类号
摘要
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain. IEEE
引用
收藏
页码:1 / 20
页数:19
相关论文
共 50 条
  • [31] Precision Medicine in the Management of Dilated Cardiomyopathy JACC State-of-the-Art Review
    Fatkin, Diane
    Huttner, Inken G.
    Kovacic, Jason C.
    Seidman, J. G.
    Seidman, Christine E.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 74 (23) : 2921 - 2938
  • [32] Advances in precision micro/nano-electroforming: a state-of-the-art review
    Zhang, Honggang
    Zhang, Nan
    Gilchrist, Michael
    Fang, Fengzhou
    JOURNAL OF MICROMECHANICS AND MICROENGINEERING, 2020, 30 (10)
  • [33] A state-of-the-art review of image motion deblurring techniques in precision agriculture
    Yu, Huihui
    Li, Daoliang
    Chen, Yingyi
    HELIYON, 2023, 9 (06)
  • [34] Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization
    Chen, Weihan
    Wang, Peisong
    Cheng, Jian
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5330 - 5339
  • [35] Mixed-precision quantization for neural networks based on error limit (Invited)
    Li Y.
    Guo Z.
    Liu K.
    Sun X.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2022, 51 (04):
  • [36] Mixed-precision Quantization with Dynamical Hessian Matrix for Object Detection Network
    Yang, Zerui
    Fei, Wen
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [37] Sustainable development of process facilities: State-of-the-art review of pollution prevention frameworks
    Hossain, Khandoker A.
    Khan, Faisal I.
    Hawboldt, Kelly
    JOURNAL OF HAZARDOUS MATERIALS, 2008, 150 (01) : 4 - 20
  • [38] Molecular design of covalent organic frameworks for seawater desalination: A state-of-the-art review
    Jrad, Asmaa
    Olson, Mark A.
    Trabolsi, Ali
    CHEM, 2023, 9 (06): : 1413 - 1451
  • [39] Empowering the Vehicular Network with RIS Technology: A State-of-the-Art Review
    Naaz, Farheen
    Nauman, Ali
    Khurshaid, Tahir
    Kim, Sung-Won
    SENSORS, 2024, 24 (02)
  • [40] Reverse Logistics Network Design: A State-of-the-art Literature Review
    Chanintrakul, Piyawat
    Mondragon, Adrian E. Coronado
    Lalwani, Chandra
    ICPOM2008: PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE OF PRODUCTION AND OPERATION MANAGEMENT, VOLUMES 1-3, 2008, : 1310 - 1315