A Review of State-of-the-Art Mixed-Precision Neural Network Frameworks

Cited by: 0
Authors
Rakka M. [1]
Fouda M.E. [3]
Khargonekar P. [1]
Kurdahi F. [1]
Affiliations
[1] Cyber-physical Systems, University of California-Irvine, Irvine, CA
[2] Rain Neuromorphics Inc., San Francisco
Keywords
Artificial neural networks; Computational Complexity; Deep Neural Networks; Edge Inference; Hardware; Logic gates; Memory management; Mixed-Precision Neural Networks; Optimization; Quantization; Quantization (signal); Training
DOI
10.1109/TPAMI.2024.3394390
Abstract
Mixed-precision Deep Neural Networks (DNNs) provide an efficient solution for hardware deployment, especially under resource constraints, while maintaining model accuracy. Identifying the ideal bit precision for each layer, however, remains a challenge given the vast array of models, datasets, and quantization schemes, leading to an expansive search space. Recent literature has addressed this challenge, resulting in several promising frameworks. This paper offers a comprehensive overview of the standard quantization classifications prevalent in existing studies. A detailed survey of current mixed-precision frameworks is provided, with an in-depth comparative analysis highlighting their respective merits and limitations. The paper concludes with insights into potential avenues for future research in this domain.
Pages: 1 / 20
Page count: 19