HEMP: High-order entropy minimization for neural network compression

Cited by: 6
Authors
Tartaglione, Enzo [1]
Lathuiliere, Stephane [2]
Fiandrotti, Attilio [1, 2]
Cagnazzo, Marco [2]
Grangetto, Marco [1]
Affiliations
[1] Univ Torino, Turin, Italy
[2] Telecom Paris, Paris, France
Keywords
Deep learning; Compression; Entropy; Neural networks; Regularization
DOI
10.1016/j.neucom.2021.07.022
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We formulate the entropy of a quantized artificial neural network as a differentiable function that can be plugged as a regularization term into the cost function minimized by gradient descent. Our formulation scales efficiently beyond the first order and is agnostic of the quantization scheme. The network can then be trained to minimize the entropy of the quantized parameters, so that they can be optimally compressed via entropy coding. We apply our entropy formulation to quantizing and compressing well-known network architectures over multiple datasets. Our approach compares favorably with similar methods: it benefits from a higher-order entropy estimate, is flexible towards non-uniform quantization (we use Lloyd-Max quantization), scales to any entropy order to be minimized, and is efficient in terms of compression. We show that HEMP is able to work in synergy with other approaches aiming at pruning or quantizing the model itself, delivering significant benefits in terms of storage size compressibility without harming the model's performance. © 2021 Elsevier B.V. All rights reserved.
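The idea of treating the entropy of quantized parameters as a smooth quantity can be illustrated with a minimal sketch. This is not the authors' formulation: it covers only the first order, and it assumes a softmax soft assignment of each weight to a fixed set of quantization levels. The names `soft_assignments`, `soft_entropy_bits`, `centers`, and `temperature` are hypothetical and chosen for illustration only.

```python
import math

def soft_assignments(w, centers, temperature):
    """Softmax over negative squared distances to each quantization level.

    Assumption: this soft assignment stands in for hard quantization so that
    the resulting probabilities vary smoothly with the weight value.
    """
    logits = [-((w - c) ** 2) / temperature for c in centers]
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return [e / total for e in exps]

def soft_entropy_bits(weights, centers, temperature=1e-3):
    """First-order entropy (bits per symbol) of the soft-quantized weights."""
    probs = [0.0] * len(centers)
    for w in weights:
        for k, p in enumerate(soft_assignments(w, centers, temperature)):
            probs[k] += p / len(weights)
    # Levels with zero probability contribute nothing to the entropy.
    return -sum(p * math.log2(p) for p in probs if p > 0.0)

# Hypothetical quantization levels and two toy weight vectors.
centers = [-1.0, 0.0, 1.0]
spread = [-1.0, 0.0, 1.0, -1.0, 0.0, 1.0]  # levels used equally often
peaked = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0]    # one level dominates

h_spread = soft_entropy_bits(spread, centers)
h_peaked = soft_entropy_bits(peaked, centers)
print(h_spread, h_peaked)
```

In this toy setting the weights concentrated on one level give a lower entropy than the evenly spread ones, which is the property an entropy regularizer would push towards. In a training setup the same expression would be written in an automatic-differentiation framework and added to the loss.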
Pages: 244-253
Page count: 10