An Information-Theoretic Justification for Model Pruning

被引:0
|
作者
Isik, Berivan [1 ]
Weissman, Tsachy [1 ]
No, Albert [2 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Hongik Univ, Seoul, South Korea
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
COMPRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the neural network (NN) compression problem, viewing the tension between the compression ratio and NN performance through the lens of rate-distortion theory. We choose a distortion metric that reflects the effect of NN compression on the model output and derive the tradeoff between rate (compression) and distortion. In addition to characterizing theoretical limits of NN compression, this formulation shows that pruning, implicitly or explicitly, must be a part of a good compression algorithm. This observation bridges a gap between parts of the literature pertaining to NN and data compression, respectively, providing insight into the empirical success of model pruning. Finally, we propose a novel pruning strategy derived from our information-theoretic formulation and show that it outperforms the relevant baselines on CIFAR-10 and ImageNet datasets.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] An Information-Theoretic Account of Static Index Pruning
    Chen, Ruey-Cheng
    Lee, Chia-Jung
    [J]. SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 163 - 172
  • [2] An information-theoretic model for steganography
    Cachin, C
    [J]. INFORMATION AND COMPUTATION, 2004, 192 (01) : 41 - 56
  • [3] ON AN INFORMATION-THEORETIC MODEL OF EXPLANATION
    WOODWARD, J
    [J]. PHILOSOPHY OF SCIENCE, 1987, 54 (01) : 21 - 44
  • [4] An information-theoretic model for steganography
    Cachin, C
    [J]. INFORMATION HIDING, 1998, 1525 : 306 - 318
  • [5] InfoMoD: Information-theoretic Model Diagnostics
    Esmaeilzadeh, Armin
    Golab, Lukasz
    Taghva, Kazem
    [J]. 35TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2023, 2023,
  • [6] Analysis of an information-theoretic model for communication
    Dickman, Ronald
    Moloney, Nicholas R.
    Altmann, Eduardo G.
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2012,
  • [7] An information-theoretic model of voting systems
    Hosp, Ben
    Vora, Poorvi L.
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2008, 48 (9-10) : 1628 - 1645
  • [8] Emergence of genetic coding: An information-theoretic model
    Piraveenan, Mahendra
    Polani, Daniel
    Prokopenko, Mikhail
    [J]. ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 42 - +
  • [9] AN INFORMATION-THEORETIC MODEL FOR SERIAL POSITION EFFECT
    THOMAS, HBG
    [J]. PSYCHOLOGICAL REVIEW, 1968, 75 (05) : 409 - +
  • [10] Information-Theoretic Foundation for the Weighted Updating Model
    Zinn, Jesse Aaron
    [J]. REVIEW OF BEHAVIORAL ECONOMICS, 2019, 6 (01): : 39 - 51