Iteratively Training Look-Up Tables for Network Quantization

被引:8
|
作者
Cardinaux, Fabien [1 ]
Uhlich, Stefan [1 ]
Yoshiyama, Kazuki [1 ]
Garcia, Javier Alonso [1 ]
Mauch, Lukas [1 ]
Tiedemann, Stephen [1 ]
Kemp, Thomas [1 ]
Nakamura, Akira [2 ]
机构
[1] Sony Europe BV, D-70327 Stuttgart, Germany
[2] Sony Corp, Tokyo 1080075, Japan
关键词
Neural network compression; network quantization; look-up table quantization; weight tying; multiplier-less networks; multiplier-less batch normalization; NEURAL-NETWORKS;
D O I
10.1109/JSTSP.2020.3005030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Operating deep neural networks (DNNs) on devices with limited resources requires the reduction of their memory as well as computational footprint. Popular reduction methods are network quantization or pruning, which either reduce the word length of the network parameters or remove weights from the network if they are not needed. In this article, we discuss a general framework for network reduction which we call Look-Up Table Quantization (LUT-Q). For each layer, we learn a value dictionary and an assignment matrix to represent the network weights. We propose a special solver which combines gradient descent and a one-step k-means update to learn both the value dictionaries and assignment matrices iteratively. This method is very flexible: by constraining the value dictionary, many different reduction problems such as non-uniform network quantization, training of multiplierless networks, network pruning, or simultaneous quantization and pruning can be implemented without changing the solver. This flexibility of the LUT-Q method allows us to use the same method to train networks for different hardware capabilities.
引用
收藏
页码:860 / 870
页数:11
相关论文
共 50 条
  • [1] Post-Quantization Dithering with Look-Up Tables
    Kasher, Morriel
    Tinston, Michael
    Spasojevic, Predrag
    2024 58TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, CISS, 2024,
  • [2] Repeated Look-Up Tables
    Reinhard, Erik
    Garces, Elena
    Stauder, Jurgen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2370 - 2379
  • [3] IMPLEMENTATION OF DYNAMIC LOOK-UP TABLES
    CLARKE, N
    CANTONI, A
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1994, 141 (06): : 391 - 397
  • [4] INMOS COLOR LOOK-UP TABLES
    不详
    MICROPROCESSORS AND MICROSYSTEMS, 1992, 16 (01) : 37 - 49
  • [5] MICROCOMPUTER INTERFACING - LOOK-UP TABLES
    TITUS, CA
    LARSEN, DG
    RONY, PR
    TITUS, JA
    AMERICAN LABORATORY, 1979, 11 (05) : 113 - &
  • [6] Monotonic Calibrated Interpolated Look-Up Tables
    Gupta, Maya
    Cotter, Andrew
    Pfeifer, Jan
    Voevodski, Konstantin
    Canini, Kevin
    Mangylov, Alexander
    Moczydlowski, Wojciech
    van Esbroeck, Alexander
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [7] Irregular Invertible Bloom Look-Up Tables
    Lazaro, Francisco
    Matuz, Balazs
    2021-11TH INTERNATIONAL SYMPOSIUM ON TOPICS IN CODING (ISTC'21), 2021,
  • [8] Perceptually Optimizing Color Look-up Tables
    Reinhard, Johann
    Urban, Philipp
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 403 - 414
  • [9] Speaking Stata: Matrices as look-up tables
    Cox, Nicholas J.
    STATA JOURNAL, 2012, 12 (04): : 748 - 758
  • [10] Wave pipelines via look-up tables
    Boemo, EI
    LopezBuedo, S
    Meneses, JM
    ISCAS 96: 1996 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - CIRCUITS AND SYSTEMS CONNECTING THE WORLD, VOL 4, 1996, : 185 - 188