Lattice Quantization

被引:2
|
作者
Metz, Clement [1 ]
Allenet, Thibault [1 ]
Thiele, Johannes [2 ]
Dupret, Antoine [1 ]
Bichler, Olivier [1 ]
机构
[1] CEA, List, Palaiseau, France
[2] Axelera ai, Zurich, Switzerland
关键词
Artificial Intelligence; Neural networks; Quantization; Post-training;
D O I
10.23919/DATE56975.2023.10137188
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Post-training quantization of neural networks consists in quantizing a model without retraining nor hyperparameter search, while being fast and data frugal. In this paper, we propose LatticeQ, a novel post-training weight quantization method designed for deep convolutional neural networks (DCNNs). Contrary to scalar rounding widely used in state-of-the-art quantization methods, LatticeQ uses a quantizer based on lattices - discrete algebraic structures. LatticeQ exploits the inner correlations between the model parameters to the benefit of minimizing quantization error. We achieve state-of-the-art results in post-training quantization. In particular, we achieve ImageNet classification results close to full precision on Resnet-18/50, with little to no accuracy drop for 4-bit models. Our code is available here, and a more thorough version of the paper here.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] LATTICE QUANTIZATION
    GIBSON, JD
    SAYOOD, K
    [J]. ADVANCES IN ELECTRONICS AND ELECTRON PHYSICS, 1988, 72 : 259 - 330
  • [2] On lattice quantization noise
    Zamir, R
    Feder, M
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (04) : 1152 - 1159
  • [3] On lattice quantization noise
    Univ of California, Santa Barbara, United States
    [J]. IEEE Trans Inf Theory, 4 (1152-1159):
  • [4] QUANTIZATION OF LATTICE VIBRATIONS
    TAKAHASHI, Y
    [J]. ANNALS OF PHYSICS, 1967, 45 (01) : 132 - +
  • [5] On General Lattice Quantization Noise
    Gariby, Tal
    Erez, Uri
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-6, 2008, : 2717 - 2721
  • [6] QUANTIZATION OF THE OPEN TODA LATTICE
    DEBIARD, A
    GAVEAU, B
    [J]. COMPTES RENDUS DE L ACADEMIE DES SCIENCES SERIE I-MATHEMATIQUE, 1985, 301 (20): : 943 - 946
  • [7] Lattice quantization of Yangian charges
    MacKay, N. J.
    [J]. Physics Letters. Section B: Nuclear, Elementary Particle and High-Energy Physics, 1995, 349 (1-2):
  • [8] LATTICE QUANTIZATION OF YANGIAN CHARGES
    MACKAY, NJ
    [J]. PHYSICS LETTERS B, 1995, 349 (1-2) : 94 - 98
  • [9] FLUX-QUANTIZATION ON A LATTICE
    KUSMARTSEV, FV
    [J]. PHYSICA B, 1991, 169 (1-4): : 585 - 586
  • [10] Lattice Quantization Noise Revisited
    Ling, Cong
    Gan, Lu
    [J]. 2013 IEEE INFORMATION THEORY WORKSHOP (ITW), 2013,