Dimension-Free Bounds for Low-Precision Training

被引:0
|
作者
Li, Zheng [1 ]
De Sa, Christopher [2 ]
机构
[1] Tsinghua Univ, IIIS, Beijing, Peoples R China
[2] Cornell Univ, Ithaca, NY 14853 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-precision training is a promising way of decreasing the time and energy cost of training machine learning models. Previous work has analyzed low-precision training algorithms, such as low-precision stochastic gradient descent, and derived theoretical bounds on their convergence rates. These bounds tend to depend on the dimension of the model d in that the number of bits needed to achieve a particular error bound increases as d increases. In this paper, we derive new bounds for low-precision training algorithms that do not contain the dimension d, which lets us better understand what affects the convergence of these algorithms as parameters scale. Our methods also generalize naturally to let us prove new convergence bounds on low-precision training with other quantization schemes, such as low-precision floating-point computation and logarithmic quantization.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Dimension-free bounds and structural results in communication complexity
    Lianna Hambardzumyan
    Hamed Hatami
    Pooya Hatami
    [J]. Israel Journal of Mathematics, 2023, 253 : 555 - 616
  • [2] Dimension-Free Error Bounds from Random Projections
    Kaban, Ata
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4049 - 4056
  • [3] Dimension-free bounds and structural results in communication complexity
    Hambardzumyan, Lianna
    Hatami, Iiamed
    Hatami, Pooya
    [J]. ISRAEL JOURNAL OF MATHEMATICS, 2023, 253 (02) : 555 - 616
  • [4] Dimension-Free Bounds for the Union-Closed Sets Conjecture
    Yu, Lei
    [J]. ENTROPY, 2023, 25 (05)
  • [5] Dimension-free Concentration Bounds on Hankel Matrices for Spectral Learning
    Denis, Francois
    Gybels, Mattias
    Habrard, Amaury
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [6] Dimension-free Concentration Bounds on Hankel Matrices for Spectral Learning
    Denis, Francois
    Gybels, Mattias
    Habrard, Amaury
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
  • [8] Dimension-free bounds for largest singular values of matrix Gaussian series
    Gao, Xianjie
    Zhang, Chao
    Zhang, Hongwei
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2021, 50 (10) : 2419 - 2428
  • [9] SWALP: Stochastic Weight Averaging in Low-Precision Training
    Yang, Guandao
    Zhang, Tianyi
    Kirichenko, Polina
    Bai, Junwen
    Wilson, Andrew Gordon
    De Sa, Christopher
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [10] Hyperbolic geometry, dimension-free
    Benz, W
    [J]. Non-Euclidean Geometries, 2006, 581 : 97 - 107