Dimension-Free Bounds for Low-Precision Training

被引:0
|
作者
Li, Zheng [1 ]
De Sa, Christopher [2 ]
机构
[1] Tsinghua Univ, IIIS, Beijing, Peoples R China
[2] Cornell Univ, Ithaca, NY 14853 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-precision training is a promising way of decreasing the time and energy cost of training machine learning models. Previous work has analyzed low-precision training algorithms, such as low-precision stochastic gradient descent, and derived theoretical bounds on their convergence rates. These bounds tend to depend on the dimension of the model d in that the number of bits needed to achieve a particular error bound increases as d increases. In this paper, we derive new bounds for low-precision training algorithms that do not contain the dimension d, which lets us better understand what affects the convergence of these algorithms as parameters scale. Our methods also generalize naturally to let us prove new convergence bounds on low-precision training with other quantization schemes, such as low-precision floating-point computation and logarithmic quantization.
引用
收藏
页数:29
相关论文
共 50 条
  • [21] Dimension-Free Empirical Entropy Estimation
    Cohen, Doron
    Kontorovich, Aryeh
    Koolyk, Aaron
    Wolfer, Geoffrey
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [22] On dimension-free Sobolev imbeddings II
    Miroslav Krbec
    Hans-Jürgen Schmeisser
    Revista Matemática Complutense, 2012, 25 : 247 - 265
  • [23] DIMENSION-FREE UNIFORMITY WITH APPLICATIONS, I
    Beck, Jozsef
    MATHEMATIKA, 2017, 63 (03) : 734 - 761
  • [24] On dimension-free Sobolev imbeddings I
    Krbec, Miroslav
    Schmeisser, Hans-Juergen
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2012, 387 (01) : 114 - 125
  • [25] Dimension-Free Harnack Inequalities on Spaces
    Li, Huaiqian
    JOURNAL OF THEORETICAL PROBABILITY, 2016, 29 (04) : 1280 - 1297
  • [26] Quantization and training of object detection networks with low-precision weights and activations
    Yang, Bo
    Liu, Jian
    Zhou, Li
    Wang, Yun
    Chen, Jie
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (01)
  • [27] TESTING LOW-PRECISION OPTICS
    SPEYER, E
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA, 1958, 48 (12) : 953 - 954
  • [28] Dimension-free Complexity Bounds for High-order Nonconvex Finite-sum Optimization
    Zhou, Dongruo
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [29] POSITNN: TRAINING DEEP NEURAL NETWORKS WITH MIXED LOW-PRECISION POSIT
    Raposo, Goncalo
    Tomas, Pedro
    Roma, Nuno
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7908 - 7912
  • [30] You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding
    Yuan, Geng
    Chang, Sung-En
    Jin, Qing
    Lu, Alec
    Li, Yanyu
    Wu, Yushu
    Kong, Zhenglun
    Xie, Yanyue
    Dong, Peiyan
    Qin, Minghai
    Ma, Xiaolong
    Tang, Xulong
    Fang, Zhenman
    Wang, Yanzhi
    COMPUTER VISION, ECCV 2022, PT XII, 2022, 13672 : 34 - 51