Large-Scale Machine Learning with Stochastic Gradient Descent

被引:3576
|
作者
Bottou, Leon [1 ]
机构
[1] NEC Labs Amer, Princeton, NJ 08542 USA
关键词
stochastic gradient descent; online learning; efficiency;
D O I
10.1007/978-3-7908-2604-3_16
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
During the last decade, the data sizes have grown faster than the speed of processors. In this context, the capabilities of statistical machine learning methods is limited by the computing time rather than the sample size. A more precise analysis uncovers qualitatively different tradeoffs for the case of small-scale and large-scale learning problems. The large-scale case involves the computational complexity of the underlying optimization algorithm in non-trivial ways. Unlikely optimization algorithms such as stochastic gradient descent show amazing performance for large-scale problems. In particular, second order stochastic gradient and averaged stochastic gradient are asymptotically efficient after a single pass on the training set.
引用
收藏
页码:177 / 186
页数:10
相关论文
共 50 条
  • [41] Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning
    Zhang, Yao
    Saxe, Andrew M.
    Advani, Madhu S.
    Lee, Alpha A.
    MOLECULAR PHYSICS, 2018, 116 (21-22) : 3214 - 3223
  • [42] Manifold Learning Method for Large Scale Dataset Based on Gradient Descent
    Wang, Yunhe
    Gao, Yuan
    Xu, Chao
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1187 - 1194
  • [43] Parallel stochastic gradient algorithms for large-scale matrix completion
    Recht B.
    Ré C.
    Mathematical Programming Computation, 2013, 5 (2) : 201 - 226
  • [44] Efficient Machine Learning On Large-Scale Graphs
    Erickson, Parker
    Lee, Victor E.
    Shi, Feng
    Tang, Jiliang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4788 - 4789
  • [45] Large-scale kernel extreme learning machine
    Deng, Wan-Yu
    Zheng, Qing-Hua
    Chen, Lin
    Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (11): : 2235 - 2246
  • [46] Machine learning for large-scale MOF screening
    Coupry, Damien
    Groot, Laurens
    Addicoat, Matthew
    Heine, Thomas
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [47] Large-Scale Machine Learning and Neuroimaging in Psychiatry
    Thompson, Paul
    BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S51 - S51
  • [48] Coding for Large-Scale Distributed Machine Learning
    Xiao, Ming
    Skoglund, Mikael
    ENTROPY, 2022, 24 (09)
  • [49] Large-scale Machine Learning over Graphs
    Yang, Yiming
    PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 9 - 9
  • [50] Robust Large-Scale Machine Learning in the Cloud
    Rendle, Steffen
    Fetterly, Dennis
    Shekita, Eugene J.
    Su, Bor-yiing
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1125 - 1134