Large-Scale Machine Learning with Stochastic Gradient Descent

Cited by: 3576
Author
Bottou, Leon [1]
Affiliation
[1] NEC Labs Amer, Princeton, NJ 08542 USA
Keywords
stochastic gradient descent; online learning; efficiency
DOI
10.1007/978-3-7908-2604-3_16
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
During the last decade, the data sizes have grown faster than the speed of processors. In this context, the capabilities of statistical machine learning methods are limited by the computing time rather than the sample size. A more precise analysis uncovers qualitatively different tradeoffs for the case of small-scale and large-scale learning problems. The large-scale case involves the computational complexity of the underlying optimization algorithm in non-trivial ways. Unlikely optimization algorithms such as stochastic gradient descent show amazing performance for large-scale problems. In particular, second order stochastic gradient and averaged stochastic gradient are asymptotically efficient after a single pass on the training set.
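The single-pass behaviour of plain and averaged stochastic gradient descent mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration rather than material from the paper: the squared loss, the synthetic linear data, the step-size schedule `lr0 / (1 + t) ** decay`, and the names `sgd_single_pass` and `make_data`.

```python
import random


def make_data(n, true_w, noise=0.1, seed=0):
    """Synthetic linear-regression data (hypothetical, for illustration only)."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = [rng.uniform(-1.0, 1.0) for _ in true_w]
        y = sum(wi * xi for wi, xi in zip(true_w, x)) + rng.gauss(0.0, noise)
        data.append((x, y))
    return data


def sgd_single_pass(data, dim, lr0=0.5, decay=0.75):
    """Make one pass over `data`, visiting each example exactly once.

    Returns the last SGD iterate and the running average of all iterates
    (averaged SGD). The schedule lr0 / (1 + t) ** decay is an assumed
    choice, not one prescribed by the source.
    """
    w = [0.0] * dim
    w_avg = [0.0] * dim
    for t, (x, y) in enumerate(data):
        lr = lr0 / (1.0 + t) ** decay
        # Prediction error on this single example.
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        # The gradient of 0.5 * err**2 with respect to w is err * x.
        w = [wi - lr * err * xi for wi, xi in zip(w, x)]
        # Incremental mean of the iterates seen so far.
        w_avg = [a + (wi - a) / (t + 1) for a, wi in zip(w_avg, w)]
    return w, w_avg


true_w = [2.0, -1.0]
data = make_data(20000, true_w)
w_last, w_avg = sgd_single_pass(data, dim=len(true_w))
print("last iterate:    ", w_last)
print("averaged iterate:", w_avg)
```

Each update touches one example, so the cost of the pass grows linearly with the number of examples. The sketch does not cover the second-order variant, which would additionally rescale the gradient by an approximation of the inverse Hessian.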
Pages: 177-186
Page count: 10