The Frontier of SGD and Its Variants in Machine Learning

被引:4
|
作者
Du, Juan [1 ]
机构
[1] New Res & Dev Ctr Hisense, Qingdao 266071, Peoples R China
关键词
D O I
10.1088/1742-6596/1229/1/012046
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A Numerical optimization is a classical field in operation research and computer science, which has been widely used in areas such as physics and economics. Although optimization algorithms have achieved great success for plenty of applications, handling the big data in the best fashion possible is a very inspiring and demanding challenge in the artificial intelligence era. Stochastic gradient descent (SGD) is pretty simple but surprisingly, highly effective in machine learning models, such as support vector machine (SVM) and deep neural network (DNN). Theoretically, the performance of SGD for convex optimization is well understood. But, for the non-convex setting, which is very common for the machine learning problems, to obtain the theoretical guarantee for SGD and its variants is still a standing problem. In the paper, we do a survey about the SGD and its variants such as Momentum, ADAM and SVRG, differentiate their algorithms and applications and present some recent breakthrough and open problems.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [41] Review of research on restricted Boltzmann machine and its variants
    Wang Q.
    Gao X.
    Wu B.
    Hu Z.
    Wan K.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2024, 46 (07): : 2323 - 2345
  • [42] A Frontier: Dependable, Reliable and Secure Machine Learning for Network/System Management
    Duc C. Le
    Nur Zincir-Heywood
    Journal of Network and Systems Management, 2020, 28 : 827 - 849
  • [43] Deep Machine Learning-A New Frontier in Artificial Intelligence Research
    Arel, Itamar
    Rose, Derek C.
    Karnowski, Thomas P.
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (04) : 13 - 18
  • [44] A Frontier: Dependable, Reliable and Secure Machine Learning for Network/System Management
    Duc C Le
    Zincir-Heywood, Nur
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2020, 28 (04) : 827 - 849
  • [45] Knowledge Extraction in Web Media: At The Frontier of NLP, Machine Learning and Semantics
    Plu, Julien
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 317 - 322
  • [46] The Speech Generating Device (SGD) Mentoring Program: Supporting the Development of People Learning to Use an SGD
    Liora Ballin
    Susan Balandin
    Roger J. Stancliffe
    Journal of Developmental and Physical Disabilities, 2013, 25 : 437 - 459
  • [47] BASGD: Buffered Asynchronous SGD for Byzantine Learning
    Yang, Yi-Rui
    Li, Wu-Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [48] The Speech Generating Device (SGD) Mentoring Program: Supporting the Development of People Learning to Use an SGD
    Ballin, Liora
    Balandin, Susan
    Stancliffe, Roger J.
    JOURNAL OF DEVELOPMENTAL AND PHYSICAL DISABILITIES, 2013, 25 (04) : 437 - 459
  • [49] Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology
    Yan, Jun
    Wang, Xiangfeng
    PLANT JOURNAL, 2022, 111 (06): : 1527 - 1538
  • [50] A low-resolution real-time face recognition using extreme learning machine and its variants
    Rajpal, Ankit
    Sehra, Khushwant
    Mishra, Anurag
    Chetty, Girija
    IMAGING SCIENCE JOURNAL, 2023, 71 (05): : 456 - 471