Finite-sum smooth optimization with SARAH

被引:0
|
作者
Lam M. Nguyen
Marten van Dijk
Dzung T. Phan
Phuong Ha Nguyen
Tsui-Wei Weng
Jayant R. Kalagnanam
机构
[1] IBM Research,CWI
[2] Thomas J. Watson Research Center,undefined
[3] Computer Security Group,undefined
[4] eBay Inc.,undefined
[5] University of California San Diego,undefined
关键词
Finite-sum; Smooth; Non-convex; Convex; Stochastic algorithm; Variance reduction;
D O I
暂无
中图分类号
学科分类号
摘要
We introduce NC-SARAH for non-convex optimization as a practical modified version of the original SARAH algorithm that was developed for convex optimization. NC-SARAH is the first to achieve two crucial performance properties at the same time—allowing flexible minibatch sizes and large step sizes to achieve fast convergence in practice as verified by experiments. NC-SARAH has a close to optimal asymptotic convergence rate equal to existing prior variants of SARAH called SPIDER and SpiderBoost that either use an order of magnitude smaller step size or a fixed minibatch size. For convex optimization, we propose SARAH++ with sublinear convergence for general convex and linear convergence for strongly convex problems; and we provide a practical version for which numerical experiments on various datasets show an improved performance.
引用
收藏
页码:561 / 593
页数:32
相关论文
共 50 条
  • [1] Finite-sum smooth optimization with SARAH
    Nguyen, Lam M.
    van Dijk, Marten
    Phan, Dzung T.
    Nguyen, Phuong Ha
    Weng, Tsui-Wei
    Kalagnanam, Jayant R.
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2022, 82 (03) : 561 - 593
  • [2] Lower Bounds for Smooth Nonconvex Finite-Sum Optimization
    Zhou, Dongruo
    Gu, Quanquan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [3] AN OPTIMAL ALGORITHM FOR DECENTRALIZED FINITE-SUM OPTIMIZATION
    Hendrikx, Hadrien
    Bach, Francis
    Massoulie, Laurent
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2021, 31 (04) : 2753 - 2783
  • [4] Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization
    Hu, Quanqi
    Zhu, Dixian
    Yang, Tianbao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] On the Oracle Complexity of Higher-Order Smooth Non-Convex Finite-Sum Optimization
    Emmenegger, Nicolas
    Kyng, Rasmus
    Zehmakan, Ahad N.
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [6] ACCELERATED STOCHASTIC ALGORITHMS FOR NONCONVEX FINITE-SUM AND MULTIBLOCK OPTIMIZATION
    Lan, Guanghui
    Yang, Yu
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (04) : 2753 - 2784
  • [7] Proximal Stochastic Methods for Nonsmooth Nonconvex Finite-Sum Optimization
    Reddi, Sashank J.
    Sra, Suvrit
    Poczos, Barnabas
    Smola, Alexander J.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [8] Finite-Sum Coupled Compositional Stochastic Optimization: Theory and Applications
    Wang, Bokun
    Yang, Tianbao
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [9] RESILIENT TO BYZANTINE ATTACKS FINITE-SUM OPTIMIZATION OVER NETWORKS
    Wu, Zhaoxian
    Ling, Qing
    Chen, Tianyi
    Giannakis, Georgios B.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5910 - 5914
  • [10] An accelerated stochastic ADMM for nonconvex and nonsmooth finite-sum optimization
    Zeng, Yuxuan
    Wang, Zhiguo
    Bai, Jianchao
    Shen, Xiaojing
    [J]. AUTOMATICA, 2024, 163