MULTILEVEL COMPOSITE STOCHASTIC OPTIMIZATION VIA NESTED VARIANCE REDUCTION

被引:16
|
作者
Zhang, Junyu [1 ]
Xiao, Lin [2 ]
机构
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] Microsoft Res, Redmond, WA 98052 USA
关键词
composite stochastic optimization; proximal gradient method; variance reduction; GRADIENT METHODS;
D O I
10.1137/19M1285457
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We consider multilevel composite optimization problems where each mapping in the composition is the expectation over a family of randomly chosen smooth mappings or the sum of some finite number of smooth mappings. We present a normalized proximal approximate gradient method where the approximate gradients are obtained via nested stochastic variance reduction. In order to find an approximate stationary point where the expected norm of its gradient mapping is less than epsilon, the total sample complexity of our method is O(epsilon(-3)) in the expectation case and O(N + root N epsilon(-2)) in the finite-sum case where N is the total number of functions across all composition levels. In addition, the dependence of our total sample complexity on the number of composition levels is polynomial, rather than exponential as in previous work.
引用
收藏
页码:1131 / 1157
页数:27
相关论文
共 50 条
  • [1] Stochastic Nested Variance Reduction for Nonconvex Optimization
    Zhou, Dongruo
    Xu, Pan
    Gu, Quanquan
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [2] Stochastic Nested Variance Reduction for Nonconvex Optimization
    Zhou, Dongruo
    Xu, Pan
    Gu, Quanquan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] MULTILEVEL STOCHASTIC GRADIENT METHODS FOR NESTED COMPOSITION OPTIMIZATION
    Yang, Shuoguang
    Wang, Mengdi
    Fang, Ethan X.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (01) : 616 - 659
  • [4] Stochastic Variance Reduction for Nonconvex Optimization
    Reddi, Sashank J.
    Hefny, Ahmed
    Sra, Suvrit
    Poczos, Barnabas
    Smola, Alex
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [5] Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise
    Kulunchakov, Andrei
    Mairal, Julien
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [6] Estimate sequences for stochastic composite optimization: Variance reduction, acceleration, and robustness to noise
    Kulunchakov, Andrei
    Mairal, Julien
    [J]. Journal of Machine Learning Research, 2020, 21
  • [7] A MULTILEVEL APPROACH TO VARIANCE REDUCTION IN THE STOCHASTIC ESTIMATION OF THE TRACE OF A MATRIX
    Frommer, Andreas
    Khalil, Mostafa Nasr
    Ramirez-Hidalgo, Gustavo
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2022, 44 (04): : A2536 - A2556
  • [8] Optimal methods for convex nested stochastic composite optimization
    Zhang, Zhe
    Lan, Guanghui
    [J]. MATHEMATICAL PROGRAMMING, 2024,
  • [9] A Stochastic Composite Gradient Method with Incremental Variance Reduction
    Zhang, Junyu
    Xiao, Lin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [10] Stochastic variable metric proximal gradient with variance reduction for non-convex composite optimization
    Gersende Fort
    Eric Moulines
    [J]. Statistics and Computing, 2023, 33 (3)