Mini-batch stochastic approximation methods for nonconvex stochastic composite optimization

被引:222
|
作者
Ghadimi, Saeed [1 ]
Lan, Guanghui [1 ]
Zhang, Hongchao [2 ]
机构
[1] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
[2] Louisiana State Univ, Dept Math, Baton Rouge, LA 70803 USA
基金
美国国家科学基金会;
关键词
Constrained stochastic programming; Mini-batch of samples; Stochastic approximation; Nonconvex optimization; Stochastic programming; First-order method; Zeroth-order method; CONVEX; ALGORITHMS; GRADIENT; DESCENT;
D O I
10.1007/s10107-014-0846-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper considers a class of constrained stochastic composite optimization problems whose objective function is given by the summation of a differentiable (possibly nonconvex) component, together with a certain non-differentiable (but convex) component. In order to solve these problems, we propose a randomized stochastic projected gradient (RSPG) algorithm, in which proper mini-batch of samples are taken at each iteration depending on the total budget of stochastic samples allowed. The RSPG algorithm also employs a general distance function to allow taking advantage of the geometry of the feasible region. Complexity of this algorithm is established in a unified setting, which shows nearly optimal complexity of the algorithm for convex stochastic programming. A post-optimization phase is also proposed to significantly reduce the variance of the solutions returned by the algorithm. In addition, based on the RSPG algorithm, a stochastic gradient free algorithm, which only uses the stochastic zeroth-order information, has been also discussed. Some preliminary numerical results are also provided.
引用
收藏
页码:267 / 305
页数:39
相关论文
共 50 条
  • [21] Stochastic Variance-Reduced Algorithms for PCA with Arbitrary Mini-Batch Sizes
    Kim, Cheolmin
    Klabjan, Diego
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4302 - 4311
  • [22] ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities
    Gao, Yunjun
    Liu, Xiaoze
    Wu, Junyang
    Li, Tianyi
    Wang, Pengfei
    Chen, Lu
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 421 - 431
  • [23] Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization
    Horvath, Samuel
    Lei, Lihua
    Richtarik, Peter
    Jordan, Michael I.
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (02): : 634 - 648
  • [24] MBA: Mini-Batch AUC Optimization
    Gultekin, San
    Saha, Avishek
    Ratnaparkhi, Adwait
    Paisley, John
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5561 - 5574
  • [25] Inexact proximal stochastic second-order methods for nonconvex composite optimization
    Wang, Xiao
    Zhang, Hongchao
    [J]. OPTIMIZATION METHODS & SOFTWARE, 2020, 35 (04): : 808 - 835
  • [26] LARGE-SCALE NONCONVEX STOCHASTIC OPTIMIZATION BY DOUBLY STOCHASTIC SUCCESSIVE CONVEX APPROXIMATION
    Mokhtari, Aryan
    Koppel, Alec
    Scutari, Gesualdo
    Ribeiro, Alejandro
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4701 - 4705
  • [27] High-Dimensional Nonconvex Stochastic Optimization by Doubly Stochastic Successive Convex Approximation
    Mokhtari, Aryan
    Koppel, Alec
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 6287 - 6302
  • [28] Adaptive Natural Gradient Method for Learning of Stochastic Neural Networks in Mini-Batch Mode
    Park, Hyeyoung
    Lee, Kwanyong
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (21):
  • [29] A Mini-Batch Proximal Stochastic Recursive Gradient Algorithm with Diagonal Barzilai–Borwein Stepsize
    Teng-Teng Yu
    Xin-Wei Liu
    Yu-Hong Dai
    Jie Sun
    [J]. Journal of the Operations Research Society of China, 2023, 11 : 277 - 307
  • [30] Efficient mini-batch stochastic gradient descent with Centroidal Voronoi Tessellation for PDE-constrained optimization under uncertainty
    Chen, Liuhong
    Xiong, Meixin
    Ming, Ju
    He, Xiaoming
    [J]. PHYSICA D-NONLINEAR PHENOMENA, 2024, 467