Mini-batch stochastic approximation methods for nonconvex stochastic composite optimization

被引：222

作者：

Ghadimi, Saeed ^{[1
]}

Lan, Guanghui ^{[1
]}

Zhang, Hongchao ^{[2
]}

机构：

[1] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA

[2] Louisiana State Univ, Dept Math, Baton Rouge, LA 70803 USA

来源：

MATHEMATICAL PROGRAMMING | 2016年 / 155卷 / 1-2期

基金：

美国国家科学基金会;

关键词：

Constrained stochastic programming; Mini-batch of samples; Stochastic approximation; Nonconvex optimization; Stochastic programming; First-order method; Zeroth-order method; CONVEX; ALGORITHMS; GRADIENT; DESCENT;

D O I：

10.1007/s10107-014-0846-1

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper considers a class of constrained stochastic composite optimization problems whose objective function is given by the summation of a differentiable (possibly nonconvex) component, together with a certain non-differentiable (but convex) component. In order to solve these problems, we propose a randomized stochastic projected gradient (RSPG) algorithm, in which proper mini-batch of samples are taken at each iteration depending on the total budget of stochastic samples allowed. The RSPG algorithm also employs a general distance function to allow taking advantage of the geometry of the feasible region. Complexity of this algorithm is established in a unified setting, which shows nearly optimal complexity of the algorithm for convex stochastic programming. A post-optimization phase is also proposed to significantly reduce the variance of the solutions returned by the algorithm. In addition, based on the RSPG algorithm, a stochastic gradient free algorithm, which only uses the stochastic zeroth-order information, has been also discussed. Some preliminary numerical results are also provided.

引用

页码：267 / 305

页数：39

共 50 条

[21] Stochastic Variance-Reduced Algorithms for PCA with Arbitrary Mini-Batch Sizes
Kim, Cheolmin
Klabjan, Diego
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4302 - 4311
[22] ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-batch Similarities
Gao, Yunjun
Liu, Xiaoze
Wu, Junyang
Li, Tianyi
Wang, Pengfei
Chen, Lu
[J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 421 - 431
[23] Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization
Horvath, Samuel
Lei, Lihua
Richtarik, Peter
Jordan, Michael I.
[J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (02): : 634 - 648
[24] MBA: Mini-Batch AUC Optimization
Gultekin, San
Saha, Avishek
Ratnaparkhi, Adwait
Paisley, John
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5561 - 5574
[25] Inexact proximal stochastic second-order methods for nonconvex composite optimization
Wang, Xiao
Zhang, Hongchao
[J]. OPTIMIZATION METHODS & SOFTWARE, 2020, 35 (04): : 808 - 835
[26] LARGE-SCALE NONCONVEX STOCHASTIC OPTIMIZATION BY DOUBLY STOCHASTIC SUCCESSIVE CONVEX APPROXIMATION
Mokhtari, Aryan
Koppel, Alec
Scutari, Gesualdo
Ribeiro, Alejandro
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4701 - 4705
[27] High-Dimensional Nonconvex Stochastic Optimization by Doubly Stochastic Successive Convex Approximation
Mokhtari, Aryan
Koppel, Alec
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 6287 - 6302
[28] Adaptive Natural Gradient Method for Learning of Stochastic Neural Networks in Mini-Batch Mode
Park, Hyeyoung
Lee, Kwanyong
[J]. APPLIED SCIENCES-BASEL, 2019, 9 (21):
[29] A Mini-Batch Proximal Stochastic Recursive Gradient Algorithm with Diagonal Barzilai–Borwein Stepsize
Teng-Teng Yu
Xin-Wei Liu
Yu-Hong Dai
Jie Sun
[J]. Journal of the Operations Research Society of China, 2023, 11 : 277 - 307
[30] Efficient mini-batch stochastic gradient descent with Centroidal Voronoi Tessellation for PDE-constrained optimization under uncertainty
Chen, Liuhong
Xiong, Meixin
Ming, Ju
He, Xiaoming
[J]. PHYSICA D-NONLINEAR PHENOMENA, 2024, 467

← 1 2 3 4 5 →