Stochastic Optimization for Nonconvex Problem With Inexact Hessian Matrix, Gradient, and Function

Cited by: 1
Authors
Liu, Liu [1 ]
Liu, Xuanqing [2 ]
Hsieh, Cho-Jui [3 ]
Tao, Dacheng [4 ]
Affiliations
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
[2] Amazon Web Serv AWS, Seattle, WA 98108 USA
[3] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
[4] Univ Sydney, Fac Engn, Sydney AI Ctr, Sch Comp Sci, Sydney, NSW 2008, Australia
Keywords
Adaptive regularization; stochastic optimization; trust region (TR); algorithms; regularization; complexity
DOI
10.1109/TNNLS.2023.3326177
CLC Classification Number
TP18 [Theory of artificial intelligence]
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Trust region (TR) and adaptive regularization using cubics (ARC) have proven to possess appealing theoretical properties for nonconvex optimization, concurrently computing the function value, gradient, and Hessian matrix to obtain the next search direction and adjust the algorithm parameters. Although stochastic approximations greatly reduce the computational cost, it is challenging to guarantee the convergence rate theoretically. In this article, we explore a family of stochastic TR (STR) and stochastic ARC (SARC) methods that simultaneously allow inexact computations of the Hessian matrix, gradient, and function value. Our algorithms require far fewer propagations, and hence much less overhead, per iteration than TR and ARC. We prove that the iteration complexity to achieve epsilon-approximate second-order optimality is of the same order as that established for exact computations in previous studies. In addition, the mild conditions on inexactness can be met by a random sampling technique in the finite-sum minimization problem. Numerical experiments on a nonconvex problem support these findings and demonstrate that, with the same or a similar number of iterations, our algorithms require much less computational overhead per iteration than current second-order methods.
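To make the subsampling idea in the abstract concrete, below is a minimal, hypothetical Python sketch of a SARC-style loop on a toy finite-sum problem. It is not the authors' algorithm: the toy objective, batch sizes, acceptance threshold, and the crude gradient-descent solver for the cubic-regularized model are all illustrative assumptions. It only shows the mechanism the abstract describes: independent mini-batches give inexact estimates of the function value, gradient, and Hessian; an approximate minimizer of the cubic model gives the trial step; and the regularization parameter is adjusted from the ratio of actual to predicted decrease.

```python
# Hypothetical SARC-style sketch for a finite-sum problem
#   f(x) = (1/n) * sum_i f_i(x),  f_i(x) = 0.5*(a_i^T x - b_i)^2 + 0.05*sin(||x||^2).
# Toy objective, batch sizes, and the crude subproblem solver are illustrative
# assumptions, not constants or procedures from the paper.
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 10
A = rng.standard_normal((n, d))
b = rng.standard_normal(n)

def f_i(x, i):                               # per-sample loss (nonconvex sine term)
    r = A[i] @ x - b[i]
    return 0.5 * r**2 + 0.05 * np.sin(x @ x)

def grad_i(x, i):                            # per-sample gradient
    r = A[i] @ x - b[i]
    return r * A[i] + 0.1 * np.cos(x @ x) * x

def hess_i(x, i):                            # per-sample Hessian
    c = x @ x
    return (np.outer(A[i], A[i])
            + 0.1 * np.cos(c) * np.eye(d)
            - 0.2 * np.sin(c) * np.outer(x, x))

def subsampled(x, batch, fn):
    """Average fn over a random index batch: an inexact estimate of the full-sum quantity."""
    return sum(fn(x, i) for i in batch) / len(batch)

def solve_cubic_model(g, H, sigma, iters=50):
    """Crude gradient descent on m(s) = g^T s + 0.5 s^T H s + (sigma/3)*||s||^3."""
    s = np.zeros_like(g)
    lr = 1.0 / (np.linalg.norm(H, 2) + sigma + 1.0)
    for _ in range(iters):
        s -= lr * (g + H @ s + sigma * np.linalg.norm(s) * s)
    return s

x, sigma = rng.standard_normal(d), 1.0
for k in range(30):
    # Independent mini-batches -> inexact function, gradient, and Hessian estimates.
    bf, bg, bh = (rng.choice(n, size=m, replace=False) for m in (64, 64, 32))
    fk, gk, Hk = subsampled(x, bf, f_i), subsampled(x, bg, grad_i), subsampled(x, bh, hess_i)

    s = solve_cubic_model(gk, Hk, sigma)
    predicted = -(gk @ s + 0.5 * s @ Hk @ s + sigma / 3.0 * np.linalg.norm(s)**3)
    actual = fk - subsampled(x + s, bf, f_i)  # reuse the function batch for comparability
    rho = actual / max(predicted, 1e-12)

    if rho > 0.1:                             # successful step: accept, relax regularization
        x, sigma = x + s, max(sigma / 2.0, 1e-3)
    else:                                     # unsuccessful: reject, tighten regularization
        sigma *= 2.0
    print(f"iter {k:2d}  sigma {sigma:7.3f}  batch f(x) {fk:.4f}")
```

In the paper, the convergence guarantee hinges on conditions that the inexact estimates must satisfy; the fixed batch sizes and thresholds above merely stand in for those conditions and would need to be chosen accordingly in practice.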
Pages: 1-13
Page count: 13
Related Papers
50 records in total
  • [31] An inexact modified subgradient algorithm for nonconvex optimization
    Burachik, Regina S.
    Kaya, C. Yalcin
    Mammadov, Musa
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2010, 45 (01) : 1 - 24
  • [32] A Hybrid and Inexact Algorithm for Nonconvex and Nonsmooth Optimization
    Wang, Yiyang
    Song, Xiaoliang
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2024
  • [33] Gradient-Free Methods for Deterministic and Stochastic Nonsmooth Nonconvex Optimization
    Lin, Tianyi
    Zheng, Zeyu
    Jordan, Michael I.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [34] STOCHASTIC GENERALIZED-DIFFERENTIABLE FUNCTIONS IN THE PROBLEM OF NONCONVEX NONSMOOTH STOCHASTIC OPTIMIZATION
    NORKIN, VI
    CYBERNETICS, 1986, 22 (06): : 804 - 809
  • [35] Online Distributed Stochastic Gradient Algorithm for Nonconvex Optimization With Compressed Communication
    Li, Jueyou
    Li, Chaojie
    Fan, Jing
    Huang, Tingwen
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 936 - 951
  • [36] Second-Order Guarantees of Stochastic Gradient Descent in Nonconvex Optimization
    Vlaski, Stefan
    Sayed, Ali H.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (12) : 6489 - 6504
  • [37] pbSGD: Powered Stochastic Gradient Descent Methods for Accelerated Nonconvex Optimization
    Zhou, Beitong
    Liu, Jun
    Sun, Weigao
    Chen, Ruijuan
    Tomlin, Claire
    Yuan, Ye
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3258 - 3266
  • [38] Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization
    Chen, Lesi
    Xu, Jing
    Luo, Luo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [39] ZEROTH-ORDER STOCHASTIC PROJECTED GRADIENT DESCENT FOR NONCONVEX OPTIMIZATION
    Liu, Sijia
    Li, Xingguo
    Chen, Pin-Yu
    Haupt, Jarvis
    Amini, Lisa
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1179 - 1183
  • [40] Adaptive regularization for nonconvex optimization using inexact function values and randomly perturbed derivatives
    Bellavia, S.
    Gurioli, G.
    Morini, B.
    Toint, Ph. L.
    JOURNAL OF COMPLEXITY, 2022, 68