Zeroth-Order Nonconvex Stochastic Optimization: Handling Constraints, High Dimensionality, and Saddle Points

被引:0
|
作者
Krishnakumar Balasubramanian
Saeed Ghadimi
机构
[1] University of California,Department of Statistics
[2] University of Waterloo,Department of Management Sciences
关键词
Zeroth-order methods; Nonconvex optimization; Stochastic optimization; Complexity bounds; Conditional gradient methods; Newton method; 90C15; 90C26; 90C56; 49M15; 65K05;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose and analyze zeroth-order stochastic approximation algorithms for nonconvex and convex optimization, with a focus on addressing constrained optimization, high-dimensional setting, and saddle point avoiding. To handle constrained optimization, we first propose generalizations of the conditional gradient algorithm achieving rates similar to the standard stochastic gradient algorithm using only zeroth-order information. To facilitate zeroth-order optimization in high dimensions, we explore the advantages of structural sparsity assumptions. Specifically, (i) we highlight an implicit regularization phenomenon where the standard stochastic gradient algorithm with zeroth-order information adapts to the sparsity of the problem at hand by just varying the step size and (ii) propose a truncated stochastic gradient algorithm with zeroth-order information, whose rate of convergence depends only poly-logarithmically on the dimensionality. We next focus on avoiding saddle points in nonconvex setting. Toward that, we interpret the Gaussian smoothing technique for estimating gradient based on zeroth-order information as an instantiation of first-order Stein’s identity. Based on this, we provide a novel linear-(in dimension) time estimator of the Hessian matrix of a function using only zeroth-order information, which is based on second-order Stein’s identity. We then provide a zeroth-order variant of cubic regularized Newton method for avoiding saddle points and discuss its rate of convergence to local minima.
引用
收藏
页码:35 / 76
页数:41
相关论文
共 50 条
  • [21] Sequential stochastic blackbox optimization with zeroth-order gradient estimators
    Audet, Charles
    Bigeon, Jean
    Couderc, Romain
    Kokkolaras, Michael
    [J]. AIMS MATHEMATICS, 2023, 8 (11): : 25922 - 25956
  • [22] A Generic Approach for Accelerating Stochastic Zeroth-Order Convex Optimization
    Yu, Xiaotian
    King, Irwin
    Lyu, Michael R.
    Yang, Tianbao
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3040 - 3046
  • [23] A zeroth-order algorithm for distributed optimization with stochastic stripe observations
    Yinghui Wang
    Xianlin Zeng
    Wenxiao Zhao
    Yiguang Hong
    [J]. Science China Information Sciences, 2023, 66
  • [24] A zeroth-order algorithm for distributed optimization with stochastic stripe observations
    Wang, Yinghui
    Zeng, Xianlin
    Zhao, Wenxiao
    Hong, Yiguang
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (09)
  • [25] A zeroth-order algorithm for distributed optimization with stochastic stripe observations
    Yinghui WANG
    Xianlin ZENG
    Wenxiao ZHAO
    Yiguang HONG
    [J]. Science China(Information Sciences), 2023, 66 (09) : 297 - 298
  • [26] Adaptive Zeroth-Order Optimisation of Nonconvex Composite Objectives
    Shao, Weijia
    Albayrak, Sahin
    [J]. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2022, PT I, 2023, 13810 : 573 - 595
  • [27] A Proximal Zeroth-Order Algorithm for Nonconvex Nonsmooth Problems
    Kazemi, Ehsan
    Wang, Liqiang
    [J]. 2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 64 - 71
  • [28] A ZEROTH-ORDER PROXIMAL STOCHASTIC GRADIENT METHOD FOR WEAKLY CONVEX STOCHASTIC OPTIMIZATION
    Pougkakiotis, Spyridon
    Kalogerias, Dionysis
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2023, 45 (05): : A2679 - A2702
  • [29] Communication-Efficient Stochastic Zeroth-Order Optimization for Federated Learning
    Fang, Wenzhi
    Yu, Ziyi
    Jiang, Yuning
    Shi, Yuanming
    Jones, Colin N.
    Zhou, Yong
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2022, 70 : 5058 - 5073
  • [30] Certified Multifidelity Zeroth-Order Optimization
    de Montbrun, Etienne
    Gerchinovitz, Sebastien
    [J]. SIAM-ASA Journal on Uncertainty Quantification, 2024, 12 (04): : 1135 - 1164