Stochastic Zeroth-Order Optimization under Nonstationarity and Nonconvexity

被引:0
|
作者
Roy, Abhishek [1 ]
Balasubramanian, Krishnakumar [1 ]
Ghadimi, Saeed [2 ]
Mohapatra, Prasant [3 ]
机构
[1] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
[2] Univ Waterloo, Dept Management Sci, Waterloo, ON N2L 3G1, Canada
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
nonstationary and nonconvex optimization; regret measures; stochastic zeroth-order algorithms; online cubic-Newton method; MARKOV DECISION-PROCESSES; OPTIMAL RATES; CONVEX; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stochastic zeroth-order optimization algorithms have been predominantly analyzed under the assumption that the objective function being optimized is time-invariant. Motivated by dynamic matrix sensing and completion problems, and online reinforcement learning prob-lems, in this work, we propose and analyze stochastic zeroth-order optimization algorithms when the objective being optimized changes with time. Considering general nonconvex functions, we propose nonstationary versions of regret measures based on first-order and second-order optimal solutions, and provide the corresponding regret bounds. For the case of first-order optimal solution based regret measures, we provide regret bounds in both the low-and high-dimensional settings. For the case of second-order optimal solution based re-gret, we propose zeroth-order versions of the stochastic cubic-regularized Newton's method based on estimating the Hessian matrices in the bandit setting via second-order Gaussian Stein's identity. Our nonstationary regret bounds in terms of second-order optimal solu-tions have interesting consequences for avoiding saddle points in the nonstationary setting.
引用
收藏
页数:47
相关论文
共 50 条
  • [1] Stochastic Zeroth-Order Optimization under Nonstationarity and Nonconvexity
    Roy, Abhishek
    Balasubramanian, Krishnakumar
    Ghadimi, Saeed
    Mohapatra, Prasant
    [J]. Journal of Machine Learning Research, 2022, 23
  • [2] Stochastic Zeroth-order Optimization in High Dimensions
    Wang, Yining
    Du, Simon S.
    Balakrishnan, Sivaraman
    Singh, Aarti
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [3] Stochastic Zeroth-Order Riemannian Derivative Estimation and Optimization
    Li, Jiaxiang
    Balasubramanian, Krishnakumar
    Ma, Shiqian
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (02) : 1183 - 1211
  • [4] Adaptive Evolution Strategies for Stochastic Zeroth-Order Optimization
    He, Xiaoyu
    Zheng, Zibin
    Chen, Zefeng
    Zhou, Yuren
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (05): : 1271 - 1285
  • [5] Zeroth-order algorithms for stochastic distributed nonconvex optimization
    Yi, Xinlei
    Zhang, Shengjun
    Yang, Tao
    Johansson, Karl H.
    [J]. AUTOMATICA, 2022, 142
  • [6] Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization
    Liu, Sijia
    Kailkhura, Bhavya
    Chen, Pin-Yu
    Ting, Paishun
    Chang, Shiyu
    Amini, Lisa
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [7] Sequential stochastic blackbox optimization with zeroth-order gradient estimators
    Audet, Charles
    Bigeon, Jean
    Couderc, Romain
    Kokkolaras, Michael
    [J]. AIMS MATHEMATICS, 2023, 8 (11): : 25922 - 25956
  • [8] ZEROTH-ORDER STOCHASTIC PROJECTED GRADIENT DESCENT FOR NONCONVEX OPTIMIZATION
    Liu, Sijia
    Li, Xingguo
    Chen, Pin-Yu
    Haupt, Jarvis
    Amini, Lisa
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1179 - 1183
  • [9] A Generic Approach for Accelerating Stochastic Zeroth-Order Convex Optimization
    Yu, Xiaotian
    King, Irwin
    Lyu, Michael R.
    Yang, Tianbao
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3040 - 3046
  • [10] A zeroth-order algorithm for distributed optimization with stochastic stripe observations
    Yinghui Wang
    Xianlin Zeng
    Wenxiao Zhao
    Yiguang Hong
    [J]. Science China Information Sciences, 2023, 66