An Optimal Algorithm for Online Non-Convex Learning

被引:10
|
作者
Yang, Lin [1 ]
Deng, Lei [1 ]
Hajiesmaili, Mohammad H. [2 ]
Tan, Cheng [1 ]
Wong, Wing Shing [1 ]
机构
[1] Chinese Univ Hong Kong, Shatin, Hong Kong 999077, Peoples R China
[2] Johns Hopkins Univ, 3400 N Charles St, Baltimore, MD USA
关键词
Online non-convex learning; online convex optimization; Lipschitz expert; regret; online recursive weighting;
D O I
10.1145/3224420
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In many online learning paradigms, convexity plays a central role in the derivation and analysis of online learning algorithms. The results, however, fail to be extended to the non-convex settings, while non-convexity is necessitated by a large number of recent applications. The Online Non-Convex Learning (ONCL) problem generalizes the classic Online Convex Optimization (OCO) framework by relaxing the convexity assumption on the cost function (to a Lipschitz continuous function) and the decision set. The state-of-the-art result for the ONCL demonstrates that the classic online exponential weighting algorithm attains a sublinear regret of O(root T logT). The regret lower bound for the OCO, however, is Omega(root T), and to the best of our knowledge, there is no result in the context of the ONCL problem achieving the same bound. This paper proposes the Online Recursive Weighting (ORW) algorithm with regret of O(root T), matching the tight regret lower bound for the OCO problem, and fills the regret gap between the state-of-the-art results in the online convex and non-convex optimization problems.
引用
下载
收藏
页数:25
相关论文
共 50 条
  • [1] An Optimal Algorithm for Online Non-Convex Learning
    Yang L.
    Deng L.
    Hajiesmaili M.H.
    Tan C.
    Wong W.S.
    2018, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (46): : 41 - 43
  • [2] Online Non-Convex Learning: Following the Perturbed Leader is Optimal
    Suggala, Arun Sai
    Netrapalli, Praneeth
    ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 845 - 861
  • [3] Non-convex online learning via algorithmic equivalence
    Ghai, Udaya
    Lu, Zhou
    Hazan, Elad
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] Online Learning with Non-Convex Losses and Non-Stationary Regret
    Gao, Xiang
    Li, Xiaobo
    Zhang, Shuzhong
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [5] NO-REGRET NON-CONVEX ONLINE META-LEARNING
    Zhuang, Zhenxun
    Wang, Yunlong
    Yu, Kezi
    Lu, Songtao
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3942 - 3946
  • [6] Online Bandit Learning for a Special Class of Non-convex Losses
    Zhang, Lijun
    Yang, Tianbao
    Jin, Rong
    Zhou, Zhi-Hua
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3158 - 3164
  • [7] Online non-convex learning for river pollution source identification
    Huang, Wenjie
    Jiang, Jing
    Liu, Xiao
    IISE TRANSACTIONS, 2023, 55 (03) : 229 - 241
  • [8] Optimal, Stochastic, Non-smooth, Non-convex Optimization through Online-to-Non-convex Conversion
    Cutkosky, Ashok
    Mehta, Harsh
    Orabona, Francesco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [9] Penalty boundary sequential convex programming algorithm for non-convex optimal control problems
    Zhang, Zhe
    Jin, Gumin
    Li, Jianxun
    ISA TRANSACTIONS, 2018, 72 : 229 - 244
  • [10] Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization
    Zhuang, Zhenxun
    Cutkosky, Ashok
    Orabona, Francesco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97