An Optimal Algorithm for Online Non-Convex Learning

被引：10

作者：

Yang, Lin ^{[1
]}

Deng, Lei ^{[1
]}

Hajiesmaili, Mohammad H. ^{[2
]}

Tan, Cheng ^{[1
]}

Wong, Wing Shing ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Shatin, Hong Kong 999077, Peoples R China

[2] Johns Hopkins Univ, 3400 N Charles St, Baltimore, MD USA

来源：

PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS | 2018年 / 2卷 / 02期

关键词：

Online non-convex learning; online convex optimization; Lipschitz expert; regret; online recursive weighting;

D O I：

10.1145/3224420

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In many online learning paradigms, convexity plays a central role in the derivation and analysis of online learning algorithms. The results, however, fail to be extended to the non-convex settings, while non-convexity is necessitated by a large number of recent applications. The Online Non-Convex Learning (ONCL) problem generalizes the classic Online Convex Optimization (OCO) framework by relaxing the convexity assumption on the cost function (to a Lipschitz continuous function) and the decision set. The state-of-the-art result for the ONCL demonstrates that the classic online exponential weighting algorithm attains a sublinear regret of O(root T logT). The regret lower bound for the OCO, however, is Omega(root T), and to the best of our knowledge, there is no result in the context of the ONCL problem achieving the same bound. This paper proposes the Online Recursive Weighting (ORW) algorithm with regret of O(root T), matching the tight regret lower bound for the OCO problem, and fills the regret gap between the state-of-the-art results in the online convex and non-convex optimization problems.

引用

下载

页数：25

共 50 条

[1] An Optimal Algorithm for Online Non-Convex Learning
Yang L.
Deng L.
Hajiesmaili M.H.
Tan C.
Wong W.S.
2018, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (46): : 41 - 43
[2] Online Non-Convex Learning: Following the Perturbed Leader is Optimal
Suggala, Arun Sai
Netrapalli, Praneeth
ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 845 - 861
[3] Non-convex online learning via algorithmic equivalence
Ghai, Udaya
Lu, Zhou
Hazan, Elad
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[4] Online Learning with Non-Convex Losses and Non-Stationary Regret
Gao, Xiang
Li, Xiaobo
Zhang, Shuzhong
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[5] NO-REGRET NON-CONVEX ONLINE META-LEARNING
Zhuang, Zhenxun
Wang, Yunlong
Yu, Kezi
Lu, Songtao
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3942 - 3946
[6] Online Bandit Learning for a Special Class of Non-convex Losses
Zhang, Lijun
Yang, Tianbao
Jin, Rong
Zhou, Zhi-Hua
PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3158 - 3164
[7] Online non-convex learning for river pollution source identification
Huang, Wenjie
Jiang, Jing
Liu, Xiao
IISE TRANSACTIONS, 2023, 55 (03) : 229 - 241
[8] Optimal, Stochastic, Non-smooth, Non-convex Optimization through Online-to-Non-convex Conversion
Cutkosky, Ashok
Mehta, Harsh
Orabona, Francesco
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
[9] Penalty boundary sequential convex programming algorithm for non-convex optimal control problems
Zhang, Zhe
Jin, Gumin
Li, Jianxun
ISA TRANSACTIONS, 2018, 72 : 229 - 244
[10] Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization
Zhuang, Zhenxun
Cutkosky, Ashok
Orabona, Francesco
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97

← 1 2 3 4 5 →