Gradient-Variation Bound for Online Convex Optimization with Constraints

被引：0

作者：

Qiu, Shuang ^{[1
]}

Wei, Xiaohan ^{[2
]}

Kolar, Mladen ^{[1
]}

机构：

[1] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA

[2] Meta Platforms Inc, Menlo Pk, CA USA

来源：

THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8 | 2023年

关键词：

REGRET; O(1/T);

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study online convex optimization with constraints consisting of multiple functional constraints and a relatively simple constraint set, such as a Euclidean ball. As enforcing the constraints at each time step through projections is computationally challenging in general, we allow decisions to violate the functional constraints but aim to achieve a low regret and cumulative violation of the constraints over a horizon of T time steps. First-order methods achieve an O(root T) regret and an O(1) constraint violation, which is the best-known bound under the Slater's condition, but do not take into account the structural information of the problem. Furthermore, the existing algorithms and analysis are limited to Euclidean space. In this paper, we provide an instance-dependent bound for online convex optimization with complex constraints obtained by a novel online primal-dual mirror-prox algorithm. Our instance-dependent regret is quantified by the total gradient variation V*(T) in the sequence of loss functions. The proposed algorithm works in general normed spaces and simultaneously achieves an O(root V*(T)) regret and an O(1) constraint violation, which is never worse than the best-known O(root V*(T)) result and improves over previous works that applied mirror-prox-type algorithms for this problem achieving O(T-2/3) regret and constraint violation. Finally, our algorithm is computationally efficient, as it only performs mirror descent steps in each iteration instead of solving a general Lagrangian minimization problem.

引用

页码：9534 / 9542

页数：9

共 50 条

[21] Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints
Mahdavi, Mehrdad
Jin, Rong
Yang, Tianbao
JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2503 - 2528
[22] On the Time-Varying Constraints and Bandit Feedback of Online Convex Optimization
Cao, Xuanyu
Liu, K. J. Ray
2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
[23] Adaptive Algorithms for Online Convex Optimization with Long-term Constraints
Jenatton, Rodolphe
Huang, Jim C.
Archambeau, Cedric
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[24] Bregman-style Online Convex Optimization with Energy Harvesting Constraints
Asgari, Kamiar
Neely, Michael J.
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2020, 4 (03)
[25] Online Convex Optimization With Long-Term Constraints for Predictable Sequences
Muthirayan, Deepan
Yuan, Jianjun
Khargonekar, Pramod P.
IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 979 - 984
[26] Online Convex Optimization With Time-Varying Constraints and Bandit Feedback
Cao, Xuanyu
Liu, K. J. Ray
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (07) : 2665 - 2680
[27] Robust optimization with convex model considering bounded constraints on performance variation
Naigang Hu
Baoyan Duan
Hongjun Cao
Yali Zong
Structural and Multidisciplinary Optimization, 2017, 56 : 59 - 69
[28] Robust optimization with convex model considering bounded constraints on performance variation
Hu, Naigang
Duan, Baoyan
Cao, Hongjun
Zong, Yali
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2017, 56 (01) : 59 - 69
[29] Convex Optimization Formulation of Density Upper Bound Constraints in Markov Chain Synthesis
Demir, Nazli
Acikmese, Behcet
Harris, Matthew W.
2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 483 - 488
[30] ON CONVERGENCE RATE OF DISTRIBUTED STOCHASTIC GRADIENT ALGORITHM FOR CONVEX OPTIMIZATION WITH INEQUALITY CONSTRAINTS
Yuan, Deming
Ho, Daniel W. C.
Hong, Yiguang
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2016, 54 (05) : 2872 - 2892

← 1 2 3 4 5 →