Gradient-Variation Bound for Online Convex Optimization with Constraints

Cited by: 0
Authors
Qiu, Shuang [1]
Wei, Xiaohan [2]
Kolar, Mladen [1]
Affiliations
[1] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA
[2] Meta Platforms Inc, Menlo Pk, CA USA
Keywords
REGRET; O(1/T);
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
We study online convex optimization with constraints consisting of multiple functional constraints and a relatively simple constraint set, such as a Euclidean ball. As enforcing the constraints at each time step through projections is computationally challenging in general, we allow decisions to violate the functional constraints but aim to achieve low regret and low cumulative constraint violation over a horizon of $T$ time steps. First-order methods achieve an $O(\sqrt{T})$ regret and an $O(1)$ constraint violation, which is the best-known bound under Slater's condition, but they do not take into account the structural information of the problem. Furthermore, the existing algorithms and analyses are limited to Euclidean space. In this paper, we provide an instance-dependent bound for online convex optimization with complex constraints, obtained by a novel online primal-dual mirror-prox algorithm. Our instance-dependent regret is quantified by the total gradient variation $V_*(T)$ in the sequence of loss functions. The proposed algorithm works in general normed spaces and simultaneously achieves an $O(\sqrt{V_*(T)})$ regret and an $O(1)$ constraint violation, which is never worse than the best-known $(O(\sqrt{T}), O(1))$ result and improves over previous works that applied mirror-prox-type algorithms to this problem and achieved $O(T^{2/3})$ regret and constraint violation. Finally, our algorithm is computationally efficient, as it only performs mirror descent steps in each iteration instead of solving a general Lagrangian minimization problem.
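To make the quantities in the abstract concrete, the following is a minimal sketch and not the paper's algorithm. It assumes linear losses, a single linear functional constraint, the Euclidean norm, and a unit-ball constraint set; the function names, the step size `eta`, and the use of the previous gradient as a prediction are all illustrative assumptions. For linear losses the gradient does not depend on the decision, so the gradient variation reduces to a sum of squared distances between consecutive cost vectors.

```python
import numpy as np

def project_ball(x, radius=1.0):
    """Euclidean projection onto the ball {x : ||x||_2 <= radius}."""
    norm = np.linalg.norm(x)
    return x if norm <= radius else x * (radius / norm)

def gradient_variation(grads):
    """Sum of squared L2 distances between consecutive gradients.

    Assumes linear losses, so each gradient is a fixed vector and no
    maximization over the decision set is needed.
    """
    diffs = np.diff(np.asarray(grads, dtype=float), axis=0)
    return float(np.sum(diffs ** 2))

def optimistic_primal_dual(costs, a, b, eta=0.1, radius=1.0):
    """Illustrative optimistic primal-dual loop (hypothetical, Euclidean only).

    Losses are <c_t, x>; the functional constraint is <a, x> - b <= 0.
    The previous Lagrangian gradient serves as the prediction (hint).
    """
    d = costs.shape[1]
    y = np.zeros(d)     # secondary iterate updated with the true gradient
    lam = 0.0           # dual variable for the functional constraint
    hint = np.zeros(d)  # predicted gradient for the next round
    decisions = []
    for c in costs:
        # Play a decision using the hint, projecting only onto the simple set.
        x = project_ball(y - eta * hint, radius)
        decisions.append(x)
        # Gradient of the Lagrangian <c, x> + lam * (<a, x> - b) in x.
        g = c + lam * a
        y = project_ball(y - eta * g, radius)
        # Dual ascent on the observed violation, kept nonnegative.
        lam = max(0.0, lam + eta * (float(a @ x) - b))
        hint = g
    return np.array(decisions), lam
```

When consecutive cost vectors are close, `gradient_variation` is small and the hint is nearly accurate, which is the intuition behind a regret bound that scales with the variation rather than with the horizon. Each round uses only a projection onto the ball, never a projection onto the functional constraint.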
Pages: 9534-9542
Number of pages: 9
Related papers
50 in total
  • [21] Trading Regret for Efficiency: Online Convex Optimization with Long Term Constraints
    Mahdavi, Mehrdad
    Jin, Rong
    Yang, Tianbao
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2503 - 2528
  • [22] On the Time-Varying Constraints and Bandit Feedback of Online Convex Optimization
    Cao, Xuanyu
    Liu, K. J. Ray
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [23] Adaptive Algorithms for Online Convex Optimization with Long-term Constraints
    Jenatton, Rodolphe
    Huang, Jim C.
    Archambeau, Cedric
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [24] Bregman-style Online Convex Optimization with Energy Harvesting Constraints
    Asgari, Kamiar
    Neely, Michael J.
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2020, 4 (03)
  • [25] Online Convex Optimization With Long-Term Constraints for Predictable Sequences
    Muthirayan, Deepan
    Yuan, Jianjun
    Khargonekar, Pramod P.
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 979 - 984
  • [26] Online Convex Optimization With Time-Varying Constraints and Bandit Feedback
    Cao, Xuanyu
    Liu, K. J. Ray
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (07) : 2665 - 2680
  • [27] Robust optimization with convex model considering bounded constraints on performance variation
    Naigang Hu
    Baoyan Duan
    Hongjun Cao
    Yali Zong
    Structural and Multidisciplinary Optimization, 2017, 56 : 59 - 69
  • [28] Robust optimization with convex model considering bounded constraints on performance variation
    Hu, Naigang
    Duan, Baoyan
    Cao, Hongjun
    Zong, Yali
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2017, 56 (01) : 59 - 69
  • [29] Convex Optimization Formulation of Density Upper Bound Constraints in Markov Chain Synthesis
    Demir, Nazli
    Acikmese, Behcet
    Harris, Matthew W.
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 483 - 488
  • [30] ON CONVERGENCE RATE OF DISTRIBUTED STOCHASTIC GRADIENT ALGORITHM FOR CONVEX OPTIMIZATION WITH INEQUALITY CONSTRAINTS
    Yuan, Deming
    Ho, Daniel W. C.
    Hong, Yiguang
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2016, 54 (05) : 2872 - 2892