A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback

被引:0
|
作者
Cayci, Semih [1 ]
Zheng, Yilin [2 ]
Eryilmaz, Atilla [2 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a wide variety of applications including online advertising, contractual hiring, and wireless scheduling, the controller is constrained by a stringent budget constraint on the available resources, which are consumed in a random amount by each action, and a stochastic feasibility constraint that may impose important operational limitations on decision-making In this work, we consider a general model to address such problems, where each action returns a random reward, cost, and penalty from an unknown joint distribution, and the decision-maker aims to maximize the total reward under a budget constraint B on the total cost and a stochastic constraint on the time-average penalty. We propose a novel low-complexity algorithm based on Lyapunov optimization methodology, named LyOn, and prove that for K arms it achieves O(root KB log B) regret and zero constraint violation when B is sufficiently large. The low computational cost and sharp performance bounds of LyOn suggest that Lyapunov-based algorithm design methodology can be effective in solving constrained bandit optimization problems.
引用
收藏
页码:3716 / 3723
页数:8
相关论文
共 50 条
  • [1] On the Optimality of Lyapunov-based Feedback Laws for Constrained Difference Inclusions
    Ferrante, Francesco
    Sanfelice, Ricardo G.
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3435 - 3440
  • [2] Lyapunov-Based Avoidance Controllers With Stabilizing Feedback
    Ballaben, Riccardo
    Braun, Philipp
    Zaccarian, Luca
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 862 - 867
  • [3] Lyapunov-based boundary feedback design for parabolic PDEs
    Karafyllis, Iasson
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1247 - 1260
  • [4] LYAPUNOV-BASED OUTPUT FEEDBACK CONTROL OF HYPERSONIC VEHICLE
    Xu, Haijing
    Jia, Yingmin
    [J]. 2014 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2014, : 253 - 258
  • [5] Lyapunov-Based Output Feedback Learning Control of Robot Manipulators
    Dogan, K. Merve
    Tatlicioglu, Enver
    Zergeroglu, Erkan
    Cetin, Kamil
    [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 5337 - 5342
  • [6] Lyapunov-based EMPC of constrained nonlinear systems with changing economic costs
    He, Defeng
    Xu, Shan
    [J]. ASIAN JOURNAL OF CONTROL, 2021, 23 (04) : 1772 - 1781
  • [7] Constrained distributed online convex optimization with bandit feedback for unbalanced digraphs
    Tada, Keishin
    Hayashi, Naoki
    Takai, Shigemasa
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (02): : 184 - 200
  • [8] Distributed Online Stochastic-Constrained Convex Optimization With Bandit Feedback
    Wang, Cong
    Xu, Shengyuan
    Yuan, Deming
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (01) : 63 - 75
  • [9] A Tutorial on a Lyapunov-Based Approach to the Analysis of Iterative Optimization Algorithms
    Van Scoy, Bryan
    Lessard, Laurent
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3003 - 3008
  • [10] Lyapunov-based boundary feedback control in multi-reach canals
    CEN LiHui & XI YuGeng Department of Automation
    [J]. Science China(Information Sciences), 2009, (07) : 1157 - 1164