Sequential Learning under Probabilistic Constraints

被引:0
|
作者
Meisami, Amirhossein [1 ]
Lam, Henry [2 ]
Dong, Chen [1 ]
Pani, Abhishek [1 ]
机构
[1] Adobe Inc, San Jose, CA 95110 USA
[2] Columbia Univ, New York, NY 10027 USA
关键词
RANDOMIZED SOLUTIONS; OPTIMIZATION; FEASIBILITY; ALGORITHM; PROGRAMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide the first study on online learning problems under stochastic constraints that are "soft", i.e., need to be satisfied with high probability. These constraints are imposed on all or some stages of the time horizon so that the stage decisions probabilistically satisfy some given safety conditions. The distributions that govern these conditions are learned through the collected observations. Under a Bayesian framework, we introduce a scheme that provides statistical feasibility guarantees through the time horizon, by using posterior Monte Carlo samples to form sampled constraints which leverage the scenario generation approach in chance-constrained programming. We demonstrate how our scheme can be integrated into Thompson sampling and illustrate it with an application in online advertisement.
引用
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [21] Optimization of chemical technology processes under probabilistic constraints
    T. V. Lapteva
    N. N. Ziyatdinov
    G. M. Ostrovskii
    D. D. Pervukhin
    Theoretical Foundations of Chemical Engineering, 2010, 44 : 651 - 659
  • [22] Dynamic probabilistic constraints under continuous random distributions
    T. González Grandón
    R. Henrion
    P. Pérez-Aros
    Mathematical Programming, 2022, 196 : 1065 - 1096
  • [23] Optimization of chemical technology processes under probabilistic constraints
    Lapteva, T. V.
    Ziyatdinov, N. N.
    Ostrovskii, G. M.
    Pervukhin, D. D.
    THEORETICAL FOUNDATIONS OF CHEMICAL ENGINEERING, 2010, 44 (05) : 651 - 659
  • [24] An adaptive sequential linear programming algorithm for optimal design problems with probabilistic constraints
    Chan, Kuei-Yuan
    Skerlos, Steven J.
    Papalambros, Panos Y.
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2005, VOL 2, PTS A AND B, 2005, : 1111 - 1121
  • [25] Learning sequential constraints of tasks from user demonstrations
    Pardowitz, M
    Zöllner, R
    Dillmann, R
    2005 5TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, 2005, : 424 - 429
  • [26] An adaptive sequential linear programming algorithm for optimal design problems with probabilistic constraints
    Chan, Kuei-Yuan
    Skerlos, Steven J.
    Papalambros, Panos
    JOURNAL OF MECHANICAL DESIGN, 2007, 129 (02) : 140 - 149
  • [27] Exploiting Opponents under Utility Constraints in Sequential Games
    Bernasconi-De-Luca, Martino
    Cacciamani, Federico
    Fioravanti, Simone
    Gatti, Nicola
    Marchesi, Alberto
    Trovo, Francesco
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [28] An LP for Sequential Learning Under Budgets
    Wang, Joseph
    Trapeznikov, Kirill
    Saligrama, Venkatesh
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 987 - 995
  • [29] Probabilistic Graphical Models Parameter Learning with Transferred Prior and Constraints
    Zhou, Yun
    Fenton, Norman
    Hospedales, Timothy M.
    Neil, Martin
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 972 - 981
  • [30] Probabilistic Learning of Torque Controllers from Kinematic and Force Constraints
    Silverio, Joao
    Huang, Yanlong
    Rozo, Leonel
    Calinon, Sylvain
    Caldwell, Darwin G.
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 6552 - 6559