Sequential Learning under Probabilistic Constraints

被引:0
|
作者
Meisami, Amirhossein [1 ]
Lam, Henry [2 ]
Dong, Chen [1 ]
Pani, Abhishek [1 ]
机构
[1] Adobe Inc, San Jose, CA 95110 USA
[2] Columbia Univ, New York, NY 10027 USA
关键词
RANDOMIZED SOLUTIONS; OPTIMIZATION; FEASIBILITY; ALGORITHM; PROGRAMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide the first study on online learning problems under stochastic constraints that are "soft", i.e., need to be satisfied with high probability. These constraints are imposed on all or some stages of the time horizon so that the stage decisions probabilistically satisfy some given safety conditions. The distributions that govern these conditions are learned through the collected observations. Under a Bayesian framework, we introduce a scheme that provides statistical feasibility guarantees through the time horizon, by using posterior Monte Carlo samples to form sampled constraints which leverage the scenario generation approach in chance-constrained programming. We demonstrate how our scheme can be integrated into Thompson sampling and illustrate it with an application in online advertisement.
引用
下载
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [11] LEARNING SEQUENTIAL PATTERNS FOR PROBABILISTIC INDUCTIVE PREDICTION
    CHAN, KCC
    WONG, AKC
    CHIU, DKY
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (10): : 1532 - 1547
  • [12] PROBABILISTIC DISCRIMINATION LEARNING OF A SEQUENTIAL REINFORCEMENT PATTERN
    BUGGIE, SE
    PSYCHONOMIC SCIENCE, 1969, 15 (06): : 309 - &
  • [13] Probabilistic Specification Learning for Planning with Safety Constraints
    Watanabe, Kandai
    Renninger, Nicholas
    Sankaranarayanan, Sriram
    Lahijanian, Morteza
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 6558 - 6565
  • [14] An Approach to Process Monitoring under Probabilistic Constraints
    Werk, Sebastian
    Barz, Tilman
    Wozny, Guenter
    Arellano-Garcia, Harvey
    22 EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2012, 30 : 1252 - 1256
  • [15] Reinforcement learning in a probabilistic learning task without time constraints
    Jablonska, Judyta
    Szumiec, Lukasz
    Parkitna, Jan Rodriguez
    PHARMACOLOGICAL REPORTS, 2019, 71 (06) : 1310 - 1310
  • [16] Sequential process control under capacity constraints
    Jang, W
    Shanthikumar, JG
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 155 (03) : 695 - 714
  • [17] Sequential Anomaly Detection Under Sampling Constraints
    Tsopelakos, Aristomenis
    Fellouris, Georgios
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (12) : 8126 - 8146
  • [18] Language acquisition and use: Learning and applying probabilistic constraints
    Seidenberg, MS
    SCIENCE, 1997, 275 (5306) : 1599 - 1603
  • [19] On Probabilistic Search Decisions under Searcher Motion Constraints
    Chung, Timothy H.
    ALGORITHMIC FOUNDATIONS OF ROBOTICS VIII, 2010, 57 : 501 - 516
  • [20] Dynamic probabilistic constraints under continuous random distributions
    Gonzalez Grandon, T.
    Henrion, R.
    Perez-Aros, P.
    MATHEMATICAL PROGRAMMING, 2022, 196 (1-2) : 1065 - 1096