Sequential Learning under Probabilistic Constraints

Cited by: 0

Authors:

Meisami, Amirhossein [1]

Lam, Henry [2]

Dong, Chen [1]

Pani, Abhishek [1]

Affiliations:

[1] Adobe Inc, San Jose, CA 95110 USA

[2] Columbia Univ, New York, NY 10027 USA

Source:

UNCERTAINTY IN ARTIFICIAL INTELLIGENCE | 2018

Keywords:

RANDOMIZED SOLUTIONS; OPTIMIZATION; FEASIBILITY; ALGORITHM; PROGRAMS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We provide the first study on online learning problems under stochastic constraints that are "soft", i.e., need to be satisfied with high probability. These constraints are imposed on all or some stages of the time horizon so that the stage decisions probabilistically satisfy some given safety conditions. The distributions that govern these conditions are learned through the collected observations. Under a Bayesian framework, we introduce a scheme that provides statistical feasibility guarantees through the time horizon, by using posterior Monte Carlo samples to form sampled constraints which leverage the scenario generation approach in chance-constrained programming. We demonstrate how our scheme can be integrated into Thompson sampling and illustrate it with an application in online advertisement.
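
The idea in the abstract (posterior samples turned into sampled constraints, combined with Thompson sampling) can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a toy Bernoulli bandit in which each arm has a Beta posterior over its reward rate and a separate Beta posterior over its violation rate, and the function names, the fallback rule, and all parameter values are hypothetical.

```python
import random


def scenario_feasible(risk_posteriors, threshold, n_scenarios, rng):
    """Mark an arm feasible only if every sampled violation rate is within threshold.

    risk_posteriors: list of (a, b) Beta parameters, one pair per arm (assumed model).
    """
    feasible = []
    for a, b in risk_posteriors:
        # Each posterior draw acts as one "scenario" of the unknown violation rate.
        samples = (rng.betavariate(a, b) for _ in range(n_scenarios))
        feasible.append(all(s <= threshold for s in samples))
    return feasible


def constrained_thompson_step(reward_posteriors, risk_posteriors,
                              threshold, n_scenarios, rng):
    """Pick one arm: Thompson sampling restricted to scenario-feasible arms."""
    feasible = scenario_feasible(risk_posteriors, threshold, n_scenarios, rng)
    if not any(feasible):
        # Hypothetical fallback (not from the paper): choose the arm with the
        # lowest posterior mean violation rate.
        means = [a / (a + b) for a, b in risk_posteriors]
        return means.index(min(means))
    # Standard Thompson draw for feasible arms; infeasible arms are excluded.
    draws = [
        rng.betavariate(a, b) if ok else float("-inf")
        for (a, b), ok in zip(reward_posteriors, feasible)
    ]
    return draws.index(max(draws))
```

With a risk posterior of Beta(1, 1000) for the first arm and Beta(1000, 1) for the second, the second arm is excluded even if its sampled reward is higher. Increasing `n_scenarios` makes the feasibility check more conservative, which loosely mirrors how the number of scenarios controls the confidence level in scenario-based chance-constrained programming.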

Pages: 621-631

Page count: 11

