Bayesian Optimization for Policy Search via Online-Offline Experimentation

被引:0
|
作者
Letham, Benjamin [1 ]
Bakshy, Eytan [1 ]
机构
[1] Facebook, Menlo Pk, CA 94025 USA
关键词
Bayesian optimization; multi-task Gaussian process; policy search; A/B testing; multi-fidelity optimization; MULTIVARIATE; ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online field experiments are the gold-standard way of evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces-such as those found in recommendation and ranking problems-is often constrained by the limited number of experiments that can be run simultaneously. To alleviate these constraints, we augment online experiments with an offline simulator and apply multi-task Bayesian optimization to tune live machine learning systems. We describe practical issues that arise in these types of applications, including biases that arise from using a simulator and assumptions for the multi-task kernel. We measure empirical learning curves which show substantial gains from including data from biased offline experiments, and show how these learning curves are consistent with theoretical results for multi-task Gaussian process generalization. We find that improved kernel inference is a significant driver of multi-task generalization. Finally, we show several examples of Bayesian optimization efficiently tuning a live machine learning system by combining offline and online experiments.
引用
收藏
页数:30
相关论文
共 50 条
  • [31] Factored Contextual Policy Search with Bayesian Optimization
    Pinsler, Robert
    Karkus, Peter
    Kupesik, Andras
    Hsu, David
    Lee, Wee Sun
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 7242 - 7248
  • [32] Towards the ALICE Online-Offline (O2) control system
    Mrnjavac, Teo
    Barroso, Vasco Chibante
    23RD INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP 2018), 2019, 214
  • [33] FuzzStream: Fuzzy Data Stream Clustering Based on the Online-Offline Framework
    Lopes, Priscilla de Abreu
    Camargo, Heloisa de Arruda
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [34] Quality signaling strategies of experience goods in online-offline channel integration
    Gao, Ying
    Hu, Xiangpei
    Ji, Qingkai
    MANAGERIAL AND DECISION ECONOMICS, 2022, 43 (07) : 2967 - 2981
  • [35] An Online-Offline Computing Mode based on Apache Storm for Text Classification
    Jiang, Zhiying
    Hao, Guowang
    He, Yanlin
    Chen, Kai
    Wang, Yajie
    Zhu, Qunxiong
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3385 - 3390
  • [36] Device-Enhanced Password Protocols with Optimal Online-Offline Protection
    Jarecki, Stanislaw
    Krawczyk, Hugo
    Shirvanian, Maliheh
    Saxena, Nitesh
    ASIA CCS'16: PROCEEDINGS OF THE 11TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, : 177 - 188
  • [37] Developing Soft-Beacon as a Service Based on Online-Offline Positioning
    Al-Sahly, Abdullah Mohammed
    Al-Rubaian, Majed
    Al-Qurishi, Muhammad
    2019 2ND INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS & INFORMATION SECURITY (ICCAIS), 2019,
  • [38] Cannibalization or synergy? Consumers' channel selection in online-offline multichannel systems
    Kollmann, Tobias
    Kuckertz, Andreas
    Kayser, Ina
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2012, 19 (02) : 186 - 194
  • [39] Online-offline Consistency Exploration Based on the Alternating Direction Method of Multipliers
    Lv, Tianqi
    Song, Jiaming
    Wang, Xiaojuan
    Liu, Nianhao
    Chen, Mo
    2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE), 2016, : 617 - 620
  • [40] Efficient Clustering of Short Text Streams using Online-Offline Clustering
    Rakib, Md Rashadul Hasan
    Zeh, Norbert
    Milios, Evangelos
    PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,