Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring

被引:0
|
作者
Wan, Runzhe [1 ]
Liu, Yu [1 ]
McQueen, James [1 ]
Hains, Doug [1 ]
Song, Rui [1 ]
机构
[1] Amazon, Seattle, WA 98109 USA
关键词
Sequential Decision Making; A/B testing; Reinforcement learning;
D O I
10.1145/3580305.3599818
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the growing needs of online A/B testing to support the innovation in industry, the opportunity cost of running an experiment becomes non-negligible. Therefore, there is an increasing demand for an efficient continuous monitoring service that allows early stopping when appropriate. Classic statistical methods focus on hypothesis testing and are mostly developed for traditional highstake problems such as clinical trials, while experiments at online service companies typically have very different features and focuses. Motivated by the real needs, in this paper, we introduce a novel framework that we developed in Amazon to maximize customer experience and control opportunity cost. We formulate the problem as a Bayesian optimal sequential decision making problem that has a unified utility function. We discuss extensively practical design choices and considerations. We further introduce how to solve the optimal decision rule via Reinforcement Learning and scale the solution. We show the effectiveness of this novel approach compared with existing methods via a large-scale meta-analysis on experiments in Amazon.
引用
收藏
页码:5016 / 5027
页数:12
相关论文
共 50 条
  • [1] Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making
    Desai, Nishant
    Critch, Andrew
    Russell, Stuart
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [2] Bayesian Persuasion in Sequential Decision-Making
    Gan, Jiarui
    Majumdar, Rupak
    Radanovic, Goran
    Singla, Adish
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5025 - 5033
  • [3] Reinforcement learning with hierarchical decision-making
    Cohen, Shahar
    Maimon, Oded
    Khmlenitsky, Evgeni
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, 2006, : 177 - +
  • [4] An adaptive decision-making method with fuzzy Bayesian reinforcement learning for robot soccer
    Shi, Haobin
    Lin, Zhiqiang
    Zhang, Shuge
    Li, Xuesi
    Hwang, Kao-Shing
    [J]. INFORMATION SCIENCES, 2018, 436 : 268 - 281
  • [5] A Reinforcement Learning Approach for Sequential Decision-Making Process of Attacks in Smart Grid
    Ni, Zhen
    Paul, Shuva
    Zhong, Xiangnan
    Wei, Qinglai
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1622 - 1629
  • [6] Informing sequential clinical decision-making through reinforcement learning: an empirical study
    Susan M. Shortreed
    Eric Laber
    Daniel J. Lizotte
    T. Scott Stroup
    Joelle Pineau
    Susan A. Murphy
    [J]. Machine Learning, 2011, 84 : 109 - 136
  • [7] Informing sequential clinical decision-making through reinforcement learning: an empirical study
    Shortreed, Susan M.
    Laber, Eric
    Lizotte, Daniel J.
    Stroup, T. Scott
    Pineau, Joelle
    Murphy, Susan A.
    [J]. MACHINE LEARNING, 2011, 84 (1-2) : 109 - 136
  • [8] Distributed reinforcement learning for sequential decision making
    Rogova, G
    Scott, P
    Lolett, C
    [J]. PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II, 2002, : 1263 - 1268
  • [9] Variational Bayesian Decision-making for Continuous Utilities
    Kusmierczyk, Tomasz
    Sakaya, Joseph
    Klami, Arto
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [10] Decision analysis and reinforcement learning in surgical decision-making
    Loftus, Tyler J.
    Filiberto, Amanda C.
    Li, Yanjun
    Balch, Jeremy
    Cook, Allyson C.
    Tighe, Patrick J.
    Efron, Philip A.
    Upchurch, Gilbert R., Jr.
    Rashidi, Parisa
    Li, Xiaolin
    Bihorac, Azra
    [J]. SURGERY, 2020, 168 (02) : 253 - 266