Gaussian Two-Armed Bandit and Optimization of Batch Data Processing

Author
A. V. Kolnogorov
Affiliation
[1] Yaroslav-the-Wise Novgorod State University, Department of Applied Mathematics and Information Science
Abstract
We consider the minimax setting for the two-armed bandit problem with normally distributed incomes whose mathematical expectations and variances are unknown a priori. This setting arises naturally in the optimization of batch data processing, where two alternative processing methods are available with different, a priori unknown efficiencies. During the control process, the more efficient method must be identified and applied predominantly. We use the main theorem of game theory to seek the minimax strategy and minimax risk as the Bayesian ones corresponding to the worst-case prior distribution. To find them, we derive a recursive integro-difference equation. We show that batch data processing hardly increases the minimax risk if the number of batches is large enough.
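The batch setting described in the abstract can be illustrated with a small simulation. The sketch below is not the minimax strategy of the paper: it swaps in a simple UCB-style index that is recomputed once per batch, purely to show how one method is applied to a whole batch and how the loss (regret) is accounted for. The function name `run_batched_bandit`, its parameters, and the index rule are all hypothetical choices made for illustration.

```python
import math
import random


def run_batched_bandit(means, sigmas, horizon, n_batches, seed=0):
    """Simulate a two-armed Gaussian bandit processed in equal batches.

    Illustrative only: uses a UCB-style index (an assumption, not the
    minimax strategy from the paper). Returns (regret, counts), where
    regret is the expected income lost relative to always using the
    better arm, and counts holds the number of items sent to each arm.
    """
    if n_batches < 2 or horizon % n_batches != 0:
        raise ValueError("need n_batches >= 2 and horizon divisible by n_batches")
    rng = random.Random(seed)
    batch_size = horizon // n_batches
    counts = [0, 0]      # items processed by each method so far
    sums = [0.0, 0.0]    # total observed income per method
    regret = 0.0
    best = max(means)
    for b in range(n_batches):
        if b < 2:
            # The first two batches try each method once.
            arm = b
        else:
            t = counts[0] + counts[1]

            def index(a):
                # Sample mean plus an exploration bonus (hypothetical rule).
                return sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])

            arm = max((0, 1), key=index)
        # The whole batch is processed by the chosen method.
        for _ in range(batch_size):
            sums[arm] += rng.gauss(means[arm], sigmas[arm])
        counts[arm] += batch_size
        regret += batch_size * (best - means[arm])
    return regret, counts


if __name__ == "__main__":
    # Compare the regret for a few batch counts at a fixed horizon.
    for k in (10, 50, 250):
        r, c = run_batched_bandit((1.0, 0.8), (1.0, 1.0), horizon=5000, n_batches=k)
        print(k, r, c)
```

Running the script prints the regret for several batch counts at the same horizon, which gives a rough feel for how the number of batches affects the loss under this particular illustrative rule.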
Pages: 84-100
Page count: 16