Gaussian Two-Armed Bandit and Optimization of Batch Data Processing

Cited by: 0

Author:

A. V. Kolnogorov

Affiliation:

[1] Yaroslav-the-Wise Novgorod State University, Department of Applied Mathematics and Information Science

Source:

Problems of Information Transmission | 2018 / Vol. 54

Abstract:

We consider the minimax setting for the two-armed bandit problem with normally distributed incomes whose mathematical expectations and variances are unknown a priori. This setting arises naturally in the optimization of batch data processing, where two alternative processing methods are available with different, a priori unknown efficiencies. During the control process, it is required to determine the more efficient method and ensure its predominant application. We use the main theorem of game theory to search for the minimax strategy and minimax risk as the Bayesian ones corresponding to the worst-case prior distribution. To find them, a recursive integro-difference equation is obtained. We show that batch data processing barely increases the minimax risk if the number of batches is large enough.

Pages: 84-100

Page count: 16
