Gaussian Two-Armed Bandit and Optimization of Batch Data Processing

被引:0
|
作者
A. V. Kolnogorov
机构
[1] Yaroslav-the-Wise Novgorod State University,Department of Applied Mathematics and Information Science
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
We consider the minimax setting for the two-armed bandit problem with normally distributed incomes having a priori unknown mathematical expectations and variances. This setting naturally arises in optimization of batch data processing where two alternative processing methods are available with different a priori unknown efficiencies. During the control process, it is required to determine the most efficient method and ensure its predominant application. We use the main theorem of game theory to search for minimax strategy and minimax risk as Bayesian ones corresponding to the worst-case prior distribution. To find them, a recursive integro-difference equation is obtained. We show that batch data processing almost does not increase the minimax risk if the number of batches is large enough.
引用
收藏
页码:84 / 100
页数:16
相关论文
共 50 条
  • [1] Gaussian Two-Armed Bandit and Optimization of Batch Data Processing
    Kolnogorov, A. V.
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2018, 54 (01) : 84 - 100
  • [2] Adaptive Normal Two-Armed Bandit and Data Processing Optimization
    Kolnogorov, Alexander V.
    [J]. IFAC PAPERSONLINE, 2016, 49 (13): : 241 - 246
  • [3] Gaussian Two-Armed Bandit: Limiting Description
    Kolnogorov, A. V.
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2020, 56 (03) : 278 - 301
  • [4] Gaussian Two-Armed Bandit: Limiting Description
    A. V. Kolnogorov
    [J]. Problems of Information Transmission, 2020, 56 : 278 - 301
  • [5] Two-armed bandit problem for parallel data processing systems
    Kolnogorov, A. V.
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2012, 48 (01) : 72 - 84
  • [6] Two-armed bandit problem for parallel data processing systems
    A. V. Kolnogorov
    [J]. Problems of Information Transmission, 2012, 48 : 72 - 84
  • [7] Two-Armed Bandit Problem and Batch Version of the Mirror Descent Algorithm
    Kolnogorov, A., V
    Nazin, A., V
    Shiyan, D. N.
    [J]. AUTOMATION AND REMOTE CONTROL, 2022, 83 (08) : 1288 - 1307
  • [8] Two-Armed Bandit Problem and Batch Version of the Mirror Descent Algorithm
    A. V. Kolnogorov
    A. V. Nazin
    D. N. Shiyan
    [J]. Automation and Remote Control, 2022, 83 : 1288 - 1307
  • [9] A Bayesian two-armed bandit model
    Wang, Xikui
    Liang, You
    Porth, Lysa
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (03) : 624 - 636
  • [10] Poissonian Two-Armed Bandit: A New Approach
    A. V. Kolnogorov
    [J]. Problems of Information Transmission, 2022, 58 : 160 - 183