Gaussian Two-Armed Bandit: Limiting Description

被引:0
|
作者
A. V. Kolnogorov
机构
[1] Yaroslav-the-Wise Novgorod State University,Department of Applied Mathematics and Information Science
来源
关键词
Gaussian two-armed bandit; minimax and Bayesian approaches; batch processing; asymptotic minimax theorem;
D O I
暂无
中图分类号
学科分类号
摘要
For a Gaussian two-armed bandit, which arises when batch data processing is analyzed, the minimax risk limiting behavior is investigated as the control horizon N grows infinitely. The minimax risk is searched for as the Bayesian one computed with respect to the worst-case prior distribution. We show that the highest requirements are imposed on the control in the domain of "close” distributions where mathematical expectations of incomes differ by a quantity of the order of N−1/2. In the domain of "close” distributions, we obtain a recursive integro-difference equation for finding the Bayesian risk with respect to the worst-case prior distribution, in invariant form with control horizon one, and also a second-order partial differential equation in the limiting case. The results allow us to estimate the performance of batch processing. For example, the minimax risk corresponding to batch processing of data partitioned into 50 batches can be only 2% greater than its limiting value when the number of batches grows infinitely. In the case of a Bernoulli two-armed bandit, we show that optimal one-by-one data processing is not more efficient than batch processing as N grows infinitely.
引用
收藏
页码:278 / 301
页数:23
相关论文
共 50 条
  • [1] Gaussian Two-Armed Bandit: Limiting Description
    Kolnogorov, A. V.
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2020, 56 (03) : 278 - 301
  • [2] Gaussian Two-Armed Bandit and Optimization of Batch Data Processing
    A. V. Kolnogorov
    [J]. Problems of Information Transmission, 2018, 54 : 84 - 100
  • [3] Gaussian Two-Armed Bandit and Optimization of Batch Data Processing
    Kolnogorov, A. V.
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2018, 54 (01) : 84 - 100
  • [4] A Bayesian two-armed bandit model
    Wang, Xikui
    Liang, You
    Porth, Lysa
    [J]. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (03) : 624 - 636
  • [5] Poissonian Two-Armed Bandit: A New Approach
    A. V. Kolnogorov
    [J]. Problems of Information Transmission, 2022, 58 : 160 - 183
  • [6] Poissonian Two-Armed Bandit: A New Approach
    Kolnogorov, A., V
    [J]. PROBLEMS OF INFORMATION TRANSMISSION, 2022, 58 (02) : 160 - 183
  • [7] Noradrenergic Regulation of Two-Armed Bandit Performance
    Swanson, Kyra
    Averbeck, Bruno B.
    Laubach, Mark
    [J]. BEHAVIORAL NEUROSCIENCE, 2022, 136 (01) : 84 - 99
  • [8] Minimax lower bounds for the two-armed bandit problem
    Kulkarni, SR
    Lugosi, G
    [J]. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2293 - 2297
  • [9] On the Conjecture of Berry Regarding a Bernoulli Two-Armed Bandit
    Zhang, Jichen
    Wu, Panyu
    [J]. MATHEMATICS, 2023, 11 (03)
  • [10] When can the two-armed bandit algorithm be trusted?
    Lamberton, D
    Pagès, G
    Tarrès, P
    [J]. ANNALS OF APPLIED PROBABILITY, 2004, 14 (03): : 1424 - 1454