50 entries in total
- [3] Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem) [J]. Automation and Remote Control, 2011, 72 : 1017 - 1027
- [4] Minimax Normal Two-Armed Bandit with Indefinite Control Horizon [J]. 2016 International Conference Applied Mathematics, Computational Science and Systems Engineering, 2017, 9
- [6] Two-armed bandit problem for parallel data processing systems [J]. Problems of Information Transmission, 2012, 48 : 72 - 84
- [10] Two-Armed Bandit Problem and Batch Version of the Mirror Descent Algorithm [J]. Automation and Remote Control, 2022, 83 : 1288 - 1307