共 50 条
- [41] Tug-of-War Model for Multi-armed Bandit Problem UNCONVENTIONAL COMPUTATION, PROCEEDINGS, 2010, 6079 : 69 - +
- [42] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1152 - 1161
- [46] Scaling Multi-Armed Bandit Algorithms KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1449 - 1459
- [47] A stochastic multi-armed bandit approach to nonparametric H∞-norm estimation 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
- [50] IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT MARKOV PROCESS AND CONTROL THEORY, 1989, 54 : 158 - 163