共 50 条
- [11] Pareto-Optimal Model Selection via SPRINT-Race [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (02) : 596 - 610
- [12] The Pareto Regret Frontier for Bandits [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [14] Near-Optimal Representation Learning for Linear Bandits and Linear RL [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [15] Optimal Best-arm Identification in Linear Bandits [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [16] Provably Optimal Algorithms for Generalized Linear Contextual Bandits [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [17] Dynamic Balancing for Model Selection in Bandits and RL [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [18] Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design [J]. STOC '21: PROCEEDINGS OF THE 53RD ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2021, : 74 - 87
- [19] Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [20] Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 691 - 717