共 50 条
- [24] Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [25] Sublinear Optimal Policy Value Estimation in Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 4377 - 4386
- [26] Contextual bandits with surrogate losses: Margin bounds and efficient algorithms ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [27] Best-of-Both-Worlds Algorithms for Linear Contextual Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [29] Jointly Efficient and Optimal Algorithms for Logistic Bandits INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 546 - 580
- [30] Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202 : 691 - 717