共 50 条
- [1] Nash Regret Guarantees for Linear Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [2] Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [3] Tight Regret Bounds for Infinite-armed Linear Contextual Bandits 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 370 - 378
- [4] On Learning Whittle Index Policy for Restless Bandits With Scalable Regret IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (03): : 1190 - 1202
- [5] Neural Contextual Bandits without Regret INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151 : 240 - 278
- [6] Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [8] Learning in Generalized Linear Contextual Bandits with Stochastic Delays ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [9] Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
- [10] Offline Contextual Bandits with High Probability Fairness Guarantees ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32