50 items in total
- [31] Regret, portfolio choice, and guarantees in defined contribution schemes. INSURANCE MATHEMATICS & ECONOMICS, 2006, 39 (02): 219-229
- [32] Online Learning for Predictive Control with Provable Regret Guarantees. 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022: 6666-6671
- [33] No Regret Bound for Extreme Bandits. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51: 259-267
- [34] Nearly Optimal Latent State Decoding in Block MDPs. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [35] Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [36] Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs). JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2017, 59: 229-264
- [37] Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [38] Rate-matching the regret lower-bound in the linear quadratic regulator with unknown dynamics. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023: 536-541
- [40] Data-Driven Online Model Selection With Regret Guarantees. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238