共 50 条
- [1] Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration & Planning THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9336 - 9344
- [2] A Lower Bound for Regret in Logistic Regression 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 2507 - 2512
- [3] Dynamic Regret of Adversarial Linear Mixture MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [4] An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality Gap ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [5] Refined Regret for Adversarial MDPs with Linear Function Approximation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [6] Regret Minimization in MDPs with Options without Prior Knowledge ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [7] Nash Regret Guarantees for Linear Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [8] Regret Guarantees for Online Deep Control LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211