共 50 条
- [1] Online Learning in Kernelized Markov Decision Processes [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [2] Online Learning of Safety function for Markov Decision Processes [J]. 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
- [3] Online Learning in Markov Decision Processes with Continuous Actions [J]. ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316
- [4] Blackwell optimality in Markov decision processes with partial observation [J]. ANNALS OF STATISTICS, 2002, 30 (04): : 1178 - 1193
- [5] Online Markov Decision Processes [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 726 - 736
- [6] Online Learning in Markov Decision Processes with Changing Cost Sequences [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
- [7] Online Learning with Implicit Exploration in Episodic Markov Decision Processes [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1953 - 1958
- [8] Blackwell optimality in Markov decision processes with a Borel state space [J]. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2827 - 2830
- [9] Online Learning in Markov Decision Processes with Arbitrarily Changing Rewards and Transitions [J]. 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009, : 314 - 322