50 entries in total
- [1] Online Learning in Markov Decision Processes with Arbitrarily Changing Rewards and Transitions [J]. 2009 INTERNATIONAL CONFERENCE ON GAME THEORY FOR NETWORKS (GAMENETS 2009), 2009 : 314 - 322
- [2] Blackwell Online Learning for Markov Decision Processes [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021
- [3] Online Learning in Kernelized Markov Decision Processes [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [4] Online Learning of Safety function for Markov Decision Processes [J]. 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023
- [5] Online Learning in Markov Decision Processes with Continuous Actions [J]. ALGORITHMIC LEARNING THEORY, ALT 2015, 2015, 9355 : 302 - 316
- [7] Online Markov Decision Processes [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (03) : 726 - 736
- [8] Online Learning with Implicit Exploration in Episodic Markov Decision Processes [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021 : 1953 - 1958
- [10] Online Markov Decision Processes with Kullback-Leibler Control Cost [J]. 2012 AMERICAN CONTROL CONFERENCE (ACC), 2012 : 1388 - 1393