50 entries in total
- [2] Online Learning with Implicit Exploration in Episodic Markov Decision Processes [J]. 2021 AMERICAN CONTROL CONFERENCE (ACC), 2021: 1953-1958
- [3] Differentially Private Regret Minimization in Episodic Markov Decision Processes [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 6375-6383
- [4] Learning to Collaborate in Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [5] Learning in Constrained Markov Decision Processes [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10(01): 441-453
- [6] Pure Exploration in Episodic Fixed-Horizon Markov Decision Processes [J]. AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017: 1703-1704
- [7] Blackwell Online Learning for Markov Decision Processes [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021
- [8] Online Learning in Kernelized Markov Decision Processes [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [9] Learning Factored Markov Decision Processes with Unawareness [J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115: 123-133
- [10] Bayesian Learning of Noisy Markov Decision Processes [J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2013, 23(01)