50 items in total
- [1] Counterexample-guided permissive supervisor synthesis for probabilistic systems through learning [J]. 2015 AMERICAN CONTROL CONFERENCE (ACC), 2015: 2894-2899
- [2] Learning Parameterized Policies for Markov Decision Processes through Demonstrations [J]. 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016: 7087-7092
- [3] Learning to Collaborate in Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [4] Learning in Constrained Markov Decision Processes [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01): 441-453
- [5] Counterexample-guided Distributed Permissive Supervisor Synthesis for Probabilistic Multi-agent Systems through Learning [J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016: 5519-5524
- [7] Blackwell Online Learning for Markov Decision Processes [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021
- [8] Online Learning in Kernelized Markov Decision Processes [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [9] Learning Factored Markov Decision Processes with Unawareness [J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115: 123-133
- [10] Bayesian Learning of Noisy Markov Decision Processes [J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2013, 23 (01)