共 50 条
- [1] A reinforcement learning based algorithm for finite horizon Markov decision processes [J]. PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5519 - 5524
- [2] Reinforcement learning algorithm for partially observable Markov decision processes [J]. Kongzhi yu Juece/Control and Decision, 2004, 19 (11): : 1263 - 1266
- [3] An Inverse Reinforcement Learning Algorithm for semi-Markov Decision Processes [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1256 - 1261
- [4] Reinforcement Learning for Constrained Markov Decision Processes [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [6] A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes [J]. IEEE ACCESS, 2018, 6 : 49089 - 49102
- [7] Kernel-Based Reinforcement Learning in Robust Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [8] Reinforcement learning based algorithms for average cost Markov Decision Processes [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (01): : 23 - 52
- [9] Reinforcement Learning Based Algorithms for Average Cost Markov Decision Processes [J]. Discrete Event Dynamic Systems, 2007, 17 : 23 - 52
- [10] A sensitivity view of Markov decision processes and reinforcement learning [J]. MODELING, CONTROL AND OPTIMIZATION OF COMPLEX SYSTEMS: IN HONOR OF PROFESSOR YU-CHI HO, 2003, 14 : 261 - 283