共 50 条
- [31] From Perturbation Analysis to Markov Decision Processes and Reinforcement Learning [J]. Discrete Event Dynamic Systems, 2003, 13 : 9 - 39
- [32] From perturbation analysis to Markov decision processes and reinforcement learning [J]. DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): : 9 - 39
- [33] Reinforcement Learning for Cost-Aware Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [34] Online Learning for Markov Decision Processes in Nonstationary Environments: A Dynamic Regret Analysis [J]. 2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 1232 - 1237
- [35] Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [39] Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes [J]. Doklady Mathematics, 2023, 108 : S382 - S392
- [40] Model-Free Reinforcement Learning for Branching Markov Decision Processes [J]. COMPUTER AIDED VERIFICATION, PT II, CAV 2021, 2021, 12760 : 651 - 673