50 entries in total
- [32] Distributed optimization of Markov reward processes. PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007: 3921-3926
- [34] Markov reward models and Markov decision processes in discrete and continuous time: Performance evaluation and optimization. Gouberman, Alexander (alexander.gouberman@unibw.de), 1600, Springer Verlag (8453): 156-241
- [37] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward. 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010: 1674-1679
- [40] Incremental Improvements of Heuristic Policies for Average-Reward Markov Decision Processes. IFAC PAPERSONLINE, 2020, 53(02): 1721-1728