共 50 条
- [1] Technical Note: On Ordinal Comparison of Policies in Markov Reward Processes Journal of Optimization Theory and Applications, 2004, 122 : 207 - 217
- [2] Incremental Improvements of Heuristic Policies for Average-Reward Markov Decision Processes IFAC PAPERSONLINE, 2020, 53 (02): : 1721 - 1728
- [4] Markov Decision Processes with Arbitrary Reward Processes RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
- [6] Ordinal Decision Models for Markov Decision Processes 20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 828 - 833
- [8] Adaptive optimization of Markov reward processes 2005 44TH IEEE CONFERENCE ON DECISION AND CONTROL & EUROPEAN CONTROL CONFERENCE, VOLS 1-8, 2005, : 8034 - 8041
- [9] Distributed optimization of Markov reward processes PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 3921 - 3926
- [10] EXISTENCE OF OPTIMAL STATIONARY POLICIES IN AVERAGE REWARD MARKOV DECISION-PROCESSES WITH A RECURRENT STATE APPLIED MATHEMATICS AND OPTIMIZATION, 1992, 26 (02): : 171 - 194