共 50 条
- [1] Average Reward Reinforcement Learning for Semi-Markov Decision Processes [J]. NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [2] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward [J]. 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 1674 - 1679
- [3] A Unified Approach for Semi-Markov Decision Processes with Discounted and Average Reward Criteria [J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1741 - 1744
- [7] Computing semi-stationary optimal policies for multichain semi-Markov decision processes [J]. Annals of Operations Research, 2020, 287 : 843 - 865
- [8] Constrained semi-markov decision processes with average rewards [J]. ZOR. Zeitschrift Fuer Operations Research, 1994, 40 (03):
- [9] Risk-Sensitivity and Average Optimality in Markov and Semi-Markov Reward Processes [J]. 38TH INTERNATIONAL CONFERENCE ON MATHEMATICAL METHODS IN ECONOMICS (MME 2020), 2020, : 537 - 543