共 50 条
- [33] Average Reward Reinforcement Learning for Semi-Markov Decision Processes NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 768 - 777
- [39] Risk Aversion in Finite Markov Decision Processes Using Total Cost Criteria and Average Value at Risk 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 335 - 342