50 entries in total
- [41] On mean reward variance in semi-Markov processes. Mathematical Methods of Operations Research, 2005, 62: 387-397
- [42] Loop Estimator for Discounted Values in Markov Reward Processes. Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence, 2021, 35: 7169-7175
- [45] Converging Markov Decision Processes with Multiplicative Reward System. Bulletin of the Kyushu Institute of Technology - Pure and Applied Mathematics, 2023, (70): 33-41
- [46] Robust Average-Reward Markov Decision Processes. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol 37 No 12, 2023: 15215-15223
- [47] Simulation-based optimization of Markov reward processes. Proceedings of the IEEE Conference on Decision and Control, 1998, 3: 2698-2703
- [48] Average-Reward Decentralized Markov Decision Processes. 20th International Joint Conference on Artificial Intelligence, 2007: 1997-2002
- [49] Simulation-based optimization of Markov reward processes. Proceedings of the 37th IEEE Conference on Decision and Control, Vols 1-4, 1998: 2698-2703