50 items in total
- [31] Transient solutions for multidimensional denumerable state Markov processes. Queueing Syst., (1-4) : 317 - 329
- [34] Pseudometrics for state aggregation in average reward Markov decision processes. Algorithmic Learning Theory, Proceedings, 2007, 4754 : 373 - 387
- [35] An average-value-at-risk criterion for Markov decision processes with unbounded costs Frontiers of Mathematics in China, 2022, 17 : 673 - 687
- [38] Optimization of denumerable semi-Markov decision processes. Systems Science, 1980, 6 (02) : 129 - 141
- [39] Denumerable controlled Markov chains with average reward criterion: Sample path optimality. ZOR. Zeitschrift Fuer Operations Research, 41 (01)