共 50 条
- [21] Bounded parameter Markov decision processes with average reward criterion [J]. LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 263 - +
- [23] MULTIOBJECTIVE MARKOV DECISION-PROCESS WITH AVERAGE REWARD CRITERION [J]. LARGE SCALE SYSTEMS IN INFORMATION AND DECISION TECHNOLOGIES, 1986, 10 (03): : 215 - 226
- [24] Optimal switching problem for countable Markov chains: average reward criterion [J]. Mathematical Methods of Operations Research, 2001, 53 : 1 - 24
- [26] COUNTEREXAMPLE IN CONTINUOUS MARKOV DECISION CHAINS [J]. MANAGEMENT SCIENCE SERIES A-THEORY, 1974, 21 (03): : 358 - 359
- [29] Another Set of Optimality Conditions for Zero-Sum Stochastic Games with Sample-Path Average Payoffs [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2014, 52 (07): : 23 - 37