50 items in total
- [1] MULTIOBJECTIVE MARKOV DECISION-PROCESS WITH AVERAGE REWARD CRITERION [J]. LARGE SCALE SYSTEMS IN INFORMATION AND DECISION TECHNOLOGIES, 1986, 10 (03) : 215 - 226
- [3] Bounded parameter Markov decision processes with average reward criterion [J]. LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 263 - +
- [5] REWARD REVISION AND THE AVERAGE REWARD MARKOV DECISION-PROCESS [J]. OR SPEKTRUM, 1987, 9 (04) : 203 - 211
- [9] Optimal control of average reward constrained continuous-time finite Markov Decision Processes [J]. PROCEEDINGS OF THE 41ST IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, 2002 : 3805 - 3810
- [10] Continuous-time Markov Decision Process with Average Reward: Using Reinforcement Learning Method [J]. 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015 : 3097 - 3100