50 items in total
- [24] A learning algorithm for Markov decision processes with adaptive state aggregation. PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000: 3351-3356
- [25] On the controller synthesis for finite-state Markov decision processes. FSTTCS 2005: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, PROCEEDINGS, 2005, 3821: 541-552
- [29] Blackwell optimality in Markov decision processes with a Borel state space. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997: 2827-2830
- [30] Pseudometrics for state aggregation in average reward Markov decision processes. ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2007, 4754: 373-387