共 50 条
- [41] Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1422 - 1429
- [42] Topological Value Iteration Algorithm for Markov Decision Processes 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1860 - 1865
- [44] New prioritized value iteration for Markov decision processes Artificial Intelligence Review, 2012, 37 : 157 - 167
- [45] Approximate Policy Iteration for Markov Control Revisited COMPLEX ADAPTIVE SYSTEMS 2012, 2012, 12 : 90 - 95
- [48] Mean Field Approximation of the Policy Iteration Algorithm for Graph-based Markov Decision Processes ECAI 2006, PROCEEDINGS, 2006, 141 : 595 - +
- [49] Average-Reward Decentralized Markov Decision Processes 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1997 - 2002
- [50] Solving transition independent decentralized Markov decision processes Journal of Artificial Intelligence Research, 1600, 22 : 423 - 455