共 50 条
- [1] Polynomial-time reinforcement learning of near-optimal policies EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 205 - 210
- [2] A Bayesian reinforcement learning approach in markov games for computing near-optimal policies Annals of Mathematics and Artificial Intelligence, 2023, 91 : 675 - 690
- [8] Near-optimal Reinforcement Learning in Factored MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
- [10] Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies Soft Computing, 2019, 23 : 3591 - 3604