共 50 条
- [21] Towards Efficient Computation of Error Bounded Solutions in POMDPs: Expected Value Approximation and Dynamic Disjunctive Beliefs 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2638 - 2643
- [22] Improved Planning for Infinite-Horizon Interactive POMDPs Using Probabilistic Inference PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1839 - 1840
- [23] Discrete-Time Nonlinear Generalized Policy Iteration for Optimal Control Using Neural Networks NEURAL INFORMATION PROCESSING (ICONIP 2014), PT I, 2014, 8834 : 389 - 396
- [24] On Generalized Policy Iteration for Continuous-Time Linear Systems 2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 1722 - 1728
- [25] Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 7941 - 7948
- [27] Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2162 - 2164
- [28] GLOBAL WEAK SOLUTIONS FOR GENERALIZED SQG IN BOUNDED DOMAINS ANALYSIS & PDE, 2018, 11 (04): : 1029 - 1047
- [30] Learning Others' Intentional Models in Multi-Agent Settings Using Interactive POMDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31