共 14 条
- [1] Multi-Agent Reinforcement Learning Algorithm with Variable Optimistic-Pessimistic Criterion ECAI 2008, PROCEEDINGS, 2008, 178 : 433 - +
- [2] DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [4] Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (06): : 5974 - 5981
- [6] Distributed safe reinforcement learning for multi-robot motion planning 2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214
- [8] Safe multi-agent motion planning via filtered reinforcement learning 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7270 - 7276