50 items in total
- [21] Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (01): 45-67
- [22] Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [23] Reward Redistribution for Reinforcement Learning of Dynamic Nonprehensile Manipulation. 2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021: 326-331
- [24] Sparse Reward based Manipulator Motion Planning by Using High Speed Learning from Demonstrations. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018: 518-523
- [25] Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
- [28] Learning from Corrective Demonstrations. HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019: 712-714