共 50 条
- [1] Learning Reward Functions by Integrating Human Demonstrations and Preferences [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
- [3] Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (01): : 45 - 67
- [4] Reward Learning from Narrated Demonstrations [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7004 - 7013
- [5] Reward Learning From Very Few Demonstrations [J]. IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (03) : 893 - 904
- [6] Model-based Adversarial Imitation Learning from Demonstrations and Human Reward [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1683 - 1690
- [8] Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery [J]. 2024 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS, ISMR 2024, 2024,
- [9] Active Reward Learning from Online Preferences [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 7511 - 7518
- [10] Objective learning from human demonstrations [J]. ANNUAL REVIEWS IN CONTROL, 2021, 51 : 111 - 129