共 50 条
- [2] Reward learning from human preferences and demonstrations in Atari ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [3] Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery 2024 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS, ISMR 2024, 2024,
- [5] Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [6] Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1146 - 1156
- [7] Identifying Reusable Primitives in Narrated Demonstrations ELEVENTH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN ROBOT INTERACTION (HRI'16), 2016, : 479 - 480
- [8] Deep Reward Shaping from Demonstrations 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 510 - 517
- [9] Model-based Adversarial Imitation Learning from Demonstrations and Human Reward 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1683 - 1690
- [10] Learning Reward Functions by Integrating Human Demonstrations and Preferences ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,