50 items in total
- [1] Reward Learning From Very Few Demonstrations [J]. IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (03) : 893 - 904
- [2] Reward learning from human preferences and demonstrations in Atari [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [3] Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery [J]. 2024 INTERNATIONAL SYMPOSIUM ON MEDICAL ROBOTICS, ISMR 2024, 2024
- [4] Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
- [5] Identifying Reusable Primitives in Narrated Demonstrations [J]. ELEVENTH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN ROBOT INTERACTION (HRI'16), 2016: 479 - 480
- [6] Deep Reward Shaping from Demonstrations [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017: 510 - 517
- [7] Model-based Adversarial Imitation Learning from Demonstrations and Human Reward [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023: 1683 - 1690
- [8] Learning Reward Functions by Integrating Human Demonstrations and Preferences [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019
- [9] Learning to Run with Potential-Based Reward Shaping and Demonstrations from Video Data [J]. 2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018: 286 - 291
- [10] DROID: Learning from Offline Heterogeneous Demonstrations via Reward-Policy Distillation [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229