共 50 条
- [1] Reward learning from human preferences and demonstrations in Atari [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [2] Learning Reward Functions by Integrating Human Demonstrations and Preferences [J]. ROBOTICS: SCIENCE AND SYSTEMS XV, 2019,
- [3] Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (01): : 45 - 67
- [4] Reward Learning from Narrated Demonstrations [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7004 - 7013
- [5] Deep Reward Shaping from Demonstrations [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 510 - 517
- [6] Model-based Adversarial Imitation Learning from Demonstrations and Human Reward [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1683 - 1690
- [7] Reward Learning From Very Few Demonstrations [J]. IEEE TRANSACTIONS ON ROBOTICS, 2021, 37 (03) : 893 - 904
- [9] Reward currency modulates human risk preferences [J]. EVOLUTION AND HUMAN BEHAVIOR, 2016, 37 (02) : 159 - 168
- [10] A New Reward System Based on Human Demonstrations for Hard Exploration Games [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 2401 - 2414