共 50 条
- [1] Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [2] Deep Reinforcement Learning from Human Preferences ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [4] Hierarchical learning from human preferences and curiosity Applied Intelligence, 2022, 52 : 7459 - 7479
- [5] Extensive and efficient search of human movements with hierarchical reinforcement learning CA 2002: PROCEEDINGS OF THE COMPUTER ANIMATION 2002, 2002, : 103 - 107
- [6] Human Social Feedback for Efficient Interactive Reinforcement Agent Learning 2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 706 - 712
- [7] Root Cause Analysis for Microservice Systems via Hierarchical Reinforcement Learning from Human Feedback PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5116 - 5125
- [8] Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft NEURIPS 2019 COMPETITION AND DEMONSTRATION TRACK, VOL 123, 2019, 123 : 67 - 76
- [9] Towards Sample Efficient Reinforcement Learning PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5739 - 5743
- [10] Sample Efficient Reinforcement Learning with REINFORCE THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10887 - 10895