共 50 条
- [21] Dueling Posterior Sampling for Preference-Based Reinforcement Learning [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1029 - 1038
- [24] APReL: A Library for Active Preference-based Reward Learning Algorithms [J]. PROCEEDINGS OF THE 2022 17TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI '22), 2022, : 613 - 617
- [25] Preference-Based Assistance Map Learning With Robust Adaptive Oscillators [J]. IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2022, 4 (04): : 1000 - 1009
- [26] Contextual Bandits and Imitation Learning with Preference-Based Active Queries [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [27] Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [28] A Policy Iteration Algorithm for Learning from Preference-Based Feedback [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XII, 2013, 8207 : 427 - 437
- [29] Active Preference-Based Gaussian Process Regression for Reward Learning [J]. ROBOTICS: SCIENCE AND SYSTEMS XVI, 2020,
- [30] Preference-based Reinforcement Learning with Finite-Time Guarantees [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33