50 entries in total
- [31] Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets. International Conference on Machine Learning, Vol. 162, 2022
- [35] NAO Robot Learns to Interact with Humans through Imitation Learning from Video Observation. Journal of Intelligent & Robotic Systems, 2023, 109
- [36] On Efficient Online Imitation Learning via Classification. Advances in Neural Information Processing Systems 35, NeurIPS 2022, 2022
- [38] Efficient Imitation Learning with Conservative World Models. 6th Annual Learning for Dynamics & Control Conference, 2024, 242: 1776-1789
- [40] Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning. Advances in Neural Information Processing Systems 35, NeurIPS 2022, 2022