共 115 条
- [1] OSA T, PAJARINEN J, NEUMANN G, Et al., An Algorithmic Perspective on Imitation Learning, in Robo-tics, 7, pp. 1-179, (2018)
- [2] SUTTON R S, BARTO A G., Reinforcement Learning: An Introduction, (1998)
- [3] AKKAYA I, ANDRYCHOWICZ M, CHOCIEJ M, Et al., Solving Rubik's Cube with a Robot Hand
- [4] LEVINE S, FINN C, DARRELL T, Et al., End-to-End Training of Deep Visuomotor Policies, Journal of Machine Learning Research, 17, 1, pp. 1334-1373, (2016)
- [5] FAZELI N, OLLER M, WU J, Et al., See, Feel, Act: Hierarchical Learning for Complex Manipulation Skills with Multisensory Fusion, Science Robotics, 4, 26, (2019)
- [6] FISAC J F, AKAMETALU A K, ZEILINGER M N, Et al., A Gene-ral Safety Framework for Learning-Based Control in Uncertain Robotic Systems, IEEE Transactions on Automatic Control, 64, 7, pp. 2737-2752, (2019)
- [7] KROEMER O, NIEKUM S, KONIDARIS G., A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms, Journal of Machine Learning Research, 22, pp. 1-82, (2021)
- [8] BELLMAN R., On the Theory of Dynamic Programming, Proceedings of the National Academy of Sciences of the United States of America, 38, 8, pp. 716-719, (1952)
- [9] MOERLAND T M, BROEKENS J, JONKER C M., Model-Based Reinforcement Learning: A Survey
- [10] SILVER D, SCHRITTWIESER J, SIMONYAN K, Et al., Mastering the Game of Go without Human Knowledge, Nature, 550, 7676, pp. 354-359, (2017)