Model-Based Reinforcement Learning in Robotics: A Survey

被引：0

作者：

Sun S. ^{[1
]}

Lan X. ^{[1
]}

Zhang H. ^{[1
]}

Zheng N. ^{[1
]}

机构：

[1] Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an

来源：

Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence | 2022年 / 35卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Artificial Intelligence; Model-Based Reinforcement Learning; Reinforcement Learning; Robot Learning;

D O I：

10.16451/j.cnki.issn1003-6059.202201001

中图分类号：

学科分类号：

摘要：

The model-based reinforcement learning makes robots closer to human-like learning and interaction by learning an environment model and optimizing policy or planning based on the model. In this paper, the definition of robot learning problems is described, and model-based reinforcement learning methods in robot learning are introduced, including mainstream model learning and model utilization methods. The mainstream model learning methods are given including the forward dynamics model, the inverse dynamics model and the implicit model. The model utilization methods are presented including model-based planning, model-based policy learning and implicit planning. The current problems on model-based reinforcement learning are discussed. Aiming at the problems of the robot learning task in reality, the application of model-based reinforcement learning is illustrated and the future research directions are analyzed. © 2022, Science Press. All right reserved.

引用

页码：1 / 16

页数：15

共 115 条

[1] OSA T, PAJARINEN J, NEUMANN G, Et al., An Algorithmic Perspective on Imitation Learning, in Robo-tics, 7, pp. 1-179, (2018)
[2] SUTTON R S, BARTO A G., Reinforcement Learning: An Introduction, (1998)
[3] AKKAYA I, ANDRYCHOWICZ M, CHOCIEJ M, Et al., Solving Rubik's Cube with a Robot Hand
[4] LEVINE S, FINN C, DARRELL T, Et al., End-to-End Training of Deep Visuomotor Policies, Journal of Machine Learning Research, 17, 1, pp. 1334-1373, (2016)
[5] FAZELI N, OLLER M, WU J, Et al., See, Feel, Act: Hierarchical Learning for Complex Manipulation Skills with Multisensory Fusion, Science Robotics, 4, 26, (2019)
[6] FISAC J F, AKAMETALU A K, ZEILINGER M N, Et al., A Gene-ral Safety Framework for Learning-Based Control in Uncertain Robotic Systems, IEEE Transactions on Automatic Control, 64, 7, pp. 2737-2752, (2019)
[7] KROEMER O, NIEKUM S, KONIDARIS G., A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms, Journal of Machine Learning Research, 22, pp. 1-82, (2021)
[8] BELLMAN R., On the Theory of Dynamic Programming, Proceedings of the National Academy of Sciences of the United States of America, 38, 8, pp. 716-719, (1952)
[9] MOERLAND T M, BROEKENS J, JONKER C M., Model-Based Reinforcement Learning: A Survey
[10] SILVER D, SCHRITTWIESER J, SIMONYAN K, Et al., Mastering the Game of Go without Human Knowledge, Nature, 550, 7676, pp. 354-359, (2017)

← 1 2 3 4 5 6 7 8 9 10 →