Model-Based Reinforcement Learning in Robotics: A Survey

被引:0
|
作者
Sun S. [1 ]
Lan X. [1 ]
Zhang H. [1 ]
Zheng N. [1 ]
机构
[1] Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an
基金
中国国家自然科学基金;
关键词
Artificial Intelligence; Model-Based Reinforcement Learning; Reinforcement Learning; Robot Learning;
D O I
10.16451/j.cnki.issn1003-6059.202201001
中图分类号
学科分类号
摘要
The model-based reinforcement learning makes robots closer to human-like learning and interaction by learning an environment model and optimizing policy or planning based on the model. In this paper, the definition of robot learning problems is described, and model-based reinforcement learning methods in robot learning are introduced, including mainstream model learning and model utilization methods. The mainstream model learning methods are given including the forward dynamics model, the inverse dynamics model and the implicit model. The model utilization methods are presented including model-based planning, model-based policy learning and implicit planning. The current problems on model-based reinforcement learning are discussed. Aiming at the problems of the robot learning task in reality, the application of model-based reinforcement learning is illustrated and the future research directions are analyzed. © 2022, Science Press. All right reserved.
引用
收藏
页码:1 / 16
页数:15
相关论文
共 115 条
  • [1] OSA T, PAJARINEN J, NEUMANN G, Et al., An Algorithmic Perspective on Imitation Learning, in Robo-tics, 7, pp. 1-179, (2018)
  • [2] SUTTON R S, BARTO A G., Reinforcement Learning: An Introduction, (1998)
  • [3] AKKAYA I, ANDRYCHOWICZ M, CHOCIEJ M, Et al., Solving Rubik's Cube with a Robot Hand
  • [4] LEVINE S, FINN C, DARRELL T, Et al., End-to-End Training of Deep Visuomotor Policies, Journal of Machine Learning Research, 17, 1, pp. 1334-1373, (2016)
  • [5] FAZELI N, OLLER M, WU J, Et al., See, Feel, Act: Hierarchical Learning for Complex Manipulation Skills with Multisensory Fusion, Science Robotics, 4, 26, (2019)
  • [6] FISAC J F, AKAMETALU A K, ZEILINGER M N, Et al., A Gene-ral Safety Framework for Learning-Based Control in Uncertain Robotic Systems, IEEE Transactions on Automatic Control, 64, 7, pp. 2737-2752, (2019)
  • [7] KROEMER O, NIEKUM S, KONIDARIS G., A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms, Journal of Machine Learning Research, 22, pp. 1-82, (2021)
  • [8] BELLMAN R., On the Theory of Dynamic Programming, Proceedings of the National Academy of Sciences of the United States of America, 38, 8, pp. 716-719, (1952)
  • [9] MOERLAND T M, BROEKENS J, JONKER C M., Model-Based Reinforcement Learning: A Survey
  • [10] SILVER D, SCHRITTWIESER J, SIMONYAN K, Et al., Mastering the Game of Go without Human Knowledge, Nature, 550, 7676, pp. 354-359, (2017)