Reinforcement learning in robotics: A survey

被引:1609
|
作者
Kober, Jens [1 ,2 ]
Bagnell, J. Andrew [3 ]
Peters, Jan [4 ,5 ]
机构
[1] Univ Bielefeld, CoR Lab Res Inst Cognit & Robot, D-33615 Bielefeld, Germany
[2] Honda Res Inst Europe, Offenbach, Germany
[3] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[4] Max Planck Inst Intelligent Syst, Dept Empir Inference, Tubingen, Germany
[5] Tech Univ Darmstadt, FB Informat, FG Intelligent Autonomous Syst, Darmstadt, Germany
来源
关键词
Reinforcement learning; learning control; robot; survey; REAL ROBOT; POLICY GRADIENT; BEHAVIOR; ACQUISITION; PERCEPTION; LOCOMOTION; SKILLS;
D O I
10.1177/0278364913495721
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.
引用
收藏
页码:1238 / 1274
页数:37
相关论文
共 50 条
  • [1] Reinforcement Learning for Mobile Robotics Exploration: A Survey
    Garaffa, Luiza Caetano
    Basso, Maik
    Konzen, Andrea Aparecida
    de Freitas, Edison Pignaton
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 3796 - 3810
  • [2] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Eduardo F. Morales
    Rafael Murrieta-Cid
    Israel Becerra
    Marco A. Esquivel-Basaldua
    [J]. Intelligent Service Robotics, 2021, 14 : 773 - 805
  • [3] A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning
    Morales, Eduardo F.
    Murrieta-Cid, Rafael
    Becerra, Israel
    Esquivel-Basaldua, Marco A.
    [J]. INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 773 - 805
  • [4] Reinforcement Learning for Pick and Place Operations in Robotics: A Survey
    Lobbezoo, Andrew
    Qian, Yanjun
    Kwon, Hyock-Ju
    [J]. ROBOTICS, 2021, 10 (03)
  • [5] Safe reinforcement learning and its applications in robotics: A survey
    Zhang, Chang-Xin
    Zhang, Xing-Long
    Xu, Xin
    Lu, Yang
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (12): : 2090 - 2103
  • [6] Model-Based Reinforcement Learning in Robotics: A Survey
    Sun, Shiguang
    Lan, Xuguang
    Zhang, Hanbo
    Zheng, Nanning
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (01): : 1 - 16
  • [7] Survey of Model-Based Reinforcement Learning: Applications on Robotics
    Athanasios S. Polydoros
    Lazaros Nalpantidis
    [J]. Journal of Intelligent & Robotic Systems, 2017, 86 : 153 - 173
  • [8] Survey of Model-Based Reinforcement Learning: Applications on Robotics
    Polydoros, Athanasios S.
    Nalpantidis, Lazaros
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 86 (02) : 153 - 173
  • [9] Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
    Zhao, Wenshuai
    Queralta, Jorge Pena
    Westerlund, Tomi
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 737 - 744
  • [10] REINFORCEMENT LEARNING ALGORITHMS IN ROBOTICS
    Bocsi, Botond
    Csato, Lehel
    [J]. KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 131 - 142