Reinforcement learning in robotics: A survey

被引:1609
|
作者
Kober, Jens [1 ,2 ]
Bagnell, J. Andrew [3 ]
Peters, Jan [4 ,5 ]
机构
[1] Univ Bielefeld, CoR Lab Res Inst Cognit & Robot, D-33615 Bielefeld, Germany
[2] Honda Res Inst Europe, Offenbach, Germany
[3] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[4] Max Planck Inst Intelligent Syst, Dept Empir Inference, Tubingen, Germany
[5] Tech Univ Darmstadt, FB Informat, FG Intelligent Autonomous Syst, Darmstadt, Germany
来源
关键词
Reinforcement learning; learning control; robot; survey; REAL ROBOT; POLICY GRADIENT; BEHAVIOR; ACQUISITION; PERCEPTION; LOCOMOTION; SKILLS;
D O I
10.1177/0278364913495721
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.
引用
收藏
页码:1238 / 1274
页数:37
相关论文
共 50 条
  • [31] Evolutionary Reinforcement Learning: A Survey
    Bai, Hui
    Cheng, Ran
    Jin, Yaochu
    [J]. Intelligent Computing, 2023, 2
  • [32] Reinforcement Learning on Graphs: A Survey
    Nie, Mingshuo
    Chen, Dongming
    Wang, Dongqi
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04): : 1065 - 1082
  • [33] Safe Reinforcement Learning: A Survey
    Wang, Xue-Song
    Wang, Rong-Rong
    Cheng, Yu-Hu
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (09): : 1813 - 1835
  • [34] Ensemble reinforcement learning: A survey
    Song, Yanjie
    Suganthan, Ponnuthurai Nagaratnam
    Pedrycz, Witold
    Ou, Junwei
    He, Yongming
    Chen, Yingwu
    Wu, Yutong
    [J]. APPLIED SOFT COMPUTING, 2023, 149
  • [35] A survey of inverse reinforcement learning
    Adams, Stephen
    Cody, Tyler
    Beling, Peter A.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (06) : 4307 - 4346
  • [36] A survey of inverse reinforcement learning
    Stephen Adams
    Tyler Cody
    Peter A. Beling
    [J]. Artificial Intelligence Review, 2022, 55 : 4307 - 4346
  • [37] Survey on Inverse Reinforcement Learning
    Zhang, Li-Hua
    Liu, Quan
    Huang, Zhi-Gang
    Zhu, Fei
    [J]. Ruan Jian Xue Bao/Journal of Software, 2023, 34 (10): : 4772 - 4803
  • [38] Deep reinforcement learning: a survey
    Hao-nan Wang
    Ning Liu
    Yi-yun Zhang
    Da-wei Feng
    Feng Huang
    Dong-sheng Li
    Yi-ming Zhang
    [J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 1726 - 1744
  • [39] Deep Reinforcement Learning: A Survey
    Wang, Xu
    Wang, Sen
    Liang, Xingxing
    Zhao, Dawei
    Huang, Jincai
    Xu, Xin
    Dai, Bin
    Miao, Qiguang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5064 - 5078
  • [40] A TUTORIAL SURVEY OF REINFORCEMENT LEARNING
    KEERTHI, SS
    RAVINDRAN, B
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1994, 19 : 851 - 889