Reinforcement learning from expert demonstrations with application to redundant robot control

Cited by: 5
Authors
Ramirez, Jorge [1]
Yu, Wen [1]
Affiliations
[1] CINVESTAV IPN, Nat Polytech Inst, Dept Control Automat, Mexico City, Mexico
Keywords
Reinforcement learning; Expert demonstrations; Biased exploration; Robot manipulator
DOI
10.1016/j.engappai.2022.105753
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Current methods of reinforcement learning from expert demonstrations require humans to provide all possible demonstrations in the learning phase, which is very difficult in continuous or high-dimensional spaces. In this paper, we propose biased exploration reinforcement learning, which uses the expert demonstrations to avoid exploring unnecessary states and actions. We present a convergence analysis of the novel method. The method is applied to learn the control of a redundant robot manipulator with 7 degrees of freedom. The experimental results demonstrate that the proposed method accelerates the learning phase and that the obtained policy successfully achieves the intended task.
Pages: 10
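
The abstract names biased exploration but does not spell out the algorithm. The following is only a minimal sketch of one way such a bias could look, not the authors' method: tabular Q-learning on a hypothetical 8-state chain, where an exploratory step follows a demonstrated action with probability `beta` whenever a demonstration exists for the current state. The environment, the `demos` table, and all parameter names and values are assumptions made for illustration.

```python
import random

def biased_action(q_row, demo_action, epsilon, beta, rng):
    """Epsilon-greedy choice whose exploratory step is biased toward a demonstration."""
    if rng.random() >= epsilon:
        # Exploit: greedy action, ties broken at random.
        best = max(q_row)
        return rng.choice([a for a, v in enumerate(q_row) if v == best])
    if demo_action is not None and rng.random() < beta:
        # Explore, but follow the expert where a demonstration exists.
        return demo_action
    # No demonstration (or bias not applied): uniform exploration.
    return rng.randrange(len(q_row))

def train(n_states=8, episodes=500, epsilon=0.3, beta=0.8,
          alpha=0.5, gamma=0.95, seed=0):
    rng = random.Random(seed)
    # Hypothetical partial demonstrations: the expert moves right (action 1)
    # in the first three states only; the other states have no demonstration.
    demos = {0: 1, 1: 1, 2: 1}
    q = [[0.0, 0.0] for _ in range(n_states)]  # actions: 0 = left, 1 = right
    goal = n_states - 1
    for _ in range(episodes):
        s = 0
        for _ in range(200):  # step cap per episode
            a = biased_action(q[s], demos.get(s), epsilon, beta, rng)
            s_next = min(goal, s + 1) if a == 1 else max(0, s - 1)
            r = 1.0 if s_next == goal else 0.0
            # Standard Q-learning target; the goal state is terminal.
            bootstrap = 0.0 if s_next == goal else gamma * max(q[s_next])
            q[s][a] += alpha * (r + bootstrap - q[s][a])
            s = s_next
            if s == goal:
                break
    return q

def greedy_policy(q):
    """Greedy action for every non-terminal state."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q[:-1]]
```

Calling `greedy_policy(train())` returns the learned action for each non-terminal state. Setting `beta=0.0` recovers plain epsilon-greedy exploration, which gives a baseline for comparing how quickly the goal is first reached.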