A Model-Based Method for Learning Locomotion Skills from Demonstration

被引:2
|
作者
Park, Hyunseong [1 ]
Yoon, Sukmin [1 ]
Kim, Yong-Duk [1 ]
机构
[1] Agcy Def Dev, Dajeon, South Korea
关键词
D O I
10.1109/SMC52423.2021.9658875
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While Generative Adversarial Imitation Learning (GAIL) shows remarkable performance in many high dimensional imitation learning tasks, it requires too many sampled transitions, which are infeasible for some real world problems. In this paper, we demonstrate how exploiting the reward function in GAIL can improve sample efficiency. We design our algorithm end-to-end differentiable so that the learned reward function can directly participate in policy updates. End-to-end differentiability can be achieved by introducing a forward model of the environment, enabling direct calculation of the cumulative reward function. However, using a forward model has two significant limitations that it heavily relies on the performance of the forward model and requires multi-step prediction, which causes severe error accumulation. The proposed end-to-end differentiable adversarial imitation learning algorithm alleviates these limitations. Also, we suggest applying several existing regularization techniques for robust training of a forward model. We call our algorithm, integrated with these regularization methods, fully Differentiable Regularized GAIL (DRGAIL), and test DRGAIL on continuous control tasks.
引用
收藏
页码:327 / 332
页数:6
相关论文
共 50 条
  • [21] Continual learning from demonstration of robotics skills
    Auddy, Sayantan
    Hollenstein, Jakob
    Saveriano, Matteo
    Rodriguez-Sanchez, Antonio
    Piater, Justus
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 165
  • [22] Accessibility-Based Clustering for Efficient Learning of Locomotion Skills
    Zhang, Chong
    Yu, Wanming
    Li, Zhibin
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 1600 - 1606
  • [23] Model-Based Reinforcement Learning Method for Microgrid Optimization Scheduling
    Yao, Jinke
    Xu, Jiachen
    Zhang, Ning
    Guan, Yajuan
    [J]. SUSTAINABILITY, 2023, 15 (12)
  • [24] A Robust Model-Based Biped Locomotion Framework Based on Three-Mass Model: From Planning to Control
    Kasaei, Mohammadreza
    Ahmadi, Ali
    Lau, Nuno
    Pereira, Artur
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 257 - 262
  • [25] Model Learning and Model-Based Testing
    Aichernig, Bernhard K.
    Mostowski, Wojciech
    Mousavi, Mohammad Reza
    Tappler, Martin
    Taromirad, Masoumeh
    [J]. MACHINE LEARNING FOR DYNAMIC SOFTWARE ANALYSIS: POTENTIALS AND LIMITS, 2018, 11026 : 74 - 100
  • [26] Learning Agile Locomotion Skills with a Mentor
    Iscen, Atil
    Yu, George
    Escontrela, Alejandro
    Jain, Deepali
    Tan, Jie
    Caluwaerts, Ken
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 2019 - 2025
  • [27] Learning locomotion skills in evolvable robots
    Lan, Gongjin
    van Hooft, Maarten
    De Carlo, Matteo
    Tomczak, Jakub M.
    Eiben, A. E.
    [J]. NEUROCOMPUTING, 2021, 452 : 294 - 306
  • [28] Incremental Learning of Primitive Skills from Demonstration of a Task
    Lee, Sang Hyoung
    Kim, Hyung Kyu
    Suh, Il Hong
    [J]. PROCEEDINGS OF THE 6TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTIONS (HRI 2011), 2011, : 185 - 186
  • [29] Learning locomotion skills in evolvable robots
    [J]. Lan, Gongjin (g.lan@vu.nl), 1600, Elsevier B.V. (452):
  • [30] Model-Based Deep Learning
    Shlezinger, Nir
    Whang, Jay
    Eldar, Yonina C.
    Dimakis, Alexandros G.
    [J]. PROCEEDINGS OF THE IEEE, 2023, 111 (05) : 465 - 499