Meta Reinforcement Learning for Sim-to-real Domain Adaptation

被引:0
|
作者
Arndt, Karol [1 ]
Hazara, Murtaza [1 ,3 ,4 ]
Ghadirzadeh, Ali [1 ,2 ]
Kyrki, Ville [1 ]
机构
[1] Aalto Univ, Espoo, Finland
[2] KTH Royal Inst Technol, Stockholm, Sweden
[3] Katholieke Univ Leuven, Dept Mech Engn, Leuven, Belgium
[4] Flanders Make, Robot Core Lab, Lommel, Belgium
基金
芬兰科学院;
关键词
SIMULATION;
D O I
10.1109/icra40945.2020.9196540
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern reinforcement learning methods suffer from low sample efficiency and unsafe exploration, making it infeasible to train robotic policies entirely on real hardware. In this work, we propose to address the problem of sim-to-real domain transfer by using meta learning to train a policy that can adapt to a variety of dynamic conditions, and using a task-specific trajectory generation model to provide an action space that facilitates quick exploration. We evaluate the method by performing domain adaptation in simulation and analyzing the structure of the latent space during adaptation. We then deploy this policy on a KUKA LBR 4+ robot and evaluate its performance on a task of hitting a hockey puck to a target. Our method shows more consistent and stable domain adaptation than the baseline, resulting in better overall performance.
引用
收藏
页码:2725 / 2731
页数:7
相关论文
共 50 条
  • [1] Efficient Sim-to-Real Transfer in Reinforcement Learning Through Domain Randomization and Domain Adaptation
    Shakerimov, Aidar
    Alizadeh, Tohid
    Varol, Huseyin Atakan
    [J]. IEEE ACCESS, 2023, 11 : 136809 - 136824
  • [2] Accelerated Robot Skill Acquisition by Reinforcement Learning-Aided Sim-to-Real Domain Adaptation
    Loncarcvic, Zvezdan
    Ude, Ales
    Gams, Andrej
    [J]. 2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 269 - 274
  • [3] Sim-to-Real in Reinforcement Learning for Everyone
    Vacaro, Juliano
    Marques, Guilherme
    Oliveira, Bruna
    Paz, Gabriel
    Paula, Thomas
    Staehler, Wagston
    Murphy, David
    [J]. 2019 LATIN AMERICAN ROBOTICS SYMPOSIUM, 2019 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR) AND 2019 WORKSHOP ON ROBOTICS IN EDUCATION (LARS-SBR-WRE 2019), 2019, : 305 - 310
  • [4] Grounded action transformation for sim-to-real reinforcement learning
    Josiah P. Hanna
    Siddharth Desai
    Haresh Karnan
    Garrett Warnell
    Peter Stone
    [J]. Machine Learning, 2021, 110 : 2469 - 2499
  • [5] Grounded action transformation for sim-to-real reinforcement learning
    Hanna, Josiah P.
    Desai, Siddharth
    Karnan, Haresh
    Warnell, Garrett
    Stone, Peter
    [J]. MACHINE LEARNING, 2021, 110 (09) : 2469 - 2499
  • [6] Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
    Zhao, Wenshuai
    Queralta, Jorge Pena
    Westerlund, Tomi
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 737 - 744
  • [7] Survey on Sim-to-real Transfer Reinforcement Learning in Robot Systems
    Lin, Qian
    Yu, Chao
    Wu, Xia-Wei
    Dong, Yin-Zhao
    Xu, Xin
    Zhang, Qiang
    Guo, Xian
    [J]. Ruan Jian Xue Bao/Journal of Software, 2024, 35 (02): : 711 - 738
  • [8] Dynamic Bipedal Turning through Sim-to-Real Reinforcement Learning
    Yu, Fangzhou
    Batke, Ryan
    Dao, Jeremy
    Hurst, Jonathan
    Green, Kevin
    Fern, Alan
    [J]. 2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 903 - 910
  • [9] Cyclic policy distillation: Sample-efficient sim-to-real reinforcement learning with domain randomization
    Kadokawa, Yuki
    Zhu, Lingwei
    Tsurumine, Yoshihisa
    Matsubara, Takamitsu
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 165
  • [10] A Domain Data Pattern Randomization based Deep Reinforcement Learning method for Sim-to-Real transfer
    Gong, Peng
    Shi, Dianxi
    Xue, Chao
    Chen, Xucan
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, : 1 - 7