Learning Variable Impedance Control via Inverse Reinforcement Learning for Force-Related Tasks

被引:65
|
作者
Zhang, Xiang [1 ]
Sun, Liting [1 ]
Kuang, Zhian [1 ,2 ]
Tomizuka, Masayoshi [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[2] Harbin Inst Technol, Res Inst Intelligent Control & Syst, Harbin 150001, Peoples R China
关键词
Compliance and impedance control; learning from demonstration; machine learning for robot control;
D O I
10.1109/LRA.2021.3061374
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Many manipulation tasks require robots to interact with unknown environments. In such applications, the ability to adapt the impedance according to different task phases and environment constraints is crucial for safety and performance. Although many approaches based on deep reinforcement learning (RI) and learning from demonstration (LfD) have been proposed to obtain variable impedance skills on contact-rich manipulation tasks, these skills are typically task-specific and could be sensitive to changes in task settings. This letter proposes an inverse reinforcement learning (IRL) based approach to recover both the variable impedance policy and reward function from expert demonstrations. We explore different action space of the reward functions to achieve a more general representation of expert variable impedance skills. Experiments on two variable impedance tasks (Peg-in-Hole and Cup-on-Plate) were conducted in both simulations and on a real FANUC LR Mate 200iD/7 L industrial robot. The comparison results with behavior cloning and force-based IRL proved that the learned reward function in the gain action space has better transferability than in the force space. Experiment videos are available at https://msc.berkeley.edu/research/impedance-irl.html.
引用
收藏
页码:2225 / 2232
页数:8
相关论文
共 50 条
  • [1] Learning Tasks in Intelligent Environments via Inverse Reinforcement Learning
    Shah, Syed Ihtesham Hussain
    Coronato, Antonio
    2021 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS (IE), 2021,
  • [2] Learning variable impedance control based on reinforcement learning
    Li C.
    Zhang Z.
    Xia G.
    Xie X.
    Zhu Q.
    Liu Q.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2019, 40 (02): : 304 - 311
  • [3] Learning Variable Impedance Control for Contact Sensitive Tasks
    Bogdanovic, Miroslav
    Khadiv, Majid
    Righetti, Ludovic
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6129 - 6136
  • [4] Reinforcement Learning of Impedance Control In Stochastic Force Fields
    Stulp, Freek
    Buchli, Jonas
    Ellmer, Alice
    Mistry, Michael
    Theodorou, Evangelos
    Schaal, Stefan
    2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
  • [5] Reinforcement Learning Based Variable Impedance Control for High Precision Human-robot Collaboration Tasks
    Meng, Yan
    Su, Jianhua
    Wu, Jiaxi
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 560 - 565
  • [6] Data-Efficient Reinforcement Learning for Variable Impedance Control
    Anand, Akhil S.
    Kaushik, Rituraj
    Gravdahl, Jan Tommy
    Abu-Dakka, Fares J.
    IEEE ACCESS, 2024, 12 : 15631 - 15641
  • [7] Learning Variable Impedance Control for Robotic Massage With Deep Reinforcement Learning: A Novel Learning Framework
    Li, Zhuoran
    Zeng, Chao
    Deng, Zhen
    Xu, Qinling
    He, Bingwei
    Zhang, Jianwei
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2024, 10 (01): : 17 - 27
  • [8] Learning Assembly Tasks in a Few Minutes by Combining Impedance Control and Residual Recurrent Reinforcement Learning
    Kulkarni, Padmaja
    Kober, Jens
    Babuska, Robert
    Della Santina, Cosimo
    ADVANCED INTELLIGENT SYSTEMS, 2022, 4 (01)
  • [9] Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation
    Gamrian, Shani
    Goldberg, Yoav
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [10] Learning variable impedance control
    Buchli, Jonas
    Stulp, Freek
    Theodorou, Evangelos
    Schaal, Stefan
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2011, 30 (07): : 820 - 833