Model accelerated reinforcement learning for high precision robotic assembly

被引:13
|
作者
Zhao, Xin [1 ]
Zhao, Huan [1 ]
Chen, Pengfei [1 ]
Ding, Han [1 ]
机构
[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Robotic assembly; Reinforcement learning; Peg-in-hole; Model acceleration;
D O I
10.1007/s41315-020-00138-z
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Peg-in-hole assembly with narrow clearance is a typical robotic contact-rich task in industrial manufacturing. Robot learning allows robots to directly acquire the assembly skills for this task without modeling and recognizing the complex contact states. However, learning such skills is still challenging for robot because of the difficulties in collecting massive transitions data and transferring skills to new tasks, which inevitably leads to low training efficiency. This paper formulated the assembly task as a Markov decision process, and proposed a model accelerated reinforcement learning method to efficiently learn assembly policy. In this method, the assembly policy is learned with the maximum entropy reinforcement learning framework and executed with an impedance controller, which ensures exploration efficiency meanwhile allows transferring skills between tasks. To reduce sample complexity and improve training efficiency, the proposed method learns the environment dynamics with Gaussian Process while training policy, then, the learned dynamic model is utilized to improve target value estimation and generate virtual data to argument transition samples. This method can robustly learn assembly skills while minimizing real-world interaction requirements which makes it suitable for realistic assembly scenarios. To verify the proposed method, experiments on an industrial robot are conducted, and the results demonstrate that the proposed method improves the training efficiency by 31% compared with the method without model acceleration and the learned skill can be transferred to new tasks to accelerate the training for new policies.
引用
收藏
页码:202 / 216
页数:15
相关论文
共 50 条
  • [31] Reinforcement Learning for Robotic Assembly Using Non-Diagonal Stiffness Matrix
    Oikawa, Masahide
    Kusakabe, Tsukasa
    Kutsuzawa, Kyo
    Sakaino, Sho
    Tsuji, Toshiaki
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 2737 - 2744
  • [32] Data-Efficient Hierarchical Reinforcement Learning for Robotic Assembly Control Applications
    Hou, Zhimin
    Fei, Jiajun
    Deng, Yuelin
    Xu, Jing
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (11) : 11565 - 11575
  • [33] Robotic precision assembly system for microstructures
    Shao, Chao
    Ye, Xin
    Qian, Jiahui
    Zhang, Zhijing
    Zhu, Dongsheng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2020, 234 (08) : 948 - 958
  • [34] Robotic Table Tennis with Model-Free Reinforcement Learning
    Gao, Wenbo
    Graesser, Laura
    Choromanski, Krzysztof
    Song, Xingyou
    Lazic, Nevena
    Sanketi, Pannag
    Sindhwani, Vikas
    Jaitly, Navdeep
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5556 - 5563
  • [35] MODEL-FREE ONLINE REINFORCEMENT LEARNING OF A ROBOTIC MANIPULATOR
    Sweafford, Jerry, Jr.
    Fahimi, Farbod
    MECHATRONIC SYSTEMS AND CONTROL, 2019, 47 (03): : 136 - 143
  • [36] Reinforcement Learning for Precision Oncology
    Eckardt, Jan-Niklas
    Wendt, Karsten
    Bornhaeuser, Martin
    Middeke, Jan Moritz
    CANCERS, 2021, 13 (18)
  • [37] Robotic Peg-in-Hole Assembly Strategy Research Based on Reinforcement Learning Algorithm
    Li, Shaodong
    Yuan, Xiaogang
    Niu, Jie
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [38] Local connection reinforcement learning method for efficient robotic peg-in-hole assembly
    Gai, Yuhang
    Zhang, Jiwen
    Wu, Dan
    Chen, Ken
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [39] Integrated robotic system for high precision assembly in a semi-structured environment
    Chen, Heping
    Zhang, George
    Zhang, Hui
    Fuhlbrigge, Thomas A.
    ASSEMBLY AUTOMATION, 2007, 27 (03) : 247 - 252
  • [40] THE CLOSED-LOOP ASSEMBLY MICROPOSITIONER (CLAMP) END EFFECTOR FOR HIGH-PRECISION ROBOTIC ASSEMBLY
    DERBY, S
    ROBOTS 13: CONFERENCE PROCEEDINGS, 1989, : I39 - I50