Model accelerated reinforcement learning for high precision robotic assembly

Cited by: 13

Authors:

Zhao, Xin [1]

Zhao, Huan [1]

Chen, Pengfei [1]

Ding, Han [1]

Affiliations:

[1] Huazhong Univ Sci & Technol, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China

Source:

INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS | 2020 / Vol. 4 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

Robotic assembly; Reinforcement learning; Peg-in-hole; Model acceleration;

DOI:

10.1007/s41315-020-00138-z

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

Peg-in-hole assembly with narrow clearance is a typical contact-rich robotic task in industrial manufacturing. Robot learning allows robots to acquire the assembly skills for this task directly, without modeling and recognizing the complex contact states. However, learning such skills remains challenging for robots because it is difficult to collect large amounts of transition data and to transfer skills to new tasks, which leads to low training efficiency. This paper formulates the assembly task as a Markov decision process and proposes a model-accelerated reinforcement learning method to learn the assembly policy efficiently. In this method, the assembly policy is learned within the maximum entropy reinforcement learning framework and executed with an impedance controller, which ensures exploration efficiency while allowing skills to be transferred between tasks. To reduce sample complexity and improve training efficiency, the proposed method learns the environment dynamics with a Gaussian process while training the policy; the learned dynamics model is then used to improve target value estimation and to generate virtual data that augment the transition samples. This method can robustly learn assembly skills while minimizing real-world interaction, which makes it suitable for realistic assembly scenarios. To verify the proposed method, experiments are conducted on an industrial robot. The results demonstrate that the proposed method improves training efficiency by 31% compared with the method without model acceleration, and that the learned skill can be transferred to new tasks to accelerate the training of new policies.
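
The data-augmentation idea in the abstract (fit a Gaussian process to observed transitions, then use it to generate virtual transitions) can be illustrated with a minimal sketch. This is not the authors' implementation: the class `GPDynamics`, the function `toy_step`, the kernel hyperparameters, and the toy linear environment are all hypothetical stand-ins, and uniformly random actions take the place of the maximum entropy policy. The use of the model for target value estimation is not shown.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq_dist = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    return np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


class GPDynamics:
    """Hypothetical helper: GP regression from (state, action) to state change."""

    def __init__(self, noise_var=1e-4, length_scale=1.0):
        self.noise_var = noise_var          # assumed observation noise
        self.length_scale = length_scale    # assumed kernel length scale

    def fit(self, states, actions, next_states):
        self.X = np.hstack([states, actions])
        deltas = next_states - states       # learn the change, not the raw next state
        K = rbf_kernel(self.X, self.X, self.length_scale)
        K += self.noise_var * np.eye(len(self.X))
        self.alpha = np.linalg.solve(K, deltas)
        return self

    def predict(self, states, actions):
        """GP posterior mean of the next state."""
        Xq = np.hstack([states, actions])
        K_star = rbf_kernel(Xq, self.X, self.length_scale)
        return states + K_star @ self.alpha


def toy_step(states, actions):
    """Toy stand-in for the real robot: the state moves by 0.1 * action."""
    return states + 0.1 * actions


rng = np.random.default_rng(0)

# "Real" transitions collected from the (toy) environment.
real_s = rng.uniform(-1.0, 1.0, size=(200, 2))
real_a = rng.uniform(-1.0, 1.0, size=(200, 2))
real_s_next = toy_step(real_s, real_a)

model = GPDynamics().fit(real_s, real_a, real_s_next)

# Virtual transitions: start from visited states, try new actions,
# and let the learned model predict the outcome.
virt_s = real_s[rng.integers(0, len(real_s), size=50)]
virt_a = rng.uniform(-1.0, 1.0, size=(50, 2))
virt_s_next = model.predict(virt_s, virt_a)

# Augmented replay buffer: real plus virtual samples.
buffer_s = np.vstack([real_s, virt_s])
buffer_a = np.vstack([real_a, virt_a])
buffer_s_next = np.vstack([real_s_next, virt_s_next])

# Worst-case model error on the virtual samples (available only in this toy setup).
model_error = np.max(np.abs(virt_s_next - toy_step(virt_s, virt_a)))
```

In this sketch the policy update would draw from the augmented buffer, so fewer real robot interactions are needed per update. A practical version would also use the GP predictive variance to discard uncertain virtual samples.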

Pages: 202 - 216

Page count: 15

50 items in total

[31] Reinforcement Learning for Robotic Assembly Using Non-Diagonal Stiffness Matrix
Oikawa, Masahide
Kusakabe, Tsukasa
Kutsuzawa, Kyo
Sakaino, Sho
Tsuji, Toshiaki
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 2737 - 2744
[32] Data-Efficient Hierarchical Reinforcement Learning for Robotic Assembly Control Applications
Hou, Zhimin
Fei, Jiajun
Deng, Yuelin
Xu, Jing
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (11) : 11565 - 11575
[33] Robotic precision assembly system for microstructures
Shao, Chao
Ye, Xin
Qian, Jiahui
Zhang, Zhijing
Zhu, Dongsheng
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2020, 234 (08) : 948 - 958
[34] Robotic Table Tennis with Model-Free Reinforcement Learning
Gao, Wenbo
Graesser, Laura
Choromanski, Krzysztof
Song, Xingyou
Lazic, Nevena
Sanketi, Pannag
Sindhwani, Vikas
Jaitly, Navdeep
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5556 - 5563
[35] MODEL-FREE ONLINE REINFORCEMENT LEARNING OF A ROBOTIC MANIPULATOR
Sweafford, Jerry, Jr.
Fahimi, Farbod
MECHATRONIC SYSTEMS AND CONTROL, 2019, 47 (03) : 136 - 143
[36] Reinforcement Learning for Precision Oncology
Eckardt, Jan-Niklas
Wendt, Karsten
Bornhaeuser, Martin
Middeke, Jan Moritz
CANCERS, 2021, 13 (18)
[37] Robotic Peg-in-Hole Assembly Strategy Research Based on Reinforcement Learning Algorithm
Li, Shaodong
Yuan, Xiaogang
Niu, Jie
APPLIED SCIENCES-BASEL, 2022, 12 (21)
[38] Local connection reinforcement learning method for efficient robotic peg-in-hole assembly
Gai, Yuhang
Zhang, Jiwen
Wu, Dan
Chen, Ken
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[39] Integrated robotic system for high precision assembly in a semi-structured environment
Chen, Heping
Zhang, George
Zhang, Hui
Fuhlbrigge, Thomas A.
ASSEMBLY AUTOMATION, 2007, 27 (03) : 247 - 252
[40] THE CLOSED-LOOP ASSEMBLY MICROPOSITIONER (CLAMP) END EFFECTOR FOR HIGH-PRECISION ROBOTIC ASSEMBLY
DERBY, S
ROBOTS 13: CONFERENCE PROCEEDINGS, 1989, : I39 - I50
