Living Object Grasping Using Two-Stage Graph Reinforcement Learning

Cited by: 6
Authors
Hu, Zhe [1 ,2 ]
Zheng, Yu [2 ]
Pan, Jia [3 ]
Affiliations
[1] City Univ Hong Kong, Dept Biomed Engn, Kowloon Tong, Hong Kong 999077, Peoples R China
[2] Tencent Robot X, Shenzhen 518057, Guangdong, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
Source
IEEE Robotics and Automation Letters
Keywords
Deep learning in grasping and manipulation; dexterous manipulation; grasping; in-hand manipulation; reinforcement learning
DOI
10.1109/LRA.2021.3060636
Chinese Library Classification (CLC)
TP24 [Robotics]
Discipline Codes
080202; 1405
Abstract
Living objects are hard to grasp because they can actively dodge and struggle, writhing or deforming while, or even before, being contacted, and because modeling or predicting their responses to grasping is extremely difficult. This letter presents an algorithm based on reinforcement learning (RL) to tackle this challenging problem. Considering the complexity of living object grasping, we divide the whole task into a pre-grasp stage and an in-hand stage and let the algorithm switch between the two automatically. The pre-grasp stage aims to find a good pose from which the robot hand can approach a living object and perform a grasp; dense reward functions based on the poses of both the hand and the object are proposed to facilitate learning the correct hand actions. Since an object held in hand may struggle to escape, the robot hand needs to adjust its configuration and respond correctly to the object's movement. Hence, the goal of the in-hand stage is to determine appropriate adjustments of the finger configuration so that the robot hand keeps holding the object. At this stage, we treat the robot hand as a graph and use a graph convolutional network (GCN) to determine the hand action. We test our algorithm in both simulation and real experiments, which show its good performance on living object grasping. More results are available on our website: https://sites.google.com/view/graph-rl.
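The abstract names three ingredients: an automatic switch between pre-grasp and in-hand stages, dense pose-based rewards, and a GCN policy over a hand graph. The sketch below shows how those pieces could fit together in a single control loop. It is not the authors' implementation: the hand-graph topology (a palm node with three two-link fingers), the feature and action sizes, the distance-based switching test, the reward forms, and the tiny untrained two-layer GCN are all illustrative assumptions.

```python
# Minimal two-stage grasping loop with a GCN policy over a hand graph.
# All structure below is an assumption for illustration, not the paper's code.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hand graph: palm node 0, three two-link fingers (nodes 1-6).
edges = [(0, 1), (1, 2), (0, 3), (3, 4), (0, 5), (5, 6)]
n_nodes, feat_dim, act_dim = 7, 8, 2       # assumed feature / action sizes

A = np.eye(n_nodes)                         # adjacency with self-loops
for i, j in edges:
    A[i, j] = A[j, i] = 1.0
d = A.sum(axis=1)
A_hat = A / np.sqrt(np.outer(d, d))         # symmetric normalization D^-1/2 A D^-1/2

W1 = rng.normal(size=(feat_dim, 16))        # untrained weights, illustration only
W2 = rng.normal(size=(16, act_dim))

def gcn_policy(node_feats):
    """In-hand stage: per-joint actions from a two-layer GCN over the hand graph."""
    h = np.maximum(A_hat @ node_feats @ W1, 0.0)   # relu(A_hat X W1)
    return A_hat @ h @ W2                          # one action vector per node

def pregrasp_reward(hand_pose, obj_pose):
    """Dense shaping reward (assumed form): penalize hand-object pose error."""
    return -np.linalg.norm(hand_pose - obj_pose)

def in_contact(hand_pose, obj_pose, thresh=0.05):
    """Assumed stage-switching test: switch once the hand is close enough."""
    return np.linalg.norm(hand_pose - obj_pose) < thresh

# Two-stage loop over a toy episode with a moving ("struggling") object.
hand_pose, obj_pose = np.zeros(3), rng.normal(scale=0.2, size=3)
ret = 0.0
for t in range(50):
    if not in_contact(hand_pose, obj_pose):
        # Pre-grasp stage: drive the wrist toward the moving object.
        hand_pose += 0.2 * (obj_pose - hand_pose)
        ret += pregrasp_reward(hand_pose, obj_pose)
    else:
        # In-hand stage: the GCN maps joint/contact features to finger
        # adjustments so the hand keeps hold of the object.
        node_feats = rng.normal(size=(n_nodes, feat_dim))  # placeholder sensing
        joint_deltas = gcn_policy(node_feats)              # shape (n_nodes, act_dim)
        ret += 1.0                                         # assumed holding bonus
    obj_pose += rng.normal(scale=0.01, size=3)             # object keeps struggling

print(f"episode return: {ret:.2f}")
```

One appeal of the graph formulation suggested by the abstract is weight sharing: each joint's action is computed from its own features and those of its neighbors, so the same policy parameters apply to every finger with the same local connectivity.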
Pages: 1950-1957
Page count: 8