Living Object Grasping Using Two-Stage Graph Reinforcement Learning

Cited by: 6
Authors
Hu, Zhe [1 ,2 ]
Zheng, Yu [2 ]
Pan, Jia [3 ]
Affiliations
[1] City Univ Hong Kong, Dept Biomed Engn, Kowloon Tong, Hong Kong 999077, Peoples R China
[2] Tencent Robot X, Shenzhen 518057, Guangdong, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
Source
IEEE Robotics and Automation Letters
Keywords
Deep learning in grasping and manipulation; dexterous manipulation; grasping; in-hand manipulation; reinforcement learning
DOI
10.1109/LRA.2021.3060636
Chinese Library Classification (CLC)
TP24 [Robotics]
Discipline Codes
080202; 1405
Abstract
Living objects are hard to grasp because they can actively dodge and struggle, writhing or deforming while, or even before, being contacted, and because modeling or predicting their responses to grasping is extremely difficult. This letter presents an algorithm based on reinforcement learning (RL) to tackle this challenging problem. Considering the complexity of living object grasping, we divide the whole task into a pre-grasp stage and an in-hand stage and let the algorithm switch between the two automatically. The pre-grasp stage aims to find a good pose from which the robot hand can approach a living object and perform a grasp; dense reward functions based on the poses of both the hand and the object are proposed to facilitate learning the correct hand actions. Since an object held in hand may struggle to escape, the robot hand needs to adjust its configuration and respond correctly to the object's movement. Hence, the goal of the in-hand stage is to determine appropriate adjustments of the finger configuration so that the robot hand keeps holding the object. At this stage, we treat the robot hand as a graph and use a graph convolutional network (GCN) to determine the hand action. We test our algorithm in both simulation and real experiments, which show its good performance on living object grasping. More results are available on our website: https://sites.google.com/view/graph-rl.
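The abstract names three ingredients: an automatic switch between pre-grasp and in-hand stages, dense pose-based rewards, and a GCN policy over a hand graph. The sketch below shows how those pieces could fit together in a single control loop. It is not the authors' implementation: the hand-graph topology (a palm node with three two-link fingers), the feature and action sizes, the distance-based switching test, the reward forms, and the tiny untrained two-layer GCN are all illustrative assumptions.

```python
# Minimal two-stage grasping loop with a GCN policy over a hand graph.
# All structure below is an assumption for illustration, not the paper's code.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hand graph: palm node 0, three two-link fingers (nodes 1-6).
edges = [(0, 1), (1, 2), (0, 3), (3, 4), (0, 5), (5, 6)]
n_nodes, feat_dim, act_dim = 7, 8, 2       # assumed feature / action sizes

A = np.eye(n_nodes)                         # adjacency with self-loops
for i, j in edges:
    A[i, j] = A[j, i] = 1.0
d = A.sum(axis=1)
A_hat = A / np.sqrt(np.outer(d, d))         # symmetric normalization D^-1/2 A D^-1/2

W1 = rng.normal(size=(feat_dim, 16))        # untrained weights, illustration only
W2 = rng.normal(size=(16, act_dim))

def gcn_policy(node_feats):
    """In-hand stage: per-joint actions from a two-layer GCN over the hand graph."""
    h = np.maximum(A_hat @ node_feats @ W1, 0.0)   # relu(A_hat X W1)
    return A_hat @ h @ W2                          # one action vector per node

def pregrasp_reward(hand_pose, obj_pose):
    """Dense shaping reward (assumed form): penalize hand-object pose error."""
    return -np.linalg.norm(hand_pose - obj_pose)

def in_contact(hand_pose, obj_pose, thresh=0.05):
    """Assumed stage-switching test: switch once the hand is close enough."""
    return np.linalg.norm(hand_pose - obj_pose) < thresh

# Two-stage loop over a toy episode with a moving ("struggling") object.
hand_pose, obj_pose = np.zeros(3), rng.normal(scale=0.2, size=3)
ret = 0.0
for t in range(50):
    if not in_contact(hand_pose, obj_pose):
        # Pre-grasp stage: drive the wrist toward the moving object.
        hand_pose += 0.2 * (obj_pose - hand_pose)
        ret += pregrasp_reward(hand_pose, obj_pose)
    else:
        # In-hand stage: the GCN maps joint/contact features to finger
        # adjustments so the hand keeps hold of the object.
        node_feats = rng.normal(size=(n_nodes, feat_dim))  # placeholder sensing
        joint_deltas = gcn_policy(node_feats)              # shape (n_nodes, act_dim)
        ret += 1.0                                         # assumed holding bonus
    obj_pose += rng.normal(scale=0.01, size=3)             # object keeps struggling

print(f"episode return: {ret:.2f}")
```

One appeal of the graph formulation suggested by the abstract is weight sharing: each joint's action is computed from its own features and those of its neighbors, so the same policy parameters apply to every finger with the same local connectivity.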
Pages: 1950-1957
Page count: 8