GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes

被引:78
|
作者
Corona, Enric [1 ,2 ]
Pumarola, Albert [1 ]
Alenya, Guillem [1 ]
Moreno-Noguer, Francesc [1 ]
Rogez, Gregory [2 ]
机构
[1] CSIC UPC, Inst Robot & Informat Ind, Barcelona, Spain
[2] NAVER LABS Europe, Meylan, France
关键词
D O I
10.1109/CVPR42600.2020.00508
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rise of deep learning has brought remarkable progress in estimating hand geometry from images where the hands are part of the scene. This paper focuses on a new problem not explored so far, consisting in predicting how a human would grasp one or several objects, given a single RGB image of these objects. This is a problem with enormous potential in e.g. augmented reality, robotics or prosthetic design. In order to predict feasible grasps, we need to understand the semantic content of the image, its geometric structure and all potential interactions with a hand physical model. To this end, we introduce a generative model that jointly reasons in all these levels and 1) regresses the 3D shape and pose of the objects in the scene; 2) estimates the grasp types; and 3) refines the 51-DoF of a 3D hand model that minimize a graspability loss. To train this model we build the YCB-Affordance dataset, that contains more than 133k images of 21 objects in the YCB-Video dataset [69]. We have annotated these images with more than 28M plausible 3D human grasps according to a 33-class taxonomy. A thorough evaluation in synthetic and real images shows that our model can robustly predict realistic grasps, even in cluttered scenes with multiple objects in close contact.
引用
收藏
页码:5030 / 5040
页数:11
相关论文
共 50 条
  • [1] Keypoint-Based Robotic Grasp Detection Scheme in Multi-Object Scenes
    Li, Tong
    Wang, Fei
    Ru, Changlei
    Jiang, Yong
    Li, Jinghong
    [J]. SENSORS, 2021, 21 (06) : 1 - 15
  • [2] A novel vision-based multi-task robotic grasp detection method for multi-object scenes
    Yanan Song
    Liang Gao
    Xinyu Li
    Weiming Shen
    Kunkun Peng
    [J]. Science China Information Sciences, 2022, 65
  • [3] A novel vision-based multi-task robotic grasp detection method for multi-object scenes
    Song, Yanan
    Gao, Liang
    Li, Xinyu
    Shen, Weiming
    Peng, Kunkun
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (12)
  • [4] A novel vision-based multi-task robotic grasp detection method for multi-object scenes
    Yanan SONG
    Liang GAO
    Xinyu LI
    Weiming SHEN
    Kunkun PENG
    [J]. Science China(Information Sciences), 2022, 65 (12) : 157 - 169
  • [5] A Method for Object Recognition and Robot Grasping Detection in Multi-object Scenes
    Zheng, Jiajun
    Zou, Yuanyuan
    Xu, Jie
    Fang, Lingshen
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT III, 2022, 13457 : 189 - 196
  • [6] Multi-Object Grasping - Estimating the Number of Objects in a Robotic Grasp
    Chen, Tianze
    Shenoy, Adheesh
    Kolinko, Anzhelika
    Shah, Syed
    Sun, Yu
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4995 - 5001
  • [7] A Multi-Object Grasp Technique for Placement of Objects in Virtual Reality
    Fernandez, Unai J.
    Elizondo, Sonia
    Iriarte, Naroa
    Morales, Rafael
    Ortiz, Amalia
    Marichal, Sebastian
    Ardaiz, Oscar
    Marzo, Asier
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [8] Multi-object reconstruction from dynamic scenes: An object-centered approach
    Shin, Young Min
    Cho, Minsu
    Lee, Kyoung Mu
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (11) : 1575 - 1588
  • [9] Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
    Heravi, Negin
    Wahid, Ayzaan
    Lynch, Corey
    Florence, Pete
    Armstrong, Travis
    Tompson, Jonathan
    Sermanet, Pierre
    Bohg, Jeannette
    Dwibedi, Debidatta
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9515 - 9522
  • [10] Pure phase correlation applied to multi-object colour scenes
    Turon, F
    ChalasinskaMacukow, K
    Campos, J
    Yzuel, MJ
    [J]. JOURNAL OF OPTICS-NOUVELLE REVUE D OPTIQUE, 1997, 28 (03): : 112 - 117