Real-Time Pixel-Wise Grasp Detection Based on RGB-D Feature Dense Fusion

被引:2
|
作者
Wu, Yongxiang [1 ]
Fu, Yili [1 ]
Wang, Shuguo [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Grasp detection; feature fusion; grasping; deep learning;
D O I
10.1109/ICMA52036.2021.9512605
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a real-time fully convolutional network for detecting grasp pose and confidence of each pixel in RGB-D images. Instead of processing RGB-D data equally, we transform the depth image into point cloud and use a heterogeneous architecture to embed and densely fuse RGB-D information into semantically rich features. To improve the computational efficiency, we propose and integrate a novel point sampling and matching mechanism into the dense fusion. A proposed Uniform Index Sampling (UIS) algorithm is used to sample points uniformly and quickly, and corresponding color and geometry features are matched via a designed Index Image, which is also used for the consistent transformation of RGB-D data. By making full use of RGB-D information effectively, our model achieves a better accuracy of 99.1% on Cornell dataset and 96.4% on Jacquard dataset than current state-of-the-art methods. Moreover, benefiting from the efficient point sampling and matching mechanism, our methods runs at a real-time speed of 8 millisecond per frame. The proposed method is robust for physical grasping and achieves a success rate of 97% on household set, 90% on adversarial set and 91% when grasping in clutter.
引用
收藏
页码:970 / 975
页数:6
相关论文
共 50 条
  • [1] Lightweight Pixel-Wise Generative Robot Grasping Detection Based on RGB-D Dense Fusion
    Tian, Hongkun
    Song, Kechen
    Li, Song
    Ma, Shuai
    Yan, Yunhui
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [2] RGB-D Fusion: Real-time Robust Tracking and Dense Mapping with RGB-D Data Fusion
    Lee, Seong-Oh
    Lim, Hwasup
    Kim, Hyoung-Gon
    Ahn, Sang Chul
    [J]. 2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 2749 - 2754
  • [3] Real-time pixel-wise grasp affordance prediction based on multi-scale context information fusion
    Wu, Yongxiang
    Fu, Yili
    Wang, Shuguo
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (02): : 368 - 381
  • [4] Point-line feature fusion based field real-time RGB-D SLAM
    Li, Qingyu
    Wang, Xin
    Wu, Tian
    Yang, Huijun
    [J]. COMPUTERS & GRAPHICS-UK, 2022, 107 : 10 - 19
  • [5] Real-time large-scale dense RGB-D SLAM with volumetric fusion
    Whelan, Thomas
    Kaess, Michael
    Johannsson, Hordur
    Fallon, Maurice
    Leonard, John J.
    McDonald, John
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2015, 34 (4-5): : 598 - 626
  • [6] Cross Modal Multiscale Fusion Net for Real-time RGB-D Detection
    Yin, Kejie
    Liu, Sheng
    Liu, Ruyu
    Chen, Yibin
    Shen, Kang
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2386 - 2391
  • [7] Real-Time RGB-D based Template Matching Pedestrian Detection
    Jafari, Omid Hosseini
    Yang, Michael Ying
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5520 - 5527
  • [8] Real-time depth enhancement by fusion for RGB-D cameras
    Garcia, Frederic
    Aouada, Djamila
    Solignac, Thomas
    Mirbach, Bruno
    Ottersten, Bjoern
    [J]. IET COMPUTER VISION, 2013, 7 (05) : 335 - 345
  • [9] KINECT-BASED REAL-TIME RGB-D IMAGE FUSION METHOD
    Guo, Wei
    Du, Tangwu
    Zhu, Xinyan
    Hu, Tao
    [J]. XXII ISPRS CONGRESS, TECHNICAL COMMISSION III, 2012, 39-B3 : 275 - 279
  • [10] Real-Time Visual Odometry from Dense RGB-D Images
    Steinbruecker, Frank
    Sturm, Juergen
    Cremers, Daniel
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,