Real-time pixel-wise grasp affordance prediction based on multi-scale context information fusion

被引:4
|
作者
Wu, Yongxiang [1 ]
Fu, Yili [1 ]
Wang, Shuguo [1 ]
机构
[1] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin, Peoples R China
关键词
Grasp detection; Multi-scale feature fusion; Pixel-wise prediction; Robotic grasping;
D O I
10.1108/IR-06-2021-0118
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose This paper aims to use fully convolutional network (FCN) to predict pixel-wise antipodal grasp affordances for unknown objects and improve the grasp detection performance through multi-scale feature fusion. Design/methodology/approach A modified FCN network is used as the backbone to extract pixel-wise features from the input image, which are further fused with multi-scale context information gathered by a three-level pyramid pooling module to make more robust predictions. Based on the proposed unify feature embedding framework, two head networks are designed to implement different grasp rotation prediction strategies (regression and classification), and their performances are evaluated and compared with a defined point metric. The regression network is further extended to predict the grasp rectangles for comparisons with previous methods and real-world robotic grasping of unknown objects. Findings The ablation study of the pyramid pooling module shows that the multi-scale information fusion significantly improves the model performance. The regression approach outperforms the classification approach based on same feature embedding framework on two data sets. The regression network achieves a state-of-the-art accuracy (up to 98.9%) and speed (4 ms per image) and high success rate (97% for household objects, 94.4% for adversarial objects and 95.3% for objects in clutter) in the unknown object grasping experiment. Originality/value A novel pixel-wise grasp affordance prediction network based on multi-scale feature fusion is proposed to improve the grasp detection performance. Two prediction approaches are formulated and compared based on the proposed framework. The proposed method achieves excellent performances on three benchmark data sets and real-world robotic grasping experiment.
引用
收藏
页码:368 / 381
页数:14
相关论文
共 50 条
  • [1] Real-Time Pixel-Wise Grasp Detection Based on RGB-D Feature Dense Fusion
    Wu, Yongxiang
    Fu, Yili
    Wang, Shuguo
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2021), 2021, : 970 - 975
  • [2] Real-Time Robotic Grasp Detection with Multi-Scale Feature Fusion
    Ma, Hao
    Yuan, Ding
    Cao, Zhe
    Yin, Jihao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE-RCAR 2020), 2020, : 140 - 145
  • [3] Efficient multi-stage network with pixel-wise degradation prediction for real-time motion deblurring
    Hao, Zeyu
    Wang, Hang
    Zhang, Xuchong
    Li, Yuhai
    Sun, Hongbin
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 233
  • [4] A new multi-scale CNN with pixel-wise attention for image denoising
    Deng, Jibin
    Hu, Chaohua
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 2733 - 2741
  • [5] A new multi-scale CNN with pixel-wise attention for image denoising
    Jibin Deng
    Chaohua Hu
    [J]. Signal, Image and Video Processing, 2024, 18 : 2733 - 2741
  • [6] A Pixel-Wise Foreign Object Debris Detection Method Based on Multi-Scale Feature Inpainting
    Jing, Ying
    Zheng, Hong
    Zheng, Wentao
    Dong, Kaihan
    [J]. AEROSPACE, 2022, 9 (09)
  • [7] Robust Real-Time Visual Tracking Using Pixel-Wise Posteriors
    Bibby, Charles
    Reid, Ian
    [J]. COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 831 - 844
  • [8] MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation
    Gao, Guangwei
    Xu, Guoan
    Yu, Yi
    Xie, Jin
    Yang, Jian
    Yue, Dong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 25489 - 25499
  • [9] DDocE: Deep Document Enhancement with Multi-scale Feature Aggregation and Pixel-Wise Adjustments
    Bogdan, Karina O. M.
    Megeto, Guilherme A. S.
    Leal, Rovilson
    Souza, Gustavo
    Valente, Augusto C.
    Kirsten, Lucas N.
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 229 - 244
  • [10] Pixel-wise real-time advanced calibration method for thermal infrared cameras
    Tremblay, Pierre
    Belhumeur, Louis
    Chamberland, Martin
    Villemaire, Andre
    Dubois, Patrick
    Marcotte, Frederic
    Belzile, Charles
    Farley, Vincent
    Lagueux, Philippe
    [J]. INFRARED IMAGING SYSTEMS: DESIGN, ANALYSIS, MODELING, AND TESTING XXI, 2010, 7662