Attention-based efficient robot grasp detection network

Cited by: 3
Authors
Qin, Xiaofei [1]
Hu, Wenkai [1]
Xiao, Chen [2]
He, Changxiang [2]
Pei, Songwen [1, 3, 4]
Zhang, Xuedian [1, 3, 4, 5]
Affiliations
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Univ Shanghai Sci & Technol, Coll Sci, Shanghai 200093, Peoples R China
[3] Shanghai Key Lab Modern Opt Syst, Shanghai 200093, Peoples R China
[4] Minist Educ, Key Lab Biomed Opt Technol & Devices, Shanghai 200093, Peoples R China
[5] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201210, Peoples R China
Keywords
Robot grasp detection; Attention mechanism; Encoder-decoder; Neural network
DOI
10.1631/FITEE.2200502
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
To balance the inference speed and detection accuracy of a grasp detection algorithm, both of which are important for robot grasping tasks, we propose an encoder-decoder structured pixel-level grasp detection neural network named the attention-based efficient robot grasp detection network (AE-GDN). Three spatial attention modules are introduced in the encoder stages to enhance detailed information, and three channel attention modules are introduced in the decoder stages to extract more semantic information. Several lightweight and efficient DenseBlocks connect the encoder and decoder paths to improve the feature modeling capability of AE-GDN. A high intersection over union (IoU) value between the predicted grasp rectangle and the ground truth does not necessarily indicate a high-quality grasp configuration, and such a grasp may even cause a collision. This is because traditional IoU loss calculation methods treat the center part of the predicted rectangle as having the same importance as the area around the grippers. We design a new IoU loss calculation method based on an hourglass box matching mechanism, which creates a good correspondence between high IoUs and high-quality grasp configurations. AE-GDN achieves accuracies of 98.9% and 96.6% on the Cornell and Jacquard datasets, respectively. The inference speed reaches 43.5 frames per second with only about 1.2 × 10^6 parameters. The proposed AE-GDN has also been deployed on a practical robotic arm grasping system, where it performs grasping well.
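For readers unfamiliar with the rectangle metric the abstract refers to, the following is a minimal sketch of the conventional IoU between two rotated grasp rectangles. It is not the hourglass box loss proposed in the paper, whose definition is not given in this record. The parameterization (center, width, height, angle) and the function names `rect_corners`, `clip`, and `grasp_iou` are assumptions made for illustration only.

```python
import math

def rect_corners(cx, cy, w, h, theta):
    """Corners (counter-clockwise) of a w-by-h rectangle centred at
    (cx, cy) and rotated by theta radians. Hypothetical parameterization."""
    c, s = math.cos(theta), math.sin(theta)
    local = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(cx + x * c - y * s, cy + x * s + y * c) for x, y in local]

def polygon_area(pts):
    """Shoelace formula; returns 0.0 for an empty polygon."""
    return 0.5 * sum(x1 * y2 - x2 * y1
                     for (x1, y1), (x2, y2) in zip(pts, pts[1:] + pts[:1]))

def clip(subject, clipper):
    """Sutherland-Hodgman clipping of `subject` against a convex,
    counter-clockwise `clipper` polygon."""
    output = subject
    for (ax, ay), (bx, by) in zip(clipper, clipper[1:] + clipper[:1]):
        if not output:
            break
        current, output = output, []

        def side(p):
            # > 0 when p lies to the left of the directed edge a -> b (inside)
            return (bx - ax) * (p[1] - ay) - (by - ay) * (p[0] - ax)

        for p, q in zip(current, current[1:] + current[:1]):
            sp, sq = side(p), side(q)
            if sp >= 0:
                output.append(p)
            if (sp > 0 and sq < 0) or (sp < 0 and sq > 0):
                t = sp / (sp - sq)  # where segment p -> q crosses the edge line
                output.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
    return output

def grasp_iou(rect_a, rect_b):
    """Plain IoU of two rectangles given as (cx, cy, w, h, theta) tuples.
    Every overlapping pixel counts equally, wherever it lies."""
    a, b = rect_corners(*rect_a), rect_corners(*rect_b)
    inter = polygon_area(clip(a, b))
    union = polygon_area(a) + polygon_area(b) - inter
    return inter / union if union > 0 else 0.0

if __name__ == "__main__":
    truth = (0.0, 0.0, 4.0, 2.0, 0.0)
    rotated = (0.0, 0.0, 4.0, 2.0, math.radians(20))
    print(f"IoU with a 20-degree rotation: {grasp_iou(truth, rotated):.3f}")
```

Because this plain IoU weights every part of the overlap equally, a prediction can score well through overlap in the middle of the rectangle while its ends, where the gripper fingers would sit, are misplaced; that is the limitation the abstract attributes to traditional IoU loss calculations.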
Pages: 1430-1444
Number of pages: 15