Attention-based efficient robot grasp detection network

被引:3
|
作者
Qin, Xiaofei [1 ]
Hu, Wenkai [1 ]
Xiao, Chen [2 ]
He, Changxiang [2 ]
Pei, Songwen [1 ,3 ,4 ]
Zhang, Xuedian [1 ,3 ,4 ,5 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
[2] Univ Shanghai Sci & Technol, Coll Sci, Shanghai 200093, Peoples R China
[3] Shanghai Key Lab Modern Opt Syst, Shanghai 200093, Peoples R China
[4] Minist Educ, Key Lab Biomed Opt Technol & Devices, Shanghai 200093, Peoples R China
[5] Tongji Univ, Shanghai Inst Intelligent Sci & Technol, Shanghai 201210, Peoples R China
关键词
Robot grasp detection; Attention mechanism; Encoder-decoder; Neural network;
D O I
10.1631/FITEE.2200502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To balance the inference speed and detection accuracy of a grasp detection algorithm, which are both important for robot grasping tasks, we propose an encoder-decoder structured pixel-level grasp detection neural network named the attention-based efficient robot grasp detection network (AE-GDN). Three spatial attention modules are introduced in the encoder stages to enhance the detailed information, and three channel attention modules are introduced in the decoder stages to extract more semantic information. Several lightweight and efficient DenseBlocks are used to connect the encoder and decoder paths to improve the feature modeling capability of AE-GDN. A high intersection over union (IoU) value between the predicted grasp rectangle and the ground truth does not necessarily mean a high-quality grasp configuration, but might cause a collision. This is because traditional IoU loss calculation methods treat the center part of the predicted rectangle as having the same importance as the area around the grippers. We design a new IoU loss calculation method based on an hourglass box matching mechanism, which will create good correspondence between high IoUs and high-quality grasp configurations. AEGDN achieves the accuracy of 98.9% and 96.6% on the Cornell and Jacquard datasets, respectively. The inference speed reaches 43.5 frames per second with only about 1.2 x 10(6) parameters. The proposed AE-GDN has also been deployed on a practical robotic arm grasping system and performs grasping well.
引用
收藏
页码:1430 / 1444
页数:15
相关论文
共 50 条
  • [1] Attention-Based Grasp Detection With Monocular Depth Estimation
    Xuan Tan, Phan
    Hoang, Dinh-Cuong
    Nguyen, Anh-Nhat
    Nguyen, Van-Thiep
    Vu, Van-Duc
    Nguyen, Thu-Uyen
    Hoang, Ngoc-Anh
    Phan, Khanh-Toan
    Tran, Duc-Thanh
    Vu, Duy-Quang
    Ngo, Phuc-Quan
    Duong, Quang-Tri
    Ho, Ngoc-Trung
    Tran, Cong-Trinh
    Duong, Van-Hiep
    Mai, Anh-Truong
    IEEE ACCESS, 2024, 12 : 65041 - 65057
  • [2] Attention-based efficient robot grasp detection network基于注意力的高效机器人抓取检测网络
    Xiaofei Qin
    Wenkai Hu
    Chen Xiao
    Changxiang He
    Songwen Pei
    Xuedian Zhang
    Frontiers of Information Technology & Electronic Engineering, 2023, 24 : 1430 - 1444
  • [3] Graph Attention-Based Robotic Policy for Efficient Robot Learning
    Zhang, Fengyi
    Bai, Yang
    PROCEEDINGS OF 2024 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL II, CISC 2024, 2024, 1284 : 449 - 459
  • [4] Attention-based robot control
    Kasderidis, S
    Taylor, JG
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2003, 2774 : 615 - 621
  • [5] Efficient attention-based networks for fire and smoke detection
    Xiao, Bowei
    Yan, Chunman
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [6] An Attention-Based Network for Textured Surface Anomaly Detection
    Liu, Gaokai
    Yang, Ning
    Guo, Lei
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [7] Attention-based Weighted Fusion Network for Object Detection
    Yu, Ruixing
    Wang, Chuyin
    Tang, Yifei
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2024, 68 (06) : 1 - 18
  • [8] Attention-based Deep Learning for Network Intrusion Detection
    Guo, Naiwang
    Tian, Yingjie
    Li, Fan
    Yang, Hongshan
    2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [9] Attention-based Neural Network for Traffic Sign Detection
    Zhang, Jing
    Hui, Le
    Lu, Jianfeng
    Zhu, Yuhua
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1839 - 1844
  • [10] SGSIN: Simultaneous Grasp and Suction Inference Network via Attention-Based Affordance Learning
    Wang, Wenshuo
    Zhu, Haiyue
    Ang Jr, Marcelo H.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024,