VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes

被引:4
|
作者
Lu, Yuhao [1 ]
Fan, Yixuan [1 ]
Deng, Beixing [1 ]
Liu, Fangfu [1 ]
Li, Yali [1 ]
Wang, Shengjin [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
关键词
D O I
10.1109/IROS55552.2023.10341379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robotic grasping faces new challenges in human-robot-interaction scenarios. We consider the task that the robot grasps a target object designated by human's language directives. The robot not only needs to locate a target based on vision-and-language information, but also needs to predict the reasonable grasp pose candidate at various views and postures. In this work, we propose a novel interactive grasp policy, named Visual-Lingual-Grasp (VL-Grasp), to grasp the target specified by human language. First, we build a new challenging visual grounding dataset to provide functional training data for robotic interactive perception in indoor environments. Second, we propose a 6-Dof interactive grasp policy combined with visual grounding and 6-Dof grasp pose detection to extend the universality of interactive grasping. Third, we design a grasp pose filter module to enhance the performance of the policy. Experiments demonstrate the effectiveness and extendibility of the VL-Grasp in real world. The VL-Grasp achieves a success rate of 72.5% in different indoor scenes. The code and dataset is available at https://github.com/luyh20/VL-Grasp.
引用
收藏
页码:976 / 983
页数:8
相关论文
共 13 条
  • [1] Towards Scale Balanced 6-DoF Grasp Detection in Cluttered Scenes
    Ma, Haoxiang
    Huang, Di
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2004 - 2013
  • [2] Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes
    Sundermeyer, Martin
    Mousavian, Arsalan
    Triebel, Rudolph
    Fox, Dieter
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13438 - 13444
  • [3] Efficient Heatmap-Guided 6-Dof Grasp Detection in Cluttered Scenes
    Chen, Siang
    Tang, Wei
    Xie, Pengwei
    Yang, Wenming
    Wang, Guijin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4895 - 4902
  • [4] Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes
    Sundermeyer, Martin
    Mousavian, Arsalan
    Triebel, Rudolph
    Fox, Dieter
    Proceedings - IEEE International Conference on Robotics and Automation, 2021, 2021-May : 3133 - 3139
  • [5] 6-DOF Grasp Detection for Unknown Objects
    Schaub, Henry
    Schoettl, Alfred
    2020 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER INFORMATION TECHNOLOGIES (ACIT), 2020, : 400 - 403
  • [6] GraNet: A Multi-Level Graph Network for 6-DoF Grasp Pose Generation in Cluttered Scenes
    Wang, Haowen
    Niu, Wanhao
    Zhuang, Chungang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 937 - 943
  • [7] High Precision 6-DoF Grasp Detection in Cluttered Scenes Based on Network Optimization and Pose Propagation
    Tang, Wenjun
    Tang, Kai
    Zi, Bin
    Qian, Sen
    Zhang, Dan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4407 - 4414
  • [8] Sim-Grasp: Learning 6-DOF Grasp Policies for Cluttered Environments Using a Synthetic Benchmark
    Li, Juncheng
    Cappelleri, David J.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7645 - 7652
  • [9] Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions
    Cheang, Chilam
    Lin, Haitao
    Fu, Yanwei
    Xue, Xiangyang
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8476 - 8482
  • [10] Learning 6-DoF Task-oriented Grasp Detection via Implicit Estimation and Visual Affordance
    Chen, Wenkai
    Liang, Hongzhuo
    Chen, Zhaopeng
    Sun, Fuchun
    Zhang, Jianwei
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 762 - 769