CCAN: Constraint Co-Attention Network for Instance Grasping

Cited by: 0
Authors
Cai, Junhao [1 ]
Tao, Xuefeng [1 ]
Cheng, Hui [1 ]
Zhang, Zhanpeng [2 ]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Sensetime Grp Ltd, Shenzhen, Peoples R China
Keywords
DOI
10.1109/icra40945.2020.9197182
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Instance grasping is a challenging robotic grasping task in which a robot aims to grasp a specified target object in a cluttered scene. In this paper, we propose a novel end-to-end instance grasping method that uses only monocular workspace and query images, where the workspace image contains several objects and the query image contains only the target object. To extract discriminative features effectively and facilitate training, we propose a learning-based method, the Constraint Co-Attention Network (CCAN), which consists of a constraint co-attention module and a grasp affordance predictor. The co-attention module constructs the features of the workspace image from the extracted features of the query image. By introducing soft constraints into the co-attention module, it highlights the target object's features while suppressing those of other objects in the workspace image. Using the features produced by the co-attention module, the cascaded grasp affordance interpreter network predicts a grasp configuration only for the target object. CCAN is trained entirely with simulated self-supervision. Extensive qualitative and quantitative experiments show the effectiveness of our method in both simulated and real-world environments, even for previously unseen objects.
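The idea of constructing workspace features from query features can be illustrated with a generic dot-product co-attention step. The following is a minimal sketch under stated assumptions, not the CCAN implementation: the function names (`softmax`, `dot`, `co_attend`), the dot-product affinity, and the toy 2-D feature vectors are all hypothetical, and the soft constraints mentioned in the abstract are not modeled because the abstract does not specify them.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def co_attend(workspace_feats, query_feats):
    """Rebuild each workspace feature as an attention-weighted sum of query features.

    Assumption: affinity is a plain dot product; a real model would likely
    use learned projections.
    """
    dim = len(query_feats[0])
    rebuilt, all_weights = [], []
    for w in workspace_feats:
        # Attention weights of this workspace location over all query locations.
        weights = softmax([dot(w, q) for q in query_feats])
        rebuilt.append(
            [sum(a * q[d] for a, q in zip(weights, query_feats)) for d in range(dim)]
        )
        all_weights.append(weights)
    return rebuilt, all_weights


# Toy data (hypothetical): two query locations, three workspace locations.
query = [[1.0, 0.0], [0.0, 1.0]]
workspace = [[5.0, 0.0], [0.0, 5.0], [0.0, 0.0]]
rebuilt, weights = co_attend(workspace, query)
```

In this toy setup, a workspace location that resembles one query location attends almost entirely to it, while a location that matches nothing in the query spreads its attention uniformly and so carries no target-specific signal.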
Pages: 8353-8359
Page count: 7
Related papers
50 in total
  • [21] Deep Co-Attention Network for Multi-View Subspace Learning
    Zheng, Lecheng
    Cheng, Yu
    Yang, Hongxia
    Cao, Nan
    He, Jingrui
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1528 - 1539
  • [22] Co-attention dictionary network for weakly-supervised semantic segmentation
    Wan, Weitao
    Chen, Jiansheng
    Yang, Ming-Hsuan
    Ma, Huimin
    NEUROCOMPUTING, 2022, 486 : 272 - 285
  • [23] Deep Modular Co-Attention Shifting Network for Multimodal Sentiment Analysis
    Shi, Piao
    Hu, Min
    Shi, Xuefeng
    Ren, Fuji
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [24] Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation
    Ma, Renfeng
    Qiu, Xipeng
    Zhang, Qi
    Hu, Xiangkun
    Jiang, Yu-Gang
    Huang, Xuanjing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 388 - 400
  • [25] CANet: Co-attention network for RGB-D semantic segmentation
    Zhou, Hao
    Qi, Lu
    Huang, Hai
    Yang, Xu
    Wan, Zhaoliang
    Wen, Xianglong
    PATTERN RECOGNITION, 2022, 124
  • [26] Hierarchical Co-Attention Selection Network for Interpretable Fake News Detection
    Ge, Xiaoyi
    Hao, Shuai
    Li, Yuxiao
    Wei, Bin
    Zhang, Mingshu
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (03)
  • [27] Pyramid Co-Attention Compare Network for Few-Shot Segmentation
    Zhang, Defu
    Luo, Ronghua
    Chen, Xuebin
    Chen, Lingwei
    IEEE ACCESS, 2021, 9 : 137249 - 137259
  • [28] Spatiotemporal-Textual Co-Attention Network for Video Question Answering
    Zha, Zheng-Jun
    Liu, Jiawei
    Yang, Tianhao
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [29] Context-aware Co-Attention Neural Network for Service Recommendations
    Li, Lei
    Dong, Ruihai
    Chen, Li
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 201 - 208
  • [30] Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation
    Feng, Guang
    Hu, Zhiwei
    Zhang, Lihe
    Lu, Huchuan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15501 - 15510