Attention-guided RGB-D Fusion Network for Category-level 6D Object Pose Estimation

Cited by: 3
Authors
Wang, Hao [1]
Li, Weiming [1]
Kim, Jiyeon [2]
Wang, Qiang [1]
Affiliations
[1] Samsung Res Ctr, SAIT China Lab, Beijing, Peoples R China
[2] Samsung Adv Inst Technol SAIT, Suwon, South Korea
Keywords
DOI
10.1109/IROS47612.2022.9981242
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This work focuses on estimating the 6D poses and sizes of objects at the category level from a single RGB-D image. How to exploit the complementary RGB and depth features plays an important role in this task, yet remains an open question. Due to large intra-category variations in texture and shape, an object instance seen at test time may have RGB and depth features that differ from those of the training instances, which poses challenges for previous RGB-D fusion methods. To address this problem, we propose an Attention-guided RGB-D Fusion Network (ARF-Net). Our key design is an ARF module that learns to adaptively fuse RGB and depth features with guidance from both structure-aware attention and relation-aware attention. Specifically, the structure-aware attention captures the spatial relationships among object parts, and the relation-aware attention captures the RGB-to-depth correlations between appearance and geometric features. ARF-Net directly establishes canonical correspondences with a compact decoder based on the multi-modal features from the ARF module. Extensive experiments show that our method can effectively fuse RGB features with various popular point cloud encoders and provides consistent performance improvements. In particular, without reconstructing instance 3D models, our method, with its relatively compact architecture, outperforms all state-of-the-art models on the CAMERA25 and REAL275 benchmarks by a large margin.
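The fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' ARF module: it assumes plain scaled dot-product attention with no learned projections, uses self-attention over depth features as a stand-in for structure-aware attention, and uses depth-to-RGB cross-attention as a stand-in for relation-aware attention. The function names, feature sizes, and the final concatenation are all hypothetical choices made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def attention(queries, keys, values):
    """Scaled dot-product attention; returns attended values and weights."""
    scale = np.sqrt(queries.shape[-1])
    weights = softmax(queries @ keys.T / scale, axis=-1)
    return weights @ values, weights


def fuse_rgb_depth(rgb_feat, depth_feat):
    """Hypothetical fusion of per-point RGB and depth features.

    Assumption: both inputs have shape (num_points, channels).
    """
    # Stand-in for structure-aware attention: points attend to each other.
    structure_ctx, _ = attention(depth_feat, depth_feat, depth_feat)
    # Stand-in for relation-aware attention: depth queries attend to RGB.
    relation_ctx, weights = attention(depth_feat, rgb_feat, rgb_feat)
    # Residual on the geometric branch, then concatenate both modalities.
    fused = np.concatenate([depth_feat + structure_ctx, relation_ctx], axis=-1)
    return fused, weights


rng = np.random.default_rng(0)
num_points, channels = 8, 16  # toy sizes, chosen arbitrarily
rgb_feat = rng.standard_normal((num_points, channels))
depth_feat = rng.standard_normal((num_points, channels))

fused, weights = fuse_rgb_depth(rgb_feat, depth_feat)
print(fused.shape)  # (8, 32)
```

Each row of `weights` sums to 1, so every point receives a convex combination of RGB features. A trained network would instead learn how strongly each modality contributes.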
Pages: 10651-10658
Page count: 8
Related papers
50 in total
  • [21] Category-Level 6D Object Pose Estimation via Cascaded Relation and Recurrent Reconstruction Networks
    Wang, Jiaze
    Chen, Kai
    Dou, Qi
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4807 - 4814
  • [22] Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image
    Fan, Zhaoxin
    Song, Zhenbo
    Xu, Jian
    Wang, Zhicheng
    Wu, Kejian
    Liu, Hongyan
    He, Jun
    [J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 220 - 236
  • [23] Refined Prior Guided Category-Level 6D Pose Estimation and Its Application on Robotic Grasping
    Sun, Huimin
    Zhang, Yilin
    Sun, Honglin
    Hashimoto, Kenji
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [24] Texture-less object detection and 6D pose estimation in RGB-D images
    Zhang, Haoruo
    Cao, Qixin
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2017, 95 : 64 - 79
  • [25] Holistic and local patch framework for 6D object pose estimation in RGB-D images
    Zhang, Haoruo
    Cao, Qixin
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 180 : 59 - 73
  • [26] 6D Gripper Pose Estimation from RGB-D Image
    Tang, Qirong
    Hu, Xue
    Chu, Zhugang
    Wu, Shun
    [J]. COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 120 - 125
  • [27] 6D Object Pose Estimation With Color/Geometry Attention Fusion
    Yuan, Honglin
    Veltkamp, Remco C.
    [J]. 16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 529 - 535
  • [28] Self-Supervised Category-Level 6D Object Pose Estimation with Deep Implicit Shape Representation
    Peng, Wanli
    Yan, Jianhang
    Wen, Hongtao
    Sun, Yi
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2082 - 2090
  • [29] Category-Level Object Pose Estimation with Statistic Attention
    Jiang, Changhong
    Mu, Xiaoqiao
    Zhang, Bingbing
    Liang, Chao
    Xie, Mujun
    [J]. SENSORS, 2024, 24 (16)
  • [30] Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    [J]. PATTERN RECOGNITION, 2024, 145