Bi-directional attention based RGB-D fusion for category-level object pose and shape estimation

Cited by: 0
Authors
Tang, Kaifeng [1, 2, 3]
Xu, Chi [1, 2, 3]
Chen, Ming [1, 2, 3]
Affiliations
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Hubei Key Lab Adv Control & Intelligent Automat Co, Wuhan, Hubei, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan, Hubei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object pose estimation; Object shape estimation; Attention; RGB-D image; Robotic vision;
DOI
10.1007/s11042-023-17626-6
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
RGB-D images contain color and geometric information, which are complementary for object pose and shape estimation. For instance-level objects, a dense-fusion scheme is normally used to fuse the features extracted from the RGB-D channels for pose estimation. For category-level objects, however, the effectiveness of dense-fusion features is reduced by significant intra-class variations in color and geometry. To address this problem, we propose AttentionFusion, a bi-directional attention-based RGB-D fusion framework for category-level object pose and shape estimation. In this framework, a bi-directional cross-attention mechanism explores the complex contextual relationship between the color and geometric features on a global scale for feature fusion. Based on the fused feature, the 6D pose of the category-level object instance is refined iteratively, and the object shape is also estimated precisely. Experimental results show that the proposed method achieves state-of-the-art performance for object pose and shape estimation on the REAL275 dataset.
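
The abstract describes fusing color and geometric features with cross-attention applied in both directions. The following is a minimal NumPy sketch of that general idea, not the authors' AttentionFusion implementation: the function names (`cross_attention`, `bidirectional_fusion`, `init_params`), the single-head scaled dot-product form, the residual connections, the final concatenation, and the random weights are all assumptions made for illustration. It also assumes both modalities provide one feature vector per sampled point.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, context, w_q, w_k, w_v):
    # Single-head scaled dot-product attention: each query row attends
    # to every context row, i.e. over the whole object (global scale).
    q = queries @ w_q                      # (N, d)
    k = context @ w_k                      # (M, d)
    v = context @ w_v                      # (M, d)
    weights = softmax(q @ k.T / np.sqrt(q.shape[-1]), axis=-1)  # (N, M)
    return weights @ v, weights

def bidirectional_fusion(rgb_feat, geo_feat, params):
    # Direction 1: color features query the geometric features.
    rgb_att, _ = cross_attention(rgb_feat, geo_feat, *params["rgb_from_geo"])
    # Direction 2: geometric features query the color features.
    geo_att, _ = cross_attention(geo_feat, rgb_feat, *params["geo_from_rgb"])
    # Hypothetical fusion step: residual add, then per-point concatenation.
    return np.concatenate([rgb_feat + rgb_att, geo_feat + geo_att], axis=-1)

def init_params(rng, d):
    # Random (d, d) projection matrices stand in for learned weights.
    return {
        name: tuple(rng.normal(scale=d ** -0.5, size=(d, d)) for _ in range(3))
        for name in ("rgb_from_geo", "geo_from_rgb")
    }

rng = np.random.default_rng(0)
num_points, dim = 8, 16                    # illustrative sizes only
rgb_feat = rng.normal(size=(num_points, dim))
geo_feat = rng.normal(size=(num_points, dim))
params = init_params(rng, dim)

fused = bidirectional_fusion(rgb_feat, geo_feat, params)
_, attn = cross_attention(rgb_feat, geo_feat, *params["rgb_from_geo"])
print(fused.shape)
```

In this sketch each point ends up with a fused vector of length `2 * dim`. A downstream pose or shape head would consume `fused`; how the paper does this is not specified in the abstract.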
Pages: 53043-53063
Number of pages: 21