SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation

被引:1
|
作者
Li, Guowei [1 ,2 ]
Zhu, Dongchen [1 ,2 ]
Zhang, Guanghui [1 ]
Shi, Wenjun [1 ]
Zhang, Tianyu [1 ,2 ]
Zhang, Xiaolin [1 ,2 ,3 ,4 ,5 ]
Li, Jiamao [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, State Key Lab Transducer Technol, Bion Vis Syst Lab, Shanghai 200050, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Xiongan Inst Innovat, Xiongan 071700, Peoples R China
[4] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[5] ShanghaiTech Univ, Shanghai 201210, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/WACV56688.2023.00564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation aims to predict the full pose and size information for previously unseen instances from known categories, which is an essential portion of robot grasping and augmented reality. However, the core challenge of this task still is the enormous shape variation within each category. With regard to the challenge, we propose a novel framework SD-Pose, which utilizes the instance-category structural discrepancy and the potential geometric-semantic association to enhance the exploration of the intra-class shape information. Specifically, an information exchange augmentation (IEA) module is introduced to supplement the instance-category structural information by their structural discrepancy, thus facilitating the enhanced geometric information to contain both the character of instance shape and the commonality of category structure. For complementing the deficiencies of structural information adaptively, a semantic dynamic fusion (SDF) module is further designed to fuse semantic and geometric features. Finally, the proposed SD-Pose framework equipped with the IEA and SDF modules hierarchically supplements instance-category structural information in a stacked manner and achieves state-of-the-art performance on the CAMERA25 and REAL275 datasets.
引用
收藏
页码:5674 / 5683
页数:10
相关论文
共 50 条
  • [1] An efficient network for category-level 6D object pose estimation
    Sun, Shantong
    Liu, Rongke
    Sun, Shuqiao
    Yang, Xinxin
    Lu, Guangshan
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
  • [2] RANSAC Optimization for Category-level 6D Object Pose Estimation
    Chen, Ying
    Kang, Guixia
    Wang, Yiping
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 50 - 56
  • [3] CatFormer: Category-Level 6D Object Pose Estimation with Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6808 - 6816
  • [4] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    [J]. Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
  • [5] SD-Pose: Semantic Decomposition for Cross-Domain 6D Object Pose Estimation
    Li, Zhigang
    Hu, Yinlin
    Salzmann, Mathieu
    Ji, Xiangyang
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2020 - 2028
  • [6] Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    [J]. PATTERN RECOGNITION, 2024, 145
  • [7] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
    Wang, He
    Sridhar, Srinath
    Huang, Jingwei
    Valentin, Julien
    Song, Shuran
    Guibas, Leonidas J.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
  • [8] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
    Liu, Jierui
    Cao, Zhiqiang
    Tang, Yingbo
    Liu, Xilong
    Tan, Min
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740
  • [9] Category-Level 6D Object Pose Recovery in Depth Images
    Sahin, Caner
    Kim, Tae-Kyun
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 665 - 681
  • [10] GSNet: Model Reconstruction Network for Category-level 6D Object Pose and Size Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2898 - 2904