Learning geometric consistency and discrepancy for category-level 6D object pose estimation from point clouds

被引:1
|
作者
Zou, Lu [1 ]
Huang, Zhangjin [1 ,2 ,3 ]
Gu, Naijie [1 ,2 ]
Wang, Guoping [4 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Peoples R China
[2] Anhui Prov Key Lab Software Comp & Commun, Hefei 230027, Peoples R China
[3] USTC, Deqing Alpha Innovat Res Inst, Huzhou 313299, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
6D object pose estimation; 3D object detection; Point cloud processing; Shape recovery;
D O I
10.1016/j.patcog.2023.109896
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Category-level 6D object pose estimation aims to predict the position and orientation of unseen object instances, which is a fundamental problem in robotic applications. Previous works mainly focused on exploiting visual cues from RGB images, while depth images received less attention. However, depth images contain rich geometric attributes about the object's shape, which are crucial for inferring the object's pose. This work achieves category-level 6D object pose estimation by performing sufficient geometric learning from depth images represented by point clouds. Specifically, we present a novel geometric consistency and geometric discrepancy learning framework called CD-Pose to resolve the intra-category variation, inter-category similarity, and objects with complex structures. Our network consists of a Pose-Consistent Module and a Pose-Discrepant Module. First, a simple MLP-based Pose-Consistent Module is utilized to extract geometrically consistent pose features of objects from the pre-computed object shape priors for each category. Then, the Pose Discrepant Module, designed as a multi-scale region-guided transformer network, is dedicated to exploring each instance's geometrically discrepant features. Next, the NOCS model of the object is reconstructed according to the integration of consistent and discrepant geometric representations. Finally, 6D object poses are obtained by solving the similarity transformation between the reconstruction and the observed point cloud. Experiments on the benchmark datasets show that our CD-Pose produces superior results to state-of-the-art competitors.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation
    Li, Guowei
    Zhu, Dongchen
    Zhang, Guanghui
    Shi, Wenjun
    Zhang, Tianyu
    Zhang, Xiaolin
    Li, Jiamao
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5674 - 5683
  • [2] An efficient network for category-level 6D object pose estimation
    Sun, Shantong
    Liu, Rongke
    Sun, Shuqiao
    Yang, Xinxin
    Lu, Guangshan
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
  • [3] CatFormer: Category-Level 6D Object Pose Estimation with Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6808 - 6816
  • [4] RANSAC Optimization for Category-level 6D Object Pose Estimation
    Chen, Ying
    Kang, Guixia
    Wang, Yiping
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 50 - 56
  • [5] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    [J]. Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
  • [6] DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency
    Lin, Jiehong
    Wei, Zewei
    Li, Zhihao
    Xu, Songcen
    Jia, Kui
    Li, Yuanqing
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3540 - 3549
  • [7] Self-Supervised Category-Level 6D Object Pose Estimation With Optical Flow Consistency
    Zaccaria, Michela
    Manhardt, Fabian
    Di, Yan
    Tombari, Federico
    Aleotti, Jacopo
    Giorgini, Mikhail
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05): : 2510 - 2517
  • [8] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
    Wang, He
    Sridhar, Srinath
    Huang, Jingwei
    Valentin, Julien
    Song, Shuran
    Guibas, Leonidas J.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
  • [9] Adversarial imitation learning-based network for category-level 6D object pose estimation
    Sun, Shantong
    Bao, Xu
    Kaushik, Aryan
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (05)
  • [10] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
    Liu, Jierui
    Cao, Zhiqiang
    Tang, Yingbo
    Liu, Xilong
    Tan, Min
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740