Category-Level Object Pose Estimation with Statistic Attention

Authors
Jiang, Changhong [1]
Mu, Xiaoqiao [2]
Zhang, Bingbing [3]
Liang, Chao [4]
Xie, Mujun [1]
Affiliations
[1] Changchun Univ Technol, Sch Elect & Elect Engn, Changchun 130012, Peoples R China
[2] Changchun Univ Technol, Sch Mech & Elect Engn, Changchun 130012, Peoples R China
[3] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian 116602, Peoples R China
[4] Changchun Univ Technol, College Comp Sci & Engn, Changchun 130012, Peoples R China
Keywords
pose estimation; long-range dependencies; higher-order
DOI
10.3390/s24165347
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
Six-dimensional (6D) object pose estimation is a fundamental problem in computer vision. Recently, category-level object pose estimation methods based on 3D graph convolution (3D-GC) have made significant breakthroughs. However, current methods often fail to capture long-range dependencies, which are crucial for modeling complex and occluded object shapes. Discerning detailed differences between objects is also essential. Some existing methods use self-attention mechanisms or Transformer encoder-decoder structures to address the lack of long-range dependencies, but they focus only on first-order feature information, so they fail to explore more complex information and neglect detailed differences between objects. In this paper, we propose SAPENet, which follows the 3D-GC architecture but replaces the 3D-GC in the encoder with an HS-layer to extract features and incorporates statistical attention to compute higher-order statistical information. Three sub-modules are designed for pose regression, point cloud reconstruction, and bounding box voting. The pose regression module also integrates statistical attention, using higher-order statistical information to model geometric relationships and aid regression. Experiments show that our method attains an mAP of 49.5 on the 5°2 cm metric, which is 3.4 higher than the baseline model, and achieves state-of-the-art (SOTA) performance on the REAL275 dataset.
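The 5°2 cm metric quoted in the abstract counts a predicted pose as correct when its rotation error is at most 5 degrees and its translation error is at most 2 cm. The following is a minimal sketch of that per-prediction check, not the authors' evaluation code: the function names, the use of metres for translations, and the helper `rot_z` are assumptions made for illustration, and handling of symmetric object categories and the averaging into mAP are left out.

```python
import math


def rotation_error_deg(R_pred, R_gt):
    """Geodesic angle in degrees between two 3x3 rotation matrices."""
    # trace(R_pred^T R_gt) equals the sum of elementwise products.
    trace = sum(R_pred[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def translation_error_cm(t_pred, t_gt):
    """Euclidean distance in cm; inputs are assumed to be in metres."""
    return 100.0 * math.dist(t_pred, t_gt)


def within_threshold(R_pred, t_pred, R_gt, t_gt, max_deg=5.0, max_cm=2.0):
    """True when both errors fall inside the (hypothetical default) 5 deg, 2 cm bounds."""
    return (rotation_error_deg(R_pred, R_gt) <= max_deg
            and translation_error_cm(t_pred, t_gt) <= max_cm)


def rot_z(deg):
    """Illustrative helper: rotation about the z-axis by `deg` degrees."""
    a = math.radians(deg)
    return [[math.cos(a), -math.sin(a), 0.0],
            [math.sin(a), math.cos(a), 0.0],
            [0.0, 0.0, 1.0]]


if __name__ == "__main__":
    # A prediction that is 3 degrees and 1 cm off from the ground truth.
    print(within_threshold(rot_z(3.0), [0.0, 0.0, 0.01],
                           rot_z(0.0), [0.0, 0.0, 0.0]))
```

Under these assumptions, a prediction passes only if both bounds hold, so a pose with a perfect rotation but a 3 cm offset is still rejected.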
Pages: 13
Related papers
50 in total
  • [31] Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation
    Lin, Yunzhi
    Tremblay, Jonathan
    Tyree, Stephen
    Vela, Patricio A.
    Birchfield, Stan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
  • [32] GPT-COPE: A Graph-Guided Point Transformer for Category-Level Object Pose Estimation
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Guoping
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2385 - 2398
  • [33] RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation
    Zhang, Ruida
    Di, Yan
    Lou, Zhiqiang
    Manhardt, Fabian
    Tombari, Federico
    Ji, Xiangyang
    [J]. COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 655 - 672
  • [34] Robotic Grasp Detection Based on Category-Level Object Pose Estimation With Self-Supervised Learning
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    [J]. IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (01) : 625 - 635
  • [35] PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects
    Wang, Pengyuan
    Jung, HyunJun
    Li, Yitong
    Shen, Siyuan
    Srikanth, Rahul Parthasarathy
    Garattoni, Lorenzo
    Meier, Sven
    Navab, Nassir
    Busam, Benjamin
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 21190 - 21199
  • [36] Category-Level 6-D Object Pose Estimation With Shape Deformation for Robotic Grasp Detection
    Yu, Sheng
    Zhai, Di-Hua
    Guan, Yuyin
    Xia, Yuanqing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [37] Toward Real-World Category-Level Articulation Pose Estimation
    Liu, Liu
    Xue, Han
    Xu, Wenqiang
    Fu, Haoyuan
    Lu, Cewu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1072 - 1083
  • [38] Category-Level 6D Object Pose Recovery in Depth Images
    Sahin, Caner
    Kim, Tae-Kyun
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 665 - 681
  • [39] GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting
    Di, Yan
    Zhang, Ruida
    Lou, Zhiqiang
    Manhardt, Fabian
    Ji, Xiangyang
    Navab, Nassir
    Tombari, Federico
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6771 - 6781
  • [40] CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement
    Liu, Xingyu
    Wang, Gu
    Li, Yi
    Ji, Xiangyang
    [J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 499 - 516