Category-Level Object Pose Estimation with Statistic Attention

Authors
Jiang, Changhong [1]
Mu, Xiaoqiao [2]
Zhang, Bingbing [3]
Liang, Chao [4]
Xie, Mujun [1]
Affiliations
[1] Changchun Univ Technol, Sch Elect & Elect Engn, Changchun 130012, Peoples R China
[2] Changchun Univ Technol, Sch Mech & Elect Engn, Changchun 130012, Peoples R China
[3] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian 116602, Peoples R China
[4] Changchun Univ Technol, College Comp Sci & Engn, Changchun 130012, Peoples R China
Keywords
pose estimation; long-range dependencies; higher-order
DOI
10.3390/s24165347
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Six-dimensional (6D) object pose estimation is a fundamental problem in computer vision. Category-level object pose estimation methods based on 3D graph convolution (3D-GC) have recently made significant progress. However, current methods often fail to capture long-range dependencies, which are crucial for modeling complex and occluded object shapes, and they also need to discern fine-grained differences between objects. Some existing methods use self-attention mechanisms or Transformer encoder-decoder structures to compensate for the missing long-range dependencies, but they rely only on first-order feature information, so they neither exploit more complex statistics nor capture detailed differences between objects. In this paper, we propose SAPENet, which follows the 3D-GC architecture but replaces the 3D-GC in the encoder with the HS-layer for feature extraction and incorporates statistical attention to compute higher-order statistical information. Three sub-modules are designed for pose regression, point cloud reconstruction, and bounding box voting. The pose regression module also integrates statistical attention, using higher-order statistical information to model geometric relationships and aid regression. Experiments show that our method attains an mAP of 49.5 on the 5°2 cm metric, 3.4 higher than the baseline model, and achieves state-of-the-art (SOTA) performance on the REAL275 dataset.
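The 5°2 cm metric mentioned in the abstract counts a predicted pose as correct when its rotation error is at most 5 degrees and its translation error is at most 2 cm. The following is a minimal per-instance sketch of that check, not the paper's evaluation code: the function names (`rotation_error_deg`, `translation_error_cm`, `within_threshold`, `rot_z`) are hypothetical, translations are assumed to be in metres, and the full benchmark mAP additionally aggregates over detections and usually treats symmetric categories specially, which is omitted here.

```python
import numpy as np


def rotation_error_deg(R_pred, R_gt):
    """Geodesic angle in degrees between two 3x3 rotation matrices."""
    cos_theta = (np.trace(R_pred @ R_gt.T) - 1.0) / 2.0
    # Clip to guard against values slightly outside [-1, 1] from round-off.
    return float(np.degrees(np.arccos(np.clip(cos_theta, -1.0, 1.0))))


def translation_error_cm(t_pred, t_gt):
    """Euclidean distance in centimetres (assumption: inputs are in metres)."""
    diff = np.asarray(t_pred, dtype=float) - np.asarray(t_gt, dtype=float)
    return float(np.linalg.norm(diff) * 100.0)


def within_threshold(R_pred, t_pred, R_gt, t_gt, max_deg=5.0, max_cm=2.0):
    """True if both the rotation and the translation error are within bounds."""
    return (rotation_error_deg(R_pred, R_gt) <= max_deg
            and translation_error_cm(t_pred, t_gt) <= max_cm)


def rot_z(deg):
    """Helper for the example: rotation about the z-axis by `deg` degrees."""
    a = np.radians(deg)
    return np.array([[np.cos(a), -np.sin(a), 0.0],
                     [np.sin(a),  np.cos(a), 0.0],
                     [0.0,        0.0,       1.0]])


if __name__ == "__main__":
    R_gt, t_gt = np.eye(3), np.array([0.0, 0.0, 0.5])
    # 3 degrees of rotation error and 1 cm of translation error: accepted.
    print(within_threshold(rot_z(3.0), [0.01, 0.0, 0.5], R_gt, t_gt))
    # 10 degrees of rotation error: rejected even with perfect translation.
    print(within_threshold(rot_z(10.0), t_gt, R_gt, t_gt))
```

A joint threshold like this is stricter than checking rotation or translation alone, which is why small mAP gains on 5°2 cm are usually reported separately from looser settings.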
Pages: 13