OP-Align: Object-Level and Part-Level Alignment for Self-supervised Category-Level Articulated Object Pose Estimation

Cited by: 0
Authors
Che, Yuchen [1]
Furukawa, Ryo [2]
Kanezaki, Asako [1]
Affiliations
[1] Tokyo Inst Technol, Tokyo, Japan
[2] Accenture Japan Ltd, Tokyo, Japan
Keywords
6DOF object pose estimation; Dataset creation; Unsupervised learning
DOI
10.1007/978-3-031-73226-3_5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Category-level articulated object pose estimation focuses on estimating the pose of unknown articulated objects within known categories. Despite its significance, this task remains challenging due to the varying shapes and poses of objects, expensive dataset annotation costs, and complex real-world environments. In this paper, we propose a novel self-supervised approach that leverages a single-frame point cloud to solve this task. Our model consistently generates a reconstruction with a canonical pose and joint state for the entire input object. It estimates object-level poses that reduce overall pose variance, and part-level poses that align each part of the input with its corresponding part of the reconstruction. Experimental results demonstrate that our approach significantly outperforms previous self-supervised methods and is comparable to the state-of-the-art supervised methods. To assess the performance of our model in real-world scenarios, we also introduce a new real-world articulated object benchmark dataset. Code and dataset are released at https://github.com/YC-Che/OP-Align.
Pages: 72-88
Page count: 17
Related papers
50 in total
  • [21] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
  • [22] GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
    Zheng, Linfang
    Tse, Tze Ho Elden
    Wang, Chen
    Sun, Yinghan
    Chen, Hua
    Leonardis, Ales
    Zhang, Wei
    Chang, Hyung Jin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10693 - 10703
  • [23] GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence
    Wang, Pengyuan
    Ikeda, Takuya
    Lee, Robert
    Nishiwaki, Koichi
    COMPUTER VISION - ECCV 2024, PT XXVII, 2025, 15085 : 108 - 126
  • [24] HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation
    Zheng, Linfang
    Wang, Chen
    Sun, Yinghan
    Dasgupta, Esha
    Chen, Hua
    Leonardis, Ales
    Zhang, Wei
    Chang, Hyung Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17163 - 17173
  • [25] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
    Wang, He
    Sridhar, Srinath
    Huang, Jingwei
    Valentin, Julien
    Song, Shuran
    Guibas, Leonidas J.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
  • [26] Optimal Pose and Shape Estimation for Category-level 3D Object Perception
    Shi, Jingnan
    Yang, Heng
    Carlone, Luca
    ROBOTICS: SCIENCE AND SYSTEMS XVII, 2021
  • [27] UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose Estimation
    Lee, Taeyeop
    Lee, Byeong-Uk
    Shin, Inkyu
    Choe, Jaesung
    Shin, Ukcheol
    Kweon, In So
    Yoon, Kuk-Jin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14871 - 14880
  • [28] Fully Convolutional Geometric Features for Category-level Object Alignment
    Feng, Qiaojun
    Atanasov, Nikolay
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8492 - 8498
  • [29] TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation
    Lee, Taeyeop
    Tremblay, Jonathan
    Blukis, Valts
    Wen, Bowen
    Lee, Byeong-Uk
    Shin, Inkyu
    Birchfield, Stan
    Kweon, In So
    Yoon, Kuk-Jin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21285 - 21295
  • [30] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
    Liu, Jierui
    Cao, Zhiqiang
    Tang, Yingbo
    Liu, Xilong
    Tan, Min
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740