OP-Align: Object-Level and Part-Level Alignment for Self-supervised Category-Level Articulated Object Pose Estimation

被引：0

作者：

Che, Yuchen ^{[1
]}

Furukawa, Ryo ^{[2
]}

Kanezaki, Asako ^{[1
]}

机构：

[1] Tokyo Inst Technol, Tokyo, Japan

[2] Accenture Japan Ltd, Tokyo, Japan

来源：

COMPUTER VISION - ECCV 2024, PT LXXV | 2025年 / 15133卷

关键词：

6DOF object pose estimation; Dataset creation; Unsupervised learning;

D O I：

10.1007/978-3-031-73226-3_5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Category-level articulated object pose estimation focuses on the pose estimation of unknown articulated objects within known categories. Despite its significance, this task remains challenging due to the varying shapes and poses of objects, expensive dataset annotation costs, and complex real-world environments. In this paper, we propose a novel self-supervised approach that leverages a single-frame point cloud to solve this task. Our model consistently generates reconstruction with a canonical pose and joint state for the entire input object, and it estimates object-level poses that reduce overall pose variance and part-level poses that align each part of the input with its corresponding part of the reconstruction. Experimental results demonstrate that our approach significantly outperforms previous self-supervised methods and is comparable to the state-of-the-art supervised methods. To assess the performance of our model in real-world scenarios, we also introduce a new real-world articulated object benchmark dataset (Code and dataset are released at https://github.com/YC-Che/OP-Align.).

引用

页码：72 / 88

页数：17

共 50 条

[21] An efficient network for category-level 6D object pose estimation
Shantong Sun
Rongke Liu
Shuqiao Sun
Xinxin Yang
Guangshan Lu
Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
[22] GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
Zheng, Linfang
Tse, Tze Ho Elden
Wang, Chen
Sun, Yinghan
Chen, Hua
Leonardis, Ales
Zhang, Wei
Chang, Hyung Jin
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10693 - 10703
[23] GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence
Wang, Pengyuan
Ikeda, Takuya
Lee, Robert
Nishiwaki, Koichi
COMPUTER VISION - ECCV 2024, PT XXVII, 2025, 15085 : 108 - 126
[24] HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation
Zheng, Linfang
Wang, Chen
Sun, Yinghan
Dasgupta, Esha
Chen, Hua
Leonardis, Ales
Zhang, Wei
Chang, Hyung Jin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17163 - 17173
[25] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
Wang, He
Sridhar, Srinath
Huang, Jingwei
Valentin, Julien
Song, Shuran
Guibas, Leonidas J.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
[26] Optimal Pose and Shape Estimation for Category-level 3D Object Perception
Shi, Jingnan
Yang, Heng
Carlone, Luca
ROBOTICS: SCIENCE AND SYSTEM XVII, 2021,
[27] UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose Estimation
Lee, Taeyeop
Lee, Byeong-Uk
Shin, Inkyu
Choe, Jaesung
Shin, Ukcheol
Kweon, In So
Yoon, Kuk-Jin
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14871 - 14880
[28] Fully Convolutional Geometric Features for Category-level Object Alignment
Feng, Qiaojun
Atanasov, Nikolay
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 8492 - 8498
[29] TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation
Lee, Taeyeop
Tremblay, Jonathan
Blukis, Valts
Wen, Bowen
Lee, Byeong-Uk
Shin, Inkyu
Birchfield, Stan
Kweon, In So
Yoon, Kuk-Jin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21285 - 21295
[30] Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention
Liu, Jierui
Cao, Zhiqiang
Tang, Yingbo
Liu, Xilong
Tan, Min
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6728 - 6740

← 1 2 3 4 5 →