End-to-End Implicit Object Pose Estimation

被引：0

作者：

Cao, Chen ^{[1
]}

Yu, Baocheng ^{[1
]}

Xu, Wenxia ^{[1
]}

Chen, Guojun ^{[1
]}

Ai, Yuming ^{[1
]}

机构：

[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan 430073, Peoples R China

来源：

SENSORS | 2024年 / 24卷 / 17期

关键词：

deep learning for visual perception; pose estimation; implicit representation; REALITY;

D O I：

10.3390/s24175721

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

To accurately estimate the 6D pose of objects, most methods employ a two-stage algorithm. While such two-stage algorithms achieve high accuracy, they are often slow. Additionally, many approaches utilize encoding-decoding to obtain the 6D pose, with many employing bilinear sampling for decoding. However, bilinear sampling tends to sacrifice the accuracy of precise features. In our research, we propose a novel solution that utilizes implicit representation as a bridge between discrete feature maps and continuous feature maps. We represent the feature map as a coordinate field, where each coordinate pair corresponds to a feature value. These feature values are then used to estimate feature maps of arbitrary scales, replacing upsampling for decoding. We apply the proposed implicit module to a bidirectional fusion feature pyramid network. Based on this implicit module, we propose three network branches: a class estimation branch, a bounding box estimation branch, and the final pose estimation branch. For this pose estimation branch, we propose a miniature dual-stream network, which estimates object surface features and complements the relationship between 2D and 3D. We represent the rotation component using the SVD (Singular Value Decomposition) representation method, resulting in a more accurate object pose. We achieved satisfactory experimental results on the widely used 6D pose estimation benchmark dataset Linemod. This innovative approach provides a more convenient solution for 6D object pose estimation.

引用

页数：14

共 50 条

[1] COPE: End-to-end trainable Constant Runtime Object Pose Estimation
Thalhammer, Stefan
Patten, Timothy
Vincze, Markus
[J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2859 - 2869
[2] FusionNet: An End-to-End Hybrid Model for 6D Object Pose Estimation
Ye, Yuning
Park, Hanhoon
[J]. ELECTRONICS, 2023, 12 (19)
[3] End-to-End 6-DoF Object Pose Estimation Through Differentiable Rasterization
Palazzi, Andrea
Bergamini, Luca
Calderara, Simone
Cucchiara, Rita
[J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 702 - 715
[4] AN END-TO-END FRAMEWORK FOR POSE ESTIMATION OF OCCLUDED PEDESTRIANS
Das, Sudip
Kishore, Perla Sai Raj
Bhattacharya, Ujjwal
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1446 - 1450
[5] End-to-End Detection and Pose Estimation of Two Interacting Hands
Kim, Dong Uk
Kim, Kwang In
Baek, Seungryul
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11169 - 11178
[6] End-to-End Multi-Person Pose Estimation with Transformers
Shi, Dahu
Wei, Xing
Li, Liangqi
Ren, Ye
Tan, Wenming
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11059 - 11068
[7] RPNet: An End-to-End Network for Relative Camera Pose Estimation
En, Sovann
Lechervy, Alexis
Jurie, Frederic
[J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 738 - 745
[8] Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation
Liu, Huan
Chen, Qiang
Tan, Zichang
Liu, Jiang-Jiang
Wang, Jian
Su, Xiangbo
Li, Xiaolong
Yao, Kun
Han, Junyu
Ding, Errui
Zhao, Yao
Wang, Jingdong
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14983 - 14992
[9] REDE: End-to-End Object 6D Pose Robust Estimation Using Differentiable Outliers Elimination
Hua, Weitong
Zhou, Zhongxiang
Wu, Jun
Huang, Huang
Wang, Yue
Xiong, Rong
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2886 - 2893
[10] End-to-end 3D Human Pose Estimation with Transformer
Zhang, Bowei
Cui, Peng
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4529 - 4536

← 1 2 3 4 5 →