End-to-End Implicit Object Pose Estimation

被引:0
|
作者
Cao, Chen [1 ]
Yu, Baocheng [1 ]
Xu, Wenxia [1 ]
Chen, Guojun [1 ]
Ai, Yuming [1 ]
机构
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan 430073, Peoples R China
关键词
deep learning for visual perception; pose estimation; implicit representation; REALITY;
D O I
10.3390/s24175721
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
To accurately estimate the 6D pose of objects, most methods employ a two-stage algorithm. While such two-stage algorithms achieve high accuracy, they are often slow. Additionally, many approaches utilize encoding-decoding to obtain the 6D pose, with many employing bilinear sampling for decoding. However, bilinear sampling tends to sacrifice the accuracy of precise features. In our research, we propose a novel solution that utilizes implicit representation as a bridge between discrete feature maps and continuous feature maps. We represent the feature map as a coordinate field, where each coordinate pair corresponds to a feature value. These feature values are then used to estimate feature maps of arbitrary scales, replacing upsampling for decoding. We apply the proposed implicit module to a bidirectional fusion feature pyramid network. Based on this implicit module, we propose three network branches: a class estimation branch, a bounding box estimation branch, and the final pose estimation branch. For this pose estimation branch, we propose a miniature dual-stream network, which estimates object surface features and complements the relationship between 2D and 3D. We represent the rotation component using the SVD (Singular Value Decomposition) representation method, resulting in a more accurate object pose. We achieved satisfactory experimental results on the widely used 6D pose estimation benchmark dataset Linemod. This innovative approach provides a more convenient solution for 6D object pose estimation.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] COPE: End-to-end trainable Constant Runtime Object Pose Estimation
    Thalhammer, Stefan
    Patten, Timothy
    Vincze, Markus
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2859 - 2869
  • [2] FusionNet: An End-to-End Hybrid Model for 6D Object Pose Estimation
    Ye, Yuning
    Park, Hanhoon
    [J]. ELECTRONICS, 2023, 12 (19)
  • [3] End-to-End 6-DoF Object Pose Estimation Through Differentiable Rasterization
    Palazzi, Andrea
    Bergamini, Luca
    Calderara, Simone
    Cucchiara, Rita
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 702 - 715
  • [4] AN END-TO-END FRAMEWORK FOR POSE ESTIMATION OF OCCLUDED PEDESTRIANS
    Das, Sudip
    Kishore, Perla Sai Raj
    Bhattacharya, Ujjwal
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1446 - 1450
  • [5] End-to-End Detection and Pose Estimation of Two Interacting Hands
    Kim, Dong Uk
    Kim, Kwang In
    Baek, Seungryul
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11169 - 11178
  • [6] End-to-End Multi-Person Pose Estimation with Transformers
    Shi, Dahu
    Wei, Xing
    Li, Liangqi
    Ren, Ye
    Tan, Wenming
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11059 - 11068
  • [7] RPNet: An End-to-End Network for Relative Camera Pose Estimation
    En, Sovann
    Lechervy, Alexis
    Jurie, Frederic
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 738 - 745
  • [8] Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation
    Liu, Huan
    Chen, Qiang
    Tan, Zichang
    Liu, Jiang-Jiang
    Wang, Jian
    Su, Xiangbo
    Li, Xiaolong
    Yao, Kun
    Han, Junyu
    Ding, Errui
    Zhao, Yao
    Wang, Jingdong
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 14983 - 14992
  • [9] REDE: End-to-End Object 6D Pose Robust Estimation Using Differentiable Outliers Elimination
    Hua, Weitong
    Zhou, Zhongxiang
    Wu, Jun
    Huang, Huang
    Wang, Yue
    Xiong, Rong
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 2886 - 2893
  • [10] End-to-end 3D Human Pose Estimation with Transformer
    Zhang, Bowei
    Cui, Peng
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4529 - 4536