Detecting Object Surface Keypoints From a Single RGB Image via Deep Learning Network for 6-DoF Pose Estimation

被引:3
|
作者
Aing, Lee [1 ]
Lie, Wen-Nung [1 ,2 ,3 ]
机构
[1] Natl Chung Cheng Univ CCU, Dept Elect Engn, Chiayi 62102, Taiwan
[2] Natl Chung Cheng Univ CCU, Ctr Innovat Res Aging Soc CIRAS, Chiayi 62102, Taiwan
[3] Natl Chung Cheng Univ CCU, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi 62102, Taiwan
关键词
Three-dimensional displays; Pose estimation; Training; Shape; Neural networks; Solid modeling; Surface texture; 2D projected keypoints; 3D object keypoints; 6-DoF; deep learning network; PnP algorithm; object pose estimation; surface curvature;
D O I
10.1109/ACCESS.2021.3082406
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating the 6-DoF (Degree of Freedom) object pose from a single RGB image is one of the challenging tasks in the field of computer vision. Before the pose which is defined as the translation and rotation parameters can be derived by the traditional PnP algorithm, 2D image projections of a set of 3D object keypoints must be accurately detected. In this paper, we present techniques for defining 3D object surface keypoints and predicting their corresponding 2D counterparts via deep-learning network architectures. The main technique to designate 3D object keypoints is to employ quadratic fitting scheme for calculating the principal surface curvatures as the weights and then select from all surface points the ones mostly distributive with larger curvatures to describe the object shape as possible. However, the 2D projected keypoints are not directly regressed from the network, but encoded as the unit vector fields pointing to them, so that the voting scheme to recover back those 2D keypoints can be performed. Moreover, an effective loss function with the regularization term is adopted in training ResNet for predicting image projections of object keypoints by focusing on small-scale errors. Experimental results show that our proposed technique outperforms state-of-the-art approaches in both "2D projection" and "3D transformation" metrics.
引用
收藏
页码:77729 / 77741
页数:13
相关论文
共 50 条
  • [21] FingertipCubes: An Inexpensive DIY Wearable for 6-DoF per Fingertip Pose Estimation using a Single RGB Camera
    Gupta, Ojaswi
    Hebbalaguppe, Ramya
    SA'18: SIGGRAPH ASIA 2018 POSTERS, 2018,
  • [22] Uncalibrated stereo vision with deep learning for 6-DOF pose estimation for a robot arm system
    Abdelaal, Mahmoud
    Farag, Ramy M. A.
    Saad, Mohamed S.
    Bahgat, Ahmed
    Emara, Hassan M.
    El-Dessouki, Ayman
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 145
  • [23] Hybrid 6D Object Pose Estimation from the RGB Image
    Staszak, Rafal
    Belter, Dominik
    ICINCO: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL 1, 2019, : 541 - 549
  • [24] 6-DOF Pose Estimation from Single Ultrasound Image Using 3D IP Models
    Zheng, Bo
    Ishikawa, Ryo
    Oishi, Takeshi
    Takamatsu, Jun
    Ikeuchi, Katsushi
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 812 - +
  • [25] 6D Pose Estimation of Transparent Object From Single RGB Image for Robotic Manipulation
    Byambaa, Munkhtulga
    Koutaki, Gou
    Choimaa, Lodoiravsal
    IEEE ACCESS, 2022, 10 : 114897 - 114906
  • [26] Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation
    Li, Yiming
    Kong, Tao
    Chu, Ruihang
    Li, Yifeng
    Wang, Peng
    Li, Lei
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 3571 - 3578
  • [27] RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization
    Xu, Yan
    Lin, Kwan-Yee
    Zhang, Guofeng
    Wang, Xiaogang
    Li, Hongsheng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14860 - 14870
  • [28] Faster and Finer Pose Estimation for Object Pool in a Single RGB Image
    Aing, Lee
    Lie, Wen-Nung
    Chiang, Jui-Chiu
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [29] End-to-End 6-DoF Object Pose Estimation Through Differentiable Rasterization
    Palazzi, Andrea
    Bergamini, Luca
    Calderara, Simone
    Cucchiara, Rita
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 : 702 - 715
  • [30] Line-Based 6-DoF Object Pose Estimation and Tracking With an Event Camera
    Liu, Zibin
    Guan, Banglei
    Shang, Yang
    Yu, Qifeng
    Kneip, Laurent
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4765 - 4780