Detecting Object Surface Keypoints From a Single RGB Image via Deep Learning Network for 6-DoF Pose Estimation

被引:3
|
作者
Aing, Lee [1 ]
Lie, Wen-Nung [1 ,2 ,3 ]
机构
[1] Natl Chung Cheng Univ CCU, Dept Elect Engn, Chiayi 62102, Taiwan
[2] Natl Chung Cheng Univ CCU, Ctr Innovat Res Aging Soc CIRAS, Chiayi 62102, Taiwan
[3] Natl Chung Cheng Univ CCU, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi 62102, Taiwan
关键词
Three-dimensional displays; Pose estimation; Training; Shape; Neural networks; Solid modeling; Surface texture; 2D projected keypoints; 3D object keypoints; 6-DoF; deep learning network; PnP algorithm; object pose estimation; surface curvature;
D O I
10.1109/ACCESS.2021.3082406
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating the 6-DoF (Degree of Freedom) object pose from a single RGB image is one of the challenging tasks in the field of computer vision. Before the pose which is defined as the translation and rotation parameters can be derived by the traditional PnP algorithm, 2D image projections of a set of 3D object keypoints must be accurately detected. In this paper, we present techniques for defining 3D object surface keypoints and predicting their corresponding 2D counterparts via deep-learning network architectures. The main technique to designate 3D object keypoints is to employ quadratic fitting scheme for calculating the principal surface curvatures as the weights and then select from all surface points the ones mostly distributive with larger curvatures to describe the object shape as possible. However, the 2D projected keypoints are not directly regressed from the network, but encoded as the unit vector fields pointing to them, so that the voting scheme to recover back those 2D keypoints can be performed. Moreover, an effective loss function with the regularization term is adopted in training ResNet for predicting image projections of object keypoints by focusing on small-scale errors. Experimental results show that our proposed technique outperforms state-of-the-art approaches in both "2D projection" and "3D transformation" metrics.
引用
收藏
页码:77729 / 77741
页数:13
相关论文
共 50 条
  • [41] 6DoF Pose Estimation with Object Cutout based on a Deep Autoencoder
    Liu, Xin
    Zhang, Jichao
    He, Xian
    Song, Xiuqiang
    Qin, Xueying
    ADJUNCT PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2019), 2019, : 360 - 365
  • [42] A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene
    Liu, Huikai
    Liu, Gaorui
    Zhang, Yue
    Lei, Linjian
    Xie, Hui
    Li, Yan
    Sun, Shengli
    MACHINES, 2021, 9 (10)
  • [43] Direct 6-DoF Pose Estimation from Point-Plane Correspondences
    Khoshelham, Kourosh
    2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 295 - 300
  • [44] Image Based 6-DOF Camera Pose Estimation with Weighted RANSAC 3D
    Wetzel, Johannes
    PATTERN RECOGNITION, GCPR 2013, 2013, 8142 : 249 - 254
  • [45] Low-Quality Rendering-Driven 6D Object Pose Estimation from Single RGB Image
    Zuo, Guoyu
    Zhang, Chengwei
    Liu, Hongxing
    Gong, Daoxiong
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [46] Accurate Shape-based 6-DoF Pose Estimation of Single-colored Objects
    Azad, Pedram
    Asfour, Tamim
    Dillmann, Ruediger
    2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, : 2690 - 2695
  • [47] GPDAN: Grasp Pose Domain Adaptation Network for Sim-to-Real 6-DoF Object Grasping
    Zheng, Liming
    Ma, Wenxuan
    Cai, Yinghao
    Lu, Tao
    Wang, Shuo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4585 - 4592
  • [48] Real-time 6D pose estimation from a single RGB image
    Zhang, Xin
    Jiang, Zhiguo
    Zhang, Haopeng
    IMAGE AND VISION COMPUTING, 2019, 89 : 1 - 11
  • [49] Accurate estimation of 6-DoF tooth pose in 3D intraoral scans for dental applications using deep learning
    Ding, Wanghui
    Sun, Kaiwei
    Yu, Mengfei
    Lin, Hangzheng
    Feng, Yang
    Li, Jianhua
    Liu, Zuozhu
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (09) : 1240 - 1249
  • [50] Deep-learning-based head pose estimation from a single RGB image and its application to medical CROM measurement
    Ritthipravat, Panrasee
    Chotikkakamthorn, Kittisak
    Lie, Wen-Nung
    Kusakunniran, Worapan
    Tuakta, Pimchanok
    Benjapornlert, Paitoon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 77009 - 77028