Detecting Object Surface Keypoints From a Single RGB Image via Deep Learning Network for 6-DoF Pose Estimation

被引:3
|
作者
Aing, Lee [1 ]
Lie, Wen-Nung [1 ,2 ,3 ]
机构
[1] Natl Chung Cheng Univ CCU, Dept Elect Engn, Chiayi 62102, Taiwan
[2] Natl Chung Cheng Univ CCU, Ctr Innovat Res Aging Soc CIRAS, Chiayi 62102, Taiwan
[3] Natl Chung Cheng Univ CCU, Adv Inst Mfg High Tech Innovat AIM HI, Chiayi 62102, Taiwan
关键词
Three-dimensional displays; Pose estimation; Training; Shape; Neural networks; Solid modeling; Surface texture; 2D projected keypoints; 3D object keypoints; 6-DoF; deep learning network; PnP algorithm; object pose estimation; surface curvature;
D O I
10.1109/ACCESS.2021.3082406
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating the 6-DoF (Degree of Freedom) object pose from a single RGB image is one of the challenging tasks in the field of computer vision. Before the pose which is defined as the translation and rotation parameters can be derived by the traditional PnP algorithm, 2D image projections of a set of 3D object keypoints must be accurately detected. In this paper, we present techniques for defining 3D object surface keypoints and predicting their corresponding 2D counterparts via deep-learning network architectures. The main technique to designate 3D object keypoints is to employ quadratic fitting scheme for calculating the principal surface curvatures as the weights and then select from all surface points the ones mostly distributive with larger curvatures to describe the object shape as possible. However, the 2D projected keypoints are not directly regressed from the network, but encoded as the unit vector fields pointing to them, so that the voting scheme to recover back those 2D keypoints can be performed. Moreover, an effective loss function with the regularization term is adopted in training ResNet for predicting image projections of object keypoints by focusing on small-scale errors. Experimental results show that our proposed technique outperforms state-of-the-art approaches in both "2D projection" and "3D transformation" metrics.
引用
收藏
页码:77729 / 77741
页数:13
相关论文
共 50 条
  • [1] Detecting Object Surface Keypoints from a Single RGB Image via Deep Learning Network for 6DoF Pose Estimation
    Aing, Lee
    Lie, Wen-Nung
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 1673 - 1678
  • [2] B-Pose: Bayesian Deep Network for Camera 6-DoF Pose Estimation From RGB Images
    Rekavandi, Aref Miri
    Boussaid, Farid
    Seghouane, Abd-Krim
    Bennamoun, Mohammed
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6747 - 6754
  • [3] 6-DoF Pose Estimation and CAD Model Retrieval for XR Interface from a Single RGB Image
    Park, Sieun
    Jeong, Wonje
    Park, Soon-Yong
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED VISUAL INTERFACES, AVI 2024, 2024,
  • [4] Object 6-DoF pose estimation using auxiliary learning
    Chen M.
    Gai S.
    Da F.
    Yu J.
    [J]. Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (06): : 901 - 914
  • [5] 6DoF Pose Estimation of Transparent Object from a Single RGB-D Image
    Xu, Chi
    Chen, Jiale
    Yao, Mengyang
    Zhou, Jun
    Zhang, Lijun
    Liu, Yi
    [J]. SENSORS, 2020, 20 (23) : 1 - 19
  • [6] NEMA: 6-DoF Pose Estimation Dataset for Deep Learning
    Roman, Philippe Perez de San
    Desbarats, Pascal
    Domenger, Jean-Philippe
    Buendia, Axel
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 682 - 690
  • [7] Deep object 6-DoF pose estimation using instance segmentation
    Pujolle, Victor
    Hayashi, Eiji
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 241 - 244
  • [8] Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset
    Zheng, Tianyu
    Zhang, Chunyan
    Zhang, Shengwen
    Wang, Yanyan
    [J]. SENSORS, 2023, 23 (24)
  • [9] KVNet: An iterative 3D keypoints voting network for real-time 6-DoF object pose estimation
    Wang, Fei
    Zhang, Xing
    Chen, Tianyue
    Shen, Ze
    Liu, Shangdong
    He, Zhenquan
    [J]. NEUROCOMPUTING, 2023, 530 : 11 - 22
  • [10] RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization
    Xu, Yan
    Lin, Kwan-Yee
    Zhang, Guofeng
    Wang, Xiaogang
    Li, Hongsheng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (07) : 4669 - 4683