Object 6D pose estimation algorithm based on improved heatmap loss function

被引:3
|
作者
Lin Lin [1 ,2 ]
Wang Yan-jie [1 ]
Sun Hai-chao [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
deep learning; pose estimation; loss function; heatmap;
D O I
10.37188/CJLCD.2021-0317
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
In view of the problem of low precision and slow training of heatmap regression network trained by mean square error (MSE) loss function used in traditional heatmap regression, the loss function Heatmap Wing Loss (HWing Loss) for heatmap regression is proposed in this thesis. In terms of different pixel values, the loss function has different loss function values, and the loss function gradient of foreground pixels is larger, which can make the network focus more on the foreground pixels and make the heatmap regression more accurate and faster. In line with the distribution characteristics of the heatmap, the keypoint inference method based on the Gaussian distribution is adopted in this thesis to reduce the quantization error when the heatmap infers the keypoints. By taking the two points as the basis, it constructs a new monocular pose estimation algorithm based on keypoint positioning. According to the experiments, in contrast with the algorithm using MSE Loss, the pose estimation algorithm using HWing Loss has a higher ADD(-S)accuracy rate, which reaches 88. 8% on the LINEMOD dataset. Meanwhile, the performance is better than other recent pose estimation algorithms based on deep learning. The algorithm in this thesis can run at the fastest speed of 25 fps on RTX3080 GPU, in which the high speed and performance can be both embodied.
引用
收藏
页码:913 / 923
页数:11
相关论文
共 18 条
  • [1] Speeded-Up Robust Features (SURF)
    Bay, Herbert
    Ess, Andreas
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) : 346 - 359
  • [2] Ding Nan-nan, 2012, Chinese Journal of Liquid Crystals and Displays, V27, P125, DOI 10.3788/YJYXS20122701.0125
  • [3] Do TT, 2018, Arxiv, DOI [arXiv:1802.10367, 10.48550/ARXIV.1802.10367]
  • [4] EPnP: An Accurate O(n) Solution to the PnP Problem
    Lepetit, Vincent
    Moreno-Noguer, Francesc
    Fua, Pascal
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 81 (02) : 155 - 166
  • [5] Li Y, 2020, INT J COMPUT VISION, V128, P657, DOI [10.1007/s11263-019-01250-9, 10.1007/978-3-030-01231-1_42]
  • [6] Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation
    Oberweger, Markus
    Rad, Mahdi
    Lepetit, Vincent
    [J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 125 - 141
  • [7] Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation
    Park, Kiru
    Patten, Timothy
    Vincze, Markus
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7667 - 7676
  • [8] PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation
    Peng, Sida
    Liu, Yuan
    Huang, Qixing
    Zhou, Xiaowei
    Bao, Hujun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4556 - 4565
  • [9] Rublee E, 2011, IEEE I CONF COMP VIS, P2564, DOI 10.1109/ICCV.2011.6126544
  • [10] Deep High-Resolution Representation Learning for Human Pose Estimation
    Sun, Ke
    Xiao, Bin
    Liu, Dong
    Wang, Jingdong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5686 - 5696