Object 6D pose estimation algorithm based on improved heatmap loss function

被引：3

作者：

Lin Lin ^{[1
,2
]}

Wang Yan-jie ^{[1
]}

Sun Hai-chao ^{[1
]}

机构：

[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

来源：

CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS | 2022年 / 37卷 / 07期

关键词：

deep learning; pose estimation; loss function; heatmap;

D O I：

10.37188/CJLCD.2021-0317

中图分类号：

O7 [晶体学];

学科分类号：

0702 ; 070205 ; 0703 ; 080501 ;

摘要：

In view of the problem of low precision and slow training of heatmap regression network trained by mean square error (MSE) loss function used in traditional heatmap regression, the loss function Heatmap Wing Loss (HWing Loss) for heatmap regression is proposed in this thesis. In terms of different pixel values, the loss function has different loss function values, and the loss function gradient of foreground pixels is larger, which can make the network focus more on the foreground pixels and make the heatmap regression more accurate and faster. In line with the distribution characteristics of the heatmap, the keypoint inference method based on the Gaussian distribution is adopted in this thesis to reduce the quantization error when the heatmap infers the keypoints. By taking the two points as the basis, it constructs a new monocular pose estimation algorithm based on keypoint positioning. According to the experiments, in contrast with the algorithm using MSE Loss, the pose estimation algorithm using HWing Loss has a higher ADD(-S)accuracy rate, which reaches 88. 8% on the LINEMOD dataset. Meanwhile, the performance is better than other recent pose estimation algorithms based on deep learning. The algorithm in this thesis can run at the fastest speed of 25 fps on RTX3080 GPU, in which the high speed and performance can be both embodied.

引用

页码：913 / 923

页数：11

共 18 条

[1] Speeded-Up Robust Features (SURF)
Bay, Herbert
Ess, Andreas
Tuytelaars, Tinne
Van Gool, Luc
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) : 346 - 359
[2] Ding Nan-nan, 2012, Chinese Journal of Liquid Crystals and Displays, V27, P125, DOI 10.3788/YJYXS20122701.0125
[3] Do TT, 2018, Arxiv, DOI [arXiv:1802.10367, 10.48550/ARXIV.1802.10367]
[4] EPnP: An Accurate O(n) Solution to the PnP Problem
Lepetit, Vincent
Moreno-Noguer, Francesc
Fua, Pascal
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 81 (02) : 155 - 166
[5] Li Y, 2020, INT J COMPUT VISION, V128, P657, DOI [10.1007/s11263-019-01250-9, 10.1007/978-3-030-01231-1_42]
[6] Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation
Oberweger, Markus
Rad, Mahdi
Lepetit, Vincent
[J]. COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 125 - 141
[7] Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation
Park, Kiru
Patten, Timothy
Vincze, Markus
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7667 - 7676
[8] PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation
Peng, Sida
Liu, Yuan
Huang, Qixing
Zhou, Xiaowei
Bao, Hujun
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4556 - 4565
[9] Rublee E, 2011, IEEE I CONF COMP VIS, P2564, DOI 10.1109/ICCV.2011.6126544
[10] Deep High-Resolution Representation Learning for Human Pose Estimation
Sun, Ke
Xiao, Bin
Liu, Dong
Wang, Jingdong
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5686 - 5696

← 1 2 →