Optimization algorithm for estimating the human pose by using the morphable model

被引:0
|
作者
Li J. [1 ]
Zhang H. [1 ]
He B. [2 ]
机构
[1] School of Electronic Information and Artificial Intelligence, Shaanxi University of Science & Technology, Xi'an
[2] School of Electrical and Information Engineering, Tongji University, Shanghai
关键词
Morphable model; Motion reconstruction; Neural network; Point cloud; Pose estimation;
D O I
10.19665/j.issn1001-2400.2020.02.004
中图分类号
TN94 [电视];
学科分类号
0810 ; 081001 ;
摘要
An optimization algorithm is proposed utilizing the video data and point cloud data captured by the depth camera to solve the problems such as error-proneness and incoherence of motion sequence caused by the existing human pose estimation algorithms based on the morphable model. For video data, the neural network is first used in extracting the model parameters from each color image frame. Next, the human key-points and contour constraint are considered to optimize the above parameters. Then the coherence between every two consecutive frames is utilized to correct the error of pose estimation, thus making the resulting motion sequence smoother. In addition, the point cloud and the model obtained from the corresponding color image frame are used as the joint input to further improve the estimation accuracy. Finally, the distance between the point cloud and the corresponding point of the model is constrained to be as small as possible to obtain a more reasonable solution. The proposed algorithm and the state-of-the-art algorithms are compared qualitatively and quantitatively on the data set and real video set. Experimental results show that the algorithm can effectively correct the error and incoherence in the single-frame pose estimation results and greatly improve the accuracy when using point cloud data optimization. © 2020, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:23 / 31
页数:8
相关论文
共 19 条
  • [1] Wei S.E., Ramakrishna V., Kanade T., Et al., Convolutional Pose Machines, Proceedings of the 2016 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 4724-4732, (2016)
  • [2] Fang H.S., Xie S., Tai Y.W., Et al., Rmpe: Regional Multi-person Pose Estimation, Proceedings of the 2017 IEEE International Conference on Computer Vision, pp. 2353-2362, (2017)
  • [3] Cao Z., Simon T., Wei S.E., Et al., Realtime Multi-person 2d Pose Estimation Using Part Affinity Fields, Proceedings of the 201730th IEEE Conference on Computer Vision and Pattern Recognition, pp. 1302-1310, (2017)
  • [4] Luo Y., Ren J., Wang Z., Et al., LSTM Pose Machines, Proceedings of the 2018 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 5207-5215, (2018)
  • [5] Loper M., Mahmood N., Romero J., Et al., SMPL: a Skinned Multi-person Linear Model, ACM Transactions on Graphics, 34, 6, (2015)
  • [6] Bogo F., Kanazawa A., Lassner C., Et al., Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image, Lecture Notes in Computer Science: 9909, pp. 561-578, (2016)
  • [7] Lassner C., Romero J., Kiefel M., Et al., Unite the People: Closing the Loop between 3d and 2d Human RepresEntations, Proceedings of the 201730th IEEE Conference on Computer Vision and Pattern Recognition, pp. 4704-4713, (2017)
  • [8] Huang Y., Bogo F., Lassner C., Et al., Towards Accurate Marker-less Human Shape and Pose Estimation over Time, Proceedings of the 2017 International Conference on 3D Vision, pp. 421-430, (2018)
  • [9] Alldieck T., Magnor M., Xu W., Et al., Video Based Reconstruction of 3d People Models, Proceedings of the 2018 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8387-8397, (2018)
  • [10] Omran M., Lassner C., Pons-Moll G., Et al., Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation, Proceedings of the 2018 International Conference on 3D Vision, pp. 484-494, (2018)