Real-Time Multi-Person Video-Based Pose Estimation

被引:4
|
作者
Yan Fenting [1 ]
Wang Peng [1 ]
Lu Zhigang [1 ]
Ding Zhe [1 ]
Qiao Mengyu [1 ]
机构
[1] Xian Technol Univ, Sch Elect & Informat Engn, Xian 710021, Shaanxi, Peoples R China
关键词
image processing; multi-person pose estimation; spatial transformer network; semantic information; pose distance;
D O I
10.3788/LOP57.021006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For multi-person pose estimation in images and videos, it is necessary to address the inaccurate positioning of the human-bounding box and improve the detection accuracy of hard keypoints. This paper designs a real-time multi-person pose-estimation model based on a top-down framework. First, depth-separable convolution is added to the target-detection algorithm to improve the running speed of the human detector; then, by combining the feature pyramid network with context-semantic information, the online hard-example mining algorithm is used to solve the problem of low detection accuracy at hard keypoints. Finally, combining the spatial-transformation network and pose-similarity calculation, the redundant pose is eliminated and the accuracy of the bounding-box positioning is improved. In this paper, the average detection precision of the proposed model on the 2017MS COCO Test-dev dataset is 14.84% higher than that of the Mask R-CNN model, and 2.43% higher than that of the RMPE model. The frame frequency is 22 frames.s(-1).
引用
收藏
页数:8
相关论文
共 18 条
  • [1] [Anonymous], 2018, ECCV, DOI [DOI 10.1007/978-3-030-01234-249, 10.1007/978-3-030-01234-2_49]
  • [2] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [3] Human Pose Estimation with Iterative Error Feedback
    Carreira, Joao
    Agrawal, Pulkit
    Fragkiadaki, Katerina
    Malik, Jitendra
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4733 - 4742
  • [4] Fan XC, 2015, PROC CVPR IEEE, P1347, DOI 10.1109/CVPR.2015.7298740
  • [5] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [6] Aerial Target Detection Based on Improved Faster R-CNN
    Feng Xiaoyu
    Mei Wei
    Hu Dashuai
    [J]. ACTA OPTICA SINICA, 2018, 38 (06)
  • [7] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [8] Dual-Modal Emotion Recognition Based on Facial Expression and Body Posture in Video Sequences
    Jiang Mingxing
    Hu Min
    Wang Xiaohua
    Ren Fuji
    Wang Haowen
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2018, 55 (07)
  • [9] Stacked Hourglass Networks for Human Pose Estimation
    Newell, Alejandro
    Yang, Kaiyu
    Deng, Jia
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 483 - 499
  • [10] Redmon J, 2018, YOL0V3 INCREMENTAL I