Learning Joint Structure for Human Pose Estimation

Cited by: 0
Authors
Feng, Shenming [1]
Hu, Haifeng [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Guangdong, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Learning joint structure; structural consistency; heatmap and offset estimation; single person pose estimation
DOI
10.1145/3392302
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, tremendous progress has been achieved in human pose estimation with the development of convolutional neural networks (CNNs). However, current methods still struggle with severe occlusion, back views, and large pose variation because they do not consider the spatial relationships between different joints, which can provide strong cues for localizing hidden keypoints. In this work, we design a Structural Pose Network (SPN) to take full advantage of joint structure for human pose estimation in unconstrained environments. Specifically, the proposed model is composed of two subnets: a Structure Residual Network (SRN) and a Structure Improving Network (SIN). Given an input image, SRN first captures rich joint structure as priors through a multi-branch feature extraction module, followed by an hourglass network with pyramid residual units that enlarges the receptive field and yields further structural feature representations. SIN, based on coordinate regression, optimizes the spatial relationships of different joints via an attention mechanism, thus refining the initial prediction from SRN. In addition, we propose a novel structure-consistency constraint, which maintains the structural consistency between the joints and body parts by estimating whether the joints are located in their corresponding parts. At the same time, an online hard regions mining (OHRM) strategy is introduced to drive the network to pay corresponding attention to different body parts. The experimental results on three challenging datasets show that our method outperforms other state-of-the-art algorithms.
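The abstract does not give the formulation of the online hard regions mining (OHRM) strategy. As a rough illustration of the general idea only, the sketch below applies generic hard-example mining to body regions: compute one loss per region, then average only the largest few so that poorly predicted regions dominate the training signal. The function names `region_mse` and `hard_region_loss`, the `top_k` parameter, and the use of mean squared error are all assumptions made for this example and are not taken from the paper.

```python
from typing import Sequence


def region_mse(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean squared error between flattened predicted and target maps of one region.

    Hypothetical helper: the paper's per-region loss may differ.
    """
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def hard_region_loss(per_region_losses: Sequence[float], top_k: int) -> float:
    """Average the top_k largest per-region losses (generic hard-example mining).

    Assumption: hard regions are simply those with the largest loss.
    """
    if not 1 <= top_k <= len(per_region_losses):
        raise ValueError("top_k must be between 1 and the number of regions")
    hardest = sorted(per_region_losses, reverse=True)[:top_k]
    return sum(hardest) / top_k


if __name__ == "__main__":
    # Four illustrative body regions with made-up loss values.
    losses = [0.02, 0.30, 0.05, 0.21]
    # Only the two hardest regions (0.30 and 0.21) contribute to the result.
    print(hard_region_loss(losses, top_k=2))
```

With `top_k` equal to the number of regions, this reduces to the plain mean over all regions, so `top_k` controls how strongly training focuses on the hardest parts.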
Pages: 17
Related papers
50 in total
  • [1] Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation
    Nie, Xuecheng
    Feng, Jiashi
    Yan, Shuicheng
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 519 - 534
  • [2] Combining Parsing Information With Joint Structure for Human Pose Estimation
    Feng, Shenming
    Li, Xiying
    Hu, Haifeng
    [J]. IEEE ACCESS, 2020, 8 : 123408 - 123418
  • [3] Human Pose Estimation using Deep Structure Guided Learning
    Ai, Baole
    Zhou, Yu
    Yu, Yao
    Du, Sidan
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1224 - 1231
  • [4] Joint relation based human pose estimation
    Shuang Liang
    Gang Chu
    Chi Xie
    Jiewen Wang
    [J]. The Visual Computer, 2022, 38 : 1369 - 1381
  • [5] Joint relation based human pose estimation
    Liang, Shuang
    Chu, Gang
    Xie, Chi
    Wang, Jiewen
    [J]. VISUAL COMPUTER, 2022, 38 (04): : 1369 - 1381
  • [6] Active Learning for Human Pose Estimation
    Liu, Buyu
    Ferrari, Vittorio
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4373 - 4382
  • [7] Learning to Refine Human Pose Estimation
    Fieraru, Mihai
    Khoreva, Anna
    Pishchulin, Leonid
    Schiele, Bernt
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 318 - 327
  • [8] A deep structure for human pose estimation
    Zhao, Lin
    Gao, Xinbo
    Tao, Dacheng
    Li, Xuelong
    [J]. SIGNAL PROCESSING, 2015, 108 : 36 - 45
  • [9] Joint Human Pose Estimation and Instance Segmentation with PosePlusSeg
    Ahmad, Niaz
    Khan, Jawad
    Kim, Jeremy Yuhyun
    Lee, Youngmoon
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 69 - 76
  • [10] Learning Feature Pyramids for Human Pose Estimation
    Yang, Wei
    Li, Shuang
    Ouyang, Wanli
    Li, Hongsheng
    Wang, Xiaogang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1290 - 1299