Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference

被引:0
|
作者
Wang, Dongkai [1 ]
Zhang, Shiliang [1 ]
Hua, Gang [2 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Wormpex AI Res, Bellevue, WA USA
基金
北京市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation in crowded scenes is challenging because overlapping and occlusions make it difficult to detect person bounding boxes and infer pose cues from individual keypoints. To address those issues, this paper proposes a direct pose-level inference strategy that is free of bounding box detection and key-point grouping. Instead of inferring individual keypoints, the Pose-level Inference Network (PINet) directly infers the complete pose cues for a person from his/her visible body parts. PINet first applies the Part-based Pose Generation (PPG) to infer multiple coarse poses for each person from his/her body parts. Those coarse poses are refined by the Pose Refinement module through incorporating pose priors, and finally are fused in the Pose Fusion module. PINet relies on discriminative body parts to differentiate overlapped persons, and applies visual body cues to infer the global pose cues. Experiments on several crowded scenes pose estimation benchmarks demonstrate the superiority of PINet. For instance, it achieves 59.8% AP on the OCHuman dataset, outperforming the recent works by a large margin(dagger).
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Dual Graph Networks for Pose Estimation in Crowded Scenes
    Jun Tu
    Gangshan Wu
    Limin Wang
    [J]. International Journal of Computer Vision, 2024, 132 (3) : 633 - 653
  • [2] Dual Graph Networks for Pose Estimation in Crowded Scenes
    Tu, Jun
    Wu, Gangshan
    Wang, Limin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 633 - 653
  • [3] CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark
    Li, Jiefeng
    Wang, Can
    Zhu, Hao
    Mao, Yihuan
    Fang, Hao-Shu
    Lu, Cewu
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10855 - 10864
  • [4] Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
    Chang, Shuning
    Yuan, Li
    Nie, Xuecheng
    Huang, Ziyuan
    Zhou, Yichen
    Chen, Yupeng
    Feng, Jiashi
    Yan, Shuicheng
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4630 - 4634
  • [5] People detection and articulated pose estimation framework for crowded scenes
    Alyammahi, Sohailah
    Bhaskar, Harish
    Ruta, Dymitr
    Al-Mualla, Mohammed
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 131 : 83 - 104
  • [6] Robust pose estimation for arbitrary objects in complex scenes
    Dörfler, P
    Schnurr, C
    [J]. PATTERN RECOGNITION, 2004, 3175 : 455 - 462
  • [7] Semi-direct Sparse Odometry with Robust and Accurate Pose Estimation for Dynamic Scenes
    Wang, Wufan
    Zhang, Lei
    [J]. COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, CAD/GRAPHICS 2023, 2024, 14250 : 123 - 137
  • [8] RSGNet: Relation based Skeleton Graph Network for Crowded Scenes Pose Estimation
    Dai, Yan
    Wang, Xuanhan
    Gao, Lianli
    Song, Jingkuan
    Shen, Heng Tao
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1193 - 1200
  • [9] Hybrid Inference Optimization for Robust Pose Graph Estimation
    Segal, Aleksandr V.
    Reid, Ian D.
    [J]. 2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 2675 - 2682
  • [10] Human pose estimation in crowded scenes using Keypoint Likelihood Variance Reduction
    Wei, Longsheng
    Yu, Xuefu
    Liu, Zhiheng
    [J]. DISPLAYS, 2024, 83