Single-Stage is Enough: Multi-Person Absolute 3D Pose Estimation

被引:18
|
作者
Jin, Lei [1 ]
Xu, Chenyang [1 ]
Wang, Xiaojuan [1 ]
Xiao, Yabo [1 ]
Guo, Yandong [2 ]
Nie, Xuecheng [3 ]
Zhao, Jian [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] OPPO Res Inst, Hyderabad, Telangana, India
[3] Natl Univ Singapore, Singapore, Singapore
[4] Inst North Elect Equipment, Bengaluru, Karnataka, India
关键词
D O I
10.1109/CVPR52688.2022.01274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The existing multi-person absolute 3D pose estimation methods are mainly based on two-stage paradigm, i.e., top-down or bottom-up, leading to redundant pipelines with high computation cost. We argue that it is more desirable to simplify such two-stage paradigm to a single-stage one to promote both efficiency and performance. To this end, we present an efficient single-stage solution, Decoupled Regression Model (DRM), with three distinct novelties. First, DRM introduces a new decoupled representation for 3D pose, which expresses the 2D pose in image plane and depth information of each 3D human instance via 2D center point (center of visible keypoints) and root point (denoted as pelvis), respectively. Second, to learn better feature representation for the human depth regression, DRM introduces a 2D Pose-guided Depth Query Module (PDQM) to extract the features in 2D pose regression branch, enabling the depth regression branch to perceive the scale information of instances. Third, DRM leverages a Decoupled Absolute Pose Loss (DAPL) to facilitate the absolute root depth and root-relative depth estimation, thus improving the accuracy of absolute 3D pose. Comprehensive experiments on challenging benchmarks including MuPoTS-3D and Panoptic clearly verify the superiority of our framework, which outperforms the state-of-the-art bottom-up absolute 3D pose estimation methods.
引用
收藏
页码:13076 / 13085
页数:10
相关论文
共 50 条
  • [31] Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
    Liu, Qihao
    Zhang, Yi
    Bai, Song
    Yuille, Alan
    COMPUTER VISION - ECCV 2022, PT V, 2022, 13665 : 497 - 517
  • [32] PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation
    Benzine, Abdallah
    Chabot, Florian
    Luvison, Bertrand
    Quoc Cuong Pham
    Achard, Catherine
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6855 - 6864
  • [33] RF-based Multi-view Pose Machine for Multi-Person 3D Pose Estimation
    Xie, Chunyang
    Zhang, Dongheng
    Wu, Zhi
    Yu, Cong
    Hu, Yang
    Sun, Qibin
    Chen, Yan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2669 - 2674
  • [34] Multi-View Multi-Person 3D Pose Estimation with Plane Sweep Stereo
    Lin, Jiahao
    Lee, Gim Hee
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11881 - 11890
  • [35] VTP: volumetric transformer for multi-view multi-person 3D pose estimation
    Yuxing Chen
    Renshu Gu
    Ouhan Huang
    Gangyong Jia
    Applied Intelligence, 2023, 53 : 26568 - 26579
  • [36] VTP: volumetric transformer for multi-view multi-person 3D pose estimation
    Chen, Yuxing
    Gu, Renshu
    Huang, Ouhan
    Jia, Gangyong
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26568 - 26579
  • [37] RPM 2.0: RF-Based Pose Machines for Multi-Person 3D Pose Estimation
    Xie, Chunyang
    Zhang, Dongheng
    Wu, Zhi
    Yu, Cong
    Hu, Yang
    Chen, Yan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 490 - 503
  • [38] Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views
    Dong, Junting
    Jiang, Wen
    Huang, Qixing
    Bao, Hujun
    Zhou, Xiaowei
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7784 - 7793
  • [39] CRENet: Crowd region enhancement network for multi-person 3D pose estimation
    Li, Zhaokun
    Liu, Qiong
    IMAGE AND VISION COMPUTING, 2024, 151
  • [40] PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation
    Guo, Wen
    Corona, Enric
    Moreno-Noguer, Francesc
    Alameda-Pineda, Xavier
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2795 - 2805