Multi-person pose estimation based on graph grouping optimization

Authors
Zeng, Qingzhi [1]
Hu, Yingsong [1]
Li, Dan [1]
Sun, Dongya [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
Keywords
Human pose estimation; Graph construction; Deconvolution module; Grouping; Pose optimization
DOI
10.1007/s11042-022-13445-3
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Multi-person pose estimation has become an increasingly popular topic with advances in computer vision and human-machine interaction tasks, and research in this field can further improve the understanding of human poses and activities. Current mainstream multi-person pose estimation methods generally fall into two categories: top-down and bottom-up. Top-down methods achieve better accuracy by reducing the problem to single-person pose estimation, but this strategy considerably increases time complexity as the trade-off. Bottom-up methods locate all keypoints in the image directly, which is potentially more efficient and can run in real time. However, most current bottom-up methods separate keypoint detection and keypoint grouping into two independent steps, which greatly hinders both the overall performance and the computational efficiency of the algorithms. To address this issue, our study proposes an end-to-end bottom-up framework for multi-person pose estimation. Using HRNet as the backbone, we add a deconvolution module to obtain high-resolution feature maps in the keypoint proposal stage. A graph neural network is used in the grouping stage and is integrated with the backbone so that the whole framework can be trained end to end. With the keypoint candidates as nodes, two discriminators supervise the grouping process. Lastly, a graph-based pose optimization algorithm refines the results. Experiments on the COCO and CrowdPose datasets show that our method achieves better accuracy and greatly reduces the computation time as well.
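The grouping idea in the abstract (keypoint candidates as graph nodes, grouped into person instances) can be illustrated with a minimal sketch. This is not the authors' method: the abstract gives no implementation details, so the learned graph neural network and discriminators are replaced here by hand-picked pairwise scores, and grouping is done with plain union-find over edges above a threshold. The function name `group_keypoints`, the `edge_scores` dictionary, and the threshold of 0.5 are all hypothetical.

```python
from collections import defaultdict


def group_keypoints(num_nodes, edge_scores, threshold=0.5):
    """Group keypoint candidates (graph nodes) into person instances.

    edge_scores maps a pair of node indices (a, b) to a score in [0, 1]
    saying how likely the two candidates belong to the same person.
    In this sketch the scores are assumed inputs; in a learned system
    they would come from a model.
    """
    parent = list(range(num_nodes))

    def find(i):
        # Follow parent links to the root, halving the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Merge the two components of every edge that passes the threshold.
    for (a, b), score in edge_scores.items():
        if score >= threshold:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[max(root_a, root_b)] = min(root_a, root_b)

    # Collect the node indices of each connected component.
    groups = defaultdict(list)
    for i in range(num_nodes):
        groups[find(i)].append(i)
    return sorted(groups.values())


# Hypothetical example: five candidates, two people.
# Nodes 0-2 are strongly linked, nodes 3-4 are strongly linked,
# and the cross-person edge (2, 3) scores too low to merge them.
edge_scores = {(0, 1): 0.9, (1, 2): 0.8, (3, 4): 0.95, (2, 3): 0.1}
people = group_keypoints(5, edge_scores)
print(people)
```

Thresholded connected components is the simplest possible grouping rule and is only meant to make the node and edge vocabulary concrete; it has no notion of keypoint type and cannot recover from a single wrong high-scoring edge, which is the kind of failure a supervised grouping stage and a later pose optimization step would aim to reduce.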
Pages: 7039-7053
Number of pages: 15