Overcoming Data Deficiency for Multi-Person Pose Estimation

被引:0
|
作者
Dai, Yan [1 ,2 ]
Wang, Xuanhan [1 ,2 ]
Gao, Lianli [1 ,2 ]
Song, Jingkuan [3 ]
Zheng, Feng [4 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Future Media Ctr, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Sichuan Prov People Hosp, Chengdu 611731, Peoples R China
[4] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Pose estimation; Task analysis; Training; Integrated circuit modeling; Complexity theory; Measurement; Data models; Data deficiency; data generation; fore-background imbalance; multi-person pose estimation; NETWORK;
D O I
10.1109/TNNLS.2023.3244957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Building multi-person pose estimation (MPPE) models that can handle complex foreground and uncommon scenes is an important challenge in computer vision. Aside from designing novel models, strengthening training data is a promising direction but remains largely unexploited for the MPPE task. In this article, we systematically identify the key deficiencies of existing pose datasets that prevent the power of well-designed models from being fully exploited and propose the corresponding solutions. Specifically, we find that the traditional data augmentation techniques are inadequate in addressing the two key deficiencies, imbalanced instance complexity (IC) (evaluated by our new metric IC) and insufficient realistic scenes. To overcome these deficiencies, we propose a model-agnostic full-view data generation (Full-DG) method to enrich the training data from the perspectives of both poses and scenes. By hallucinating images with more balanced pose complexity and richer real-world scenes, Full-DG can help improve pose estimators' robustness and generalizability. In addition, we introduce a plug-and-play adaptive category-aware loss (AC-loss) to alleviate the severe pixel-level imbalance between keypoints and backgrounds (i.e., around 1:600). Full-DG together with AC-loss can be readily applied to both the bottom-up and top-down models to improve their accuracy. Notably, plugging into the representative estimators HigherHRNet and HRNet, our method achieves substantial performance gains of 1.0%-2.9% AP on the COCO benchmark, and 1.0%-5.1% AP on the CrowdPose benchmark.
引用
收藏
页码:10857 / 10868
页数:12
相关论文
共 50 条
  • [1] Pose Knowledge Transfer for multi-person pose estimation
    Buwei Li
    Yi Ji
    Ying Li
    Yunlong Xu
    Chunping Liu
    [J]. Signal, Image and Video Processing, 2022, 16 : 321 - 328
  • [2] Pose Partition Networks for Multi-person Pose Estimation
    Nie, Xuecheng
    Feng, Jiashi
    Xing, Junliang
    Yan, Shuicheng
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 705 - 720
  • [3] Pose Knowledge Transfer for multi-person pose estimation
    Li, Buwei
    Ji, Yi
    Li, Ying
    Xu, Yunlong
    Liu, Chunping
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 321 - 328
  • [4] Monocular multi-person pose estimation: A survey
    dos Reis, Eduardo Souza
    Seewald, Lucas Adams
    Antunes, Rodolfo Stoffel
    Rodrigues, Vinicius Facco
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    da Silveira Jr, Luiz Gonzaga
    Eskofier, Bjoern
    Maier, Andreas
    Horz, Tim
    Fahrig, Rebecca
    [J]. PATTERN RECOGNITION, 2021, 118
  • [5] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [6] Multi-Person Pose Estimation on Embedded Device
    Ma, Zhipeng
    Tian, Dawei
    Zhang, Ming
    He, Dingxin
    [J]. 2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 57 - 61
  • [7] The Overview of Multi-person Pose Estimation Method
    Li, Bingyi
    Zou, Jiaqi
    Wang, Luyao
    Li, Xiangyuan
    Li, Yue
    Lei, Rongjia
    Sun, Songlin
    [J]. SIGNAL AND INFORMATION PROCESSING, NETWORKING AND COMPUTERS (ICSINC), 2019, 550 : 600 - 607
  • [8] Multi-Domain Pose Network for Multi-Person Pose Estimation and Tracking
    Guo, Hengkai
    Tang, Tang
    Luo, Guozhong
    Chen, Riwei
    Lu, Yongchen
    Wen, Linfu
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 209 - 216
  • [9] Lite Hourglass Network for Multi-person Pose Estimation
    Zhao, Ying
    Luo, Zhiwei
    Quan, Changqin
    Liu, Dianchao
    Wang, Gang
    [J]. MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 226 - 238
  • [10] Integral Knowledge Distillation for Multi-Person Pose Estimation
    Xu, Xixia
    Zou, Qi
    Lin, Xue
    Huang, Yaping
    Tian, Yi
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 436 - 440