Pose-native Neural Architecture Search for Multi-person Pose Estimation

被引:9
|
作者
Bao, Qian [1 ]
Liu, Wu [1 ]
Hong, Jun [1 ]
Duan, Lingyu [2 ]
Mei, Tao [1 ]
机构
[1] AI Res JD Com, Beijing, Peoples R China
[2] Peking Univ, Natl Engn Lab Video Technol, Beijing, Peoples R China
关键词
Multi-person pose estimation; Neural architecture search;
D O I
10.1145/3394171.3413842
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-person pose estimation has achieved great progress in recent years, even though, the precise prediction for occluded and invisible hard keypoints remains challenging. Most of the human pose estimation networks are equipped with an image classification-based pose encoder for feature extraction and a handcrafted pose decoder for high-resolution representations. However, the pose encoder might be sub-optimal because of the gap between image classification and pose estimation. The widely used multi-scale feature fusion in pose decoder is still coarse and cannot provide sufficient high-resolution details for hard keypoints. Neural Architecture Search (NAS) has shown great potential in many visual tasks to automatically search efficient networks. In this work, we present the Pose-native Network Architecture Search (PoseNAS) to simultaneously design a better pose encoder and pose decoder for pose estimation. Specifically, we directly search a data-oriented pose encoder with stacked searchable cells, which can provide an optimum feature extractor for the pose specific task. In the pose decoder, we exploit scale-adaptive fusion cells to promote rich information exchange across the multi-scale feature maps. Meanwhile, the pose decoder adopts a Fusion-and-Enhancement manner to progressively boost the high-resolution representations that are non-trivial for the precious prediction of hard keypoints. With the exquisitely designed search space and search strategy, PoseNAS can simultaneously search all modules in an end-to-end manner. PoseNAS achieves state-of-the-art performance on three public datasets, MPII, COCO, and PoseTrack, with small-scale parameters compared with the existing methods. Our best model obtains 76.7% mAP and 75.9% mAP on the COCO validation set and test set with only 33.6M parameters. Code and implementation are available at https://github.com/for-code0216/PoseNAS.
引用
收藏
页码:592 / 600
页数:9
相关论文
共 50 条
  • [1] Pose Knowledge Transfer for multi-person pose estimation
    Buwei Li
    Yi Ji
    Ying Li
    Yunlong Xu
    Chunping Liu
    [J]. Signal, Image and Video Processing, 2022, 16 : 321 - 328
  • [2] Pose Partition Networks for Multi-person Pose Estimation
    Nie, Xuecheng
    Feng, Jiashi
    Xing, Junliang
    Yan, Shuicheng
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 705 - 720
  • [3] Pose Knowledge Transfer for multi-person pose estimation
    Li, Buwei
    Ji, Yi
    Li, Ying
    Xu, Yunlong
    Liu, Chunping
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 321 - 328
  • [4] Adaptive Hypergraph Neural Network for Multi-Person Pose Estimation
    Xu, Xixia
    Zou, Qi
    Lin, Xue
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2955 - 2963
  • [5] Monocular multi-person pose estimation: A survey
    dos Reis, Eduardo Souza
    Seewald, Lucas Adams
    Antunes, Rodolfo Stoffel
    Rodrigues, Vinicius Facco
    Righi, Rodrigo da Rosa
    da Costa, Cristiano Andre
    da Silveira Jr, Luiz Gonzaga
    Eskofier, Bjoern
    Maier, Andreas
    Horz, Tim
    Fahrig, Rebecca
    [J]. PATTERN RECOGNITION, 2021, 118
  • [6] Multi-Domain Pose Network for Multi-Person Pose Estimation and Tracking
    Guo, Hengkai
    Tang, Tang
    Luo, Guozhong
    Chen, Riwei
    Lu, Yongchen
    Wen, Linfu
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT II, 2019, 11130 : 209 - 216
  • [7] RMPE: Regional Multi-Person Pose Estimation
    Fang, Hao-Shu
    Xie, Shuqin
    Tai, Yu-Wing
    Lu, Cewu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2353 - 2362
  • [8] Multi-Person Pose Estimation on Embedded Device
    Ma, Zhipeng
    Tian, Dawei
    Zhang, Ming
    He, Dingxin
    [J]. 2020 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND HUMAN-COMPUTER INTERACTION (ICHCI 2020), 2020, : 57 - 61
  • [9] PoseDet: Fast Multi-Person Pose Estimation Using Pose Embedding
    Tian, Chenyu
    Yu, Ran
    Zhao, Xinyuan
    Xia, Weihao
    Wang, Haoqian
    Yang, Yujiu
    [J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [10] The Overview of Multi-person Pose Estimation Method
    Li, Bingyi
    Zou, Jiaqi
    Wang, Luyao
    Li, Xiangyuan
    Li, Yue
    Lei, Rongjia
    Sun, Songlin
    [J]. SIGNAL AND INFORMATION PROCESSING, NETWORKING AND COMPUTERS (ICSINC), 2019, 550 : 600 - 607