Aggregated pyramid gating network for human pose estimation without pre-training

被引:8
|
作者
Jiang, Chenru [1 ,2 ]
Huang, Kaizhu [3 ]
Zhang, Shufei [4 ]
Wang, Xinheng [2 ]
Xiao, Jimin [2 ]
Goulermas, Yannis [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZX, England
[2] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[3] Duke Kunshan Univ, Data Sci Res Ctr, Kunshan, Duke Ave 8, Suzhou 215316, Peoples R China
[4] Shanghai Artificial Intelligence Lab, 37th floor, AI Tower, 701 Yunjin Rd, Shanghai, Peoples R China
关键词
Pyramid gating system; Stabilization; Human pose estimation;
D O I
10.1016/j.patcog.2023.109429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a comprehensive aggregated residual gating structure, the Pyramid GAting Net-work (PGA-Net) for human pose estimation which can select, distill, and fuse semantic level and natural level information from multiple scales. In comparison, through utilizing multi-scale features, most ex -isting state-of-the-art pose estimation methods are still limited in three aspects. First, multi-scale fea-tures contain massively redundant information, which is unfortunately not distilled by most existing approaches. Second, preferring deeper network structures to extract strong semantic features, the con-ventional methods often ignore original texture information fusion. Third, to attain a good parameter initialization, the current methods heavily rely on pre-training, which is very time-consuming or even unavailable. While better coping with the above problems, our proposed PGA-Net distills high-level se-mantic features and replenishes low-level original information to reinforce module representation capa-bility. Meanwhile, PGA-Net demonstrates notable training stability and superior performance even with-out pre-training. Extensive experiments demonstrate that our method consistently outperforms previous approaches even without pre-training, enabling thus an end-to-end model training from scratch. In COCO benchmark, PGA-Net consistently achieves over 3% improvements than the baseline (without pre-training) under various model configurations.1 (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] BrainNPT: Pre-Training Transformer Networks for Brain Network Classification
    Hu, Jinlong
    Huang, Yangmin
    Wang, Nan
    Dong, Shoubin
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2024, 32 : 2727 - 2736
  • [22] Synthetic pre-training for neural-network interatomic potentials
    Gardner, John L. A.
    Baker, Kathryn T.
    Deringer, Volker L.
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (01):
  • [23] Pre-Training of an Artificial Neural Network for Software Fault Prediction
    Owhadi-Kareshk, Moein
    Sedaghat, Yasser
    Akbarzadeh-T, Mohammad-R
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2017, : 223 - 228
  • [24] Pre-training Graph Neural Network for Cross Domain Recommendation
    Wang, Chen
    Liang, Yueqing
    Liu, Zhiwei
    Zhang, Tao
    Yu, Philip S.
    2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 140 - 145
  • [25] Unsupervised Pre-training on Improving the Performance of Neural Network in Regression
    Salida, Pallabi
    Vij, Prateek
    Baruah, Rashmi Dutta
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [26] Self Adversarial Training for Human Pose Estimation
    Chou, Chia-Jung
    Chien, Jui-Ting
    Chen, Hwann-Tzong
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 17 - 30
  • [27] PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
    Xiao, Wen
    Beltagy, Iz
    Carenini, Giuseppe
    Cohan, Arman
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5245 - 5263
  • [28] An efficient interactive segmentation framework for medical images without pre-training
    Sun, Lei
    Tian, Zhiqiang
    Chen, Zhang
    Luo, Wenrui
    Du, Shaoyi
    MEDICAL PHYSICS, 2023, 50 (04) : 2239 - 2248
  • [29] A Lightweight Network Based on Pyramid Residual Module for Human Pose Estimation (vol 29, pg 668, 2019)
    Gao, Bingkun
    Ma, Ke
    Bi, Hongbo
    Wang, Ling
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2020, 30 (03) : 565 - 565
  • [30] Unsupervised Green Object Tracker (GOT) without Offline Pre-training
    Zhou, Zhiruo
    You, Suya
    Kuo, C.-C. Jay
    APSIPA Transactions on Signal and Information Processing, 2024, 13 (01):