Aggregated pyramid gating network for human pose estimation without pre-training

Cited by: 8
Authors
Jiang, Chenru [1, 2]
Huang, Kaizhu [3]
Zhang, Shufei [4]
Wang, Xinheng [2]
Xiao, Jimin [2]
Goulermas, Yannis [1]
Affiliations
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZX, England
[2] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[3] Duke Kunshan Univ, Data Sci Res Ctr, Kunshan, Duke Ave 8, Suzhou 215316, Peoples R China
[4] Shanghai Artificial Intelligence Lab, 37th floor, AI Tower, 701 Yunjin Rd, Shanghai, Peoples R China
Keywords
Pyramid gating system; Stabilization; Human pose estimation
DOI
10.1016/j.patcog.2023.109429
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we propose a comprehensive aggregated residual gating structure, the Pyramid GAting Network (PGA-Net), for human pose estimation, which can select, distill, and fuse semantic-level and natural-level information from multiple scales. In comparison, despite utilizing multi-scale features, most existing state-of-the-art pose estimation methods are still limited in three aspects. First, multi-scale features contain massively redundant information, which is unfortunately not distilled by most existing approaches. Second, because they prefer deeper network structures to extract strong semantic features, conventional methods often ignore the fusion of original texture information. Third, to attain a good parameter initialization, current methods rely heavily on pre-training, which is very time-consuming or even unavailable. Coping better with the above problems, our proposed PGA-Net distills high-level semantic features and replenishes low-level original information to reinforce module representation capability. Meanwhile, PGA-Net demonstrates notable training stability and superior performance even without pre-training. Extensive experiments demonstrate that our method consistently outperforms previous approaches even without pre-training, thus enabling end-to-end model training from scratch. On the COCO benchmark, PGA-Net consistently improves on the baseline (without pre-training) by over 3% under various model configurations. © 2023 Elsevier Ltd. All rights reserved.
Pages: 13