Aggregated pyramid gating network for human pose estimation without pre-training

Cited by: 8

Authors:

Jiang, Chenru [1,2]

Huang, Kaizhu [3]

Zhang, Shufei [4]

Wang, Xinheng [2]

Xiao, Jimin [2]

Goulermas, Yannis [1]

Affiliations:

[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZX, England

[2] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China

[3] Duke Kunshan Univ, Data Sci Res Ctr, Kunshan, Duke Ave 8, Suzhou 215316, Peoples R China

[4] Shanghai Artificial Intelligence Lab, 37th floor, AI Tower, 701 Yunjin Rd, Shanghai, Peoples R China

Source:

PATTERN RECOGNITION | 2023 / Vol. 138

Keywords:

Pyramid gating system; Stabilization; Human pose estimation;

DOI:

10.1016/j.patcog.2023.109429

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this work, we propose a comprehensive aggregated residual gating structure, the Pyramid GAting Network (PGA-Net), for human pose estimation, which can select, distill, and fuse semantic-level and natural-level information from multiple scales. In comparison, although they utilize multi-scale features, most existing state-of-the-art pose estimation methods are still limited in three aspects. First, multi-scale features contain massively redundant information, which is unfortunately not distilled by most existing approaches. Second, because they prefer deeper network structures to extract strong semantic features, conventional methods often ignore the fusion of original texture information. Third, to attain a good parameter initialization, current methods rely heavily on pre-training, which is very time-consuming or even unavailable. To better cope with these problems, our proposed PGA-Net distills high-level semantic features and replenishes low-level original information to reinforce module representation capability. Meanwhile, PGA-Net demonstrates notable training stability and superior performance even without pre-training. Extensive experiments demonstrate that our method consistently outperforms previous approaches even without pre-training, thus enabling end-to-end model training from scratch. On the COCO benchmark, PGA-Net consistently achieves an improvement of over 3% over the baseline (without pre-training) under various model configurations. © 2023 Elsevier Ltd. All rights reserved.

Pages: 13
