LSDNet: lightweight stochastic depth network for human pose estimation

被引：0

作者：

Zhang, Hengrui ^{[1
]}

Qi, Yongfeng ^{[1
]}

Chen, Huili ^{[1
]}

Cao, Panpan ^{[1
]}

Liang, Anye ^{[1
]}

Wen, Shengcong ^{[1
]}

机构：

[1] Northwest Normal Univ, Lanzhou, Peoples R China

来源：

VISUAL COMPUTER | 2025年 / 41卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Human pose estimation; Bernoulli distribution; Lightweight network; Keypoints detection;

D O I：

10.1007/s00371-024-03323-4

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Human pose estimation plays a critical role in human-centred vision applications. Its influence extends to various aspects of daily life, from healthcare diagnostics and sports training to augmented reality experiences and gesture-controlled interfaces. While current approaches have achieved impressive accuracy, their high model complexity and slow detection speeds significantly limit their deployment on edge devices with limited computing power, such as mobile phones and IoT devices. In this paper, we introduce a novel lightweight network for 2D human pose estimation, called lightweight stochastic depth network (LSDNet). Our approach is based on the observation that the majority of HRNet's parameters are located in the middle and later stages in the network. We reduce some unnecessary branches to significantly reduce these parameters. This is achieved by leveraging the Bernoulli distribution to randomly remove these redundant branches, which improves the network's efficiency while also increasing its robustness. To further reduce the network's parameter count, we introduce two lightweight blocks with simple yet effective architectures. These blocks achieve significant parameter reduction while maintaining good accuracy. Furthermore, we leverage coordinate attention to effectively fuse features from different branches and scales. This mechanism captures both inter-channel dependencies and spatial context, enabling the network to accurately localize keypoints across the human body. We evaluated the effectiveness of our method on the MPII and COCO datasets, demonstrating superior results on human pose estimation compared to popular lightweight networks. Our code is available at: https://github.com/illusory2333/LSDNet.

引用

页码：257 / 270

页数：14

共 50 条

[41] A lightweight convolutional neural network for pose estimation of a planar model
Ocegueda-Hernandez, Vladimir
Roman-Godinez, Israel
Mendizabal-Ruiz, Gerardo
MACHINE VISION AND APPLICATIONS, 2022, 33 (03)
[42] A lightweight convolutional neural network for pose estimation of a planar model
Vladimir Ocegueda-Hernández
Israel Román-Godínez
Gerardo Mendizabal-Ruiz
Machine Vision and Applications, 2022, 33
[43] SESPnet: a lightweight network with attention mechanism for spacecraft pose estimation
Chen C.
Jing Z.
Pan H.
Dun X.
Huang J.
Wu H.
Cao S.
Aerospace Systems, 2024, 7 (01) : 1 - 10
[44] Efficient Pose Estimation via a Lightweight Single-Branch Pose Distillation Network
Zhang, Shihao
Qiang, Baohua
Yang, Xianyi
Zhou, Mingliang
Chen, Ruidong
Chen, Lirui
IEEE SENSORS JOURNAL, 2023, 23 (22) : 27709 - 27719
[45] UULPN: An ultra-lightweight network for human pose estimation based on unbiased data processing
Wang, Wenming
Zhang, Kaixiang
Ren, Haopan
Wei, Dejian
Gao, Yanyan
Liu, Juncheng
NEUROCOMPUTING, 2022, 480 : 220 - 233
[46] Lightweight Monocular Depth Estimation with an Edge Guided Network
Dong, Xingshuai
Garratt, Matthew A.
Anavatti, Sreenatha G.
Abbass, Hussein A.
Dong, Junyu
2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 204 - 210
[47] A NOVEL LIGHTWEIGHT NETWORK FOR FAST MONOCULAR DEPTH ESTIMATION
Heydrich, Tim
Yang, Yimin
Ma, Xiangyu
Liu, Yu
Du, Shan
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2260 - 2264
[48] Manual Annotations on Depth Maps for Human Pose Estimation
D'Eusanio, Andrea
Pini, Stefano
Borghi, Guido
Vezzani, Roberto
Cucchiara, Rita
IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 233 - 244
[49] Dense depth alignment for human pose and shape estimation
Karagoz, Batuhan
Suat, Ozhan
Uguz, Bedirhan
Akbas, Emre
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 8577 - 8584
[50] Designing Compact Convolutional Filters for Lightweight Human Pose Estimation
Niu, Shili
Ou, Weihua
Feng, Shihua
Gou, Jianping
Long, Fei
Zhang, Wenchuan
Zeng, Wu
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021

← 1 2 3 4 5 →