LSDNet: lightweight stochastic depth network for human pose estimation

被引:0
|
作者
Zhang, Hengrui [1 ]
Qi, Yongfeng [1 ]
Chen, Huili [1 ]
Cao, Panpan [1 ]
Liang, Anye [1 ]
Wen, Shengcong [1 ]
机构
[1] Northwest Normal Univ, Lanzhou, Peoples R China
来源
VISUAL COMPUTER | 2025年 / 41卷 / 01期
基金
中国国家自然科学基金;
关键词
Human pose estimation; Bernoulli distribution; Lightweight network; Keypoints detection;
D O I
10.1007/s00371-024-03323-4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Human pose estimation plays a critical role in human-centred vision applications. Its influence extends to various aspects of daily life, from healthcare diagnostics and sports training to augmented reality experiences and gesture-controlled interfaces. While current approaches have achieved impressive accuracy, their high model complexity and slow detection speeds significantly limit their deployment on edge devices with limited computing power, such as mobile phones and IoT devices. In this paper, we introduce a novel lightweight network for 2D human pose estimation, called lightweight stochastic depth network (LSDNet). Our approach is based on the observation that the majority of HRNet's parameters are located in the middle and later stages in the network. We reduce some unnecessary branches to significantly reduce these parameters. This is achieved by leveraging the Bernoulli distribution to randomly remove these redundant branches, which improves the network's efficiency while also increasing its robustness. To further reduce the network's parameter count, we introduce two lightweight blocks with simple yet effective architectures. These blocks achieve significant parameter reduction while maintaining good accuracy. Furthermore, we leverage coordinate attention to effectively fuse features from different branches and scales. This mechanism captures both inter-channel dependencies and spatial context, enabling the network to accurately localize keypoints across the human body. We evaluated the effectiveness of our method on the MPII and COCO datasets, demonstrating superior results on human pose estimation compared to popular lightweight networks. Our code is available at: https://github.com/illusory2333/LSDNet.
引用
收藏
页码:257 / 270
页数:14
相关论文
共 50 条
  • [41] A lightweight convolutional neural network for pose estimation of a planar model
    Ocegueda-Hernandez, Vladimir
    Roman-Godinez, Israel
    Mendizabal-Ruiz, Gerardo
    MACHINE VISION AND APPLICATIONS, 2022, 33 (03)
  • [42] A lightweight convolutional neural network for pose estimation of a planar model
    Vladimir Ocegueda-Hernández
    Israel Román-Godínez
    Gerardo Mendizabal-Ruiz
    Machine Vision and Applications, 2022, 33
  • [43] SESPnet: a lightweight network with attention mechanism for spacecraft pose estimation
    Chen C.
    Jing Z.
    Pan H.
    Dun X.
    Huang J.
    Wu H.
    Cao S.
    Aerospace Systems, 2024, 7 (01) : 1 - 10
  • [44] Efficient Pose Estimation via a Lightweight Single-Branch Pose Distillation Network
    Zhang, Shihao
    Qiang, Baohua
    Yang, Xianyi
    Zhou, Mingliang
    Chen, Ruidong
    Chen, Lirui
    IEEE SENSORS JOURNAL, 2023, 23 (22) : 27709 - 27719
  • [45] UULPN: An ultra-lightweight network for human pose estimation based on unbiased data processing
    Wang, Wenming
    Zhang, Kaixiang
    Ren, Haopan
    Wei, Dejian
    Gao, Yanyan
    Liu, Juncheng
    NEUROCOMPUTING, 2022, 480 : 220 - 233
  • [46] Lightweight Monocular Depth Estimation with an Edge Guided Network
    Dong, Xingshuai
    Garratt, Matthew A.
    Anavatti, Sreenatha G.
    Abbass, Hussein A.
    Dong, Junyu
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 204 - 210
  • [47] A NOVEL LIGHTWEIGHT NETWORK FOR FAST MONOCULAR DEPTH ESTIMATION
    Heydrich, Tim
    Yang, Yimin
    Ma, Xiangyu
    Liu, Yu
    Du, Shan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2260 - 2264
  • [48] Manual Annotations on Depth Maps for Human Pose Estimation
    D'Eusanio, Andrea
    Pini, Stefano
    Borghi, Guido
    Vezzani, Roberto
    Cucchiara, Rita
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 233 - 244
  • [49] Dense depth alignment for human pose and shape estimation
    Karagoz, Batuhan
    Suat, Ozhan
    Uguz, Bedirhan
    Akbas, Emre
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 8577 - 8584
  • [50] Designing Compact Convolutional Filters for Lightweight Human Pose Estimation
    Niu, Shili
    Ou, Weihua
    Feng, Shihua
    Gou, Jianping
    Long, Fei
    Zhang, Wenchuan
    Zeng, Wu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021