DIR-BHRNet: A Lightweight Network for Real-Time Vision-Based Multiperson Pose Estimation on Smartphones

Authors
Lan, Gongjin [1]
Wu, Yu [1]
Hao, Qi [1, 2]
Affiliations
[1] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China
[2] Southern Univ Sci & Technol, Res Inst Trustworthy Autonomous Syst, Shenzhen 518055, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep learning; human pose estimation (HPE); multiperson pose estimation (MPPE); real time; smartphones;
DOI
10.1109/TII.2024.3421511
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Human pose estimation (HPE), particularly multiperson pose estimation (MPPE), has been applied in many domains, such as human-machine systems. However, current MPPE methods generally run on powerful GPU systems and incur high computational costs, which makes real-time MPPE on mobile devices with limited computing power challenging. In this article, we propose a lightweight neural network, DIR-BHRNet, for real-time MPPE on smartphones. DIR-BHRNet combines two designs: a novel lightweight convolutional module, dense inverted residual (DIR), which improves accuracy by adding a depthwise convolution and a shortcut connection to the well-known inverted residual; and a novel efficient network structure, balanced HRNet (BHRNet), which reduces computational cost by reconfiguring the number of convolutional blocks on each branch. We evaluate DIR-BHRNet on the well-known COCO and CrowdPose datasets. The results show that DIR-BHRNet outperforms state-of-the-art methods in accuracy at a real-time computational cost. Finally, we implement DIR-BHRNet on current mainstream Android smartphones, where it runs at more than 10 FPS. The freely available executable file (Android 10), source code, and a video description of this work are publicly available on the page(1) to facilitate the development of real-time MPPE on smartphones.
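The abstract attributes the accuracy gain of DIR to an extra depthwise convolution and shortcut added to the inverted residual. As a rough illustration of why a depthwise convolution is a cheap addition, the sketch below counts weights for the well-known inverted residual layout (1x1 expand, 3x3 depthwise, 1x1 project). It does not reproduce the DIR module itself, whose exact layout the abstract does not give. The function names, the channel width of 32, and the expansion factor of 6 are assumptions chosen for illustration; biases and normalization layers are ignored.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Weight count of a k x k convolution without bias."""
    assert c_in % groups == 0 and c_out % groups == 0
    return (c_in // groups) * c_out * k * k


def inverted_residual_params(c, expansion=6, k=3):
    """Weights in a baseline inverted residual block (illustrative only)."""
    hidden = c * expansion                                   # expanded width
    expand = conv_params(c, hidden, 1)                       # 1x1 pointwise expand
    depthwise = conv_params(hidden, hidden, k, groups=hidden)  # k x k depthwise
    project = conv_params(hidden, c, 1)                      # 1x1 pointwise project
    return expand + depthwise + project


if __name__ == "__main__":
    c, expansion = 32, 6        # assumed values, not taken from the paper
    hidden = c * expansion
    print("inverted residual block:", inverted_residual_params(c, expansion))
    print("3x3 depthwise at hidden width:",
          conv_params(hidden, hidden, 3, groups=hidden))
    print("3x3 dense conv at hidden width:",
          conv_params(hidden, hidden, 3))
```

Under these assumed sizes the block has 14016 weights, of which the 3x3 depthwise layer contributes 1728, while a dense 3x3 convolution at the same hidden width would need 331776. Where the paper actually places its additional depthwise convolution is not stated in the abstract, so these figures indicate only the order of magnitude of such an addition.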
Pages: 12533-12541
Number of pages: 9
Related papers
50 in total
  • [21] A Vision-Based Method for Real-Time Traffic Flow Estimation on Edge Devices
    Tran, Duong Nguyen-Ngoc
    Pham, Long Hoang
    Nguyen, Huy-Hung
    Jeon, Jae Wook
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (08) : 8038 - 8052
  • [22] Lightweight Architecture for Real-Time Hand Pose Estimation with Deep Supervision
    Wu, Yufei
    Ruan, Xiaofei
    Zhang, Yu
    Zhou, Huang
    Du, Shengyu
    Wu, Gang
    [J]. SYMMETRY-BASEL, 2019, 11 (04):
  • [23] Real-Time Pose Estimation of Rats Based on Stereo Vision Embedded in a Robotic Rat
    Guo, Xiaowen
    Jia, Guanglu
    Al-Khulaqui, Mohamed
    Chen, Zhe
    Fukuda, Toshio
    Shi, Qing
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 4690 - 4695
  • [24] SP-YOLO: an end-to-end lightweight network for real-time human pose estimation
    Yuting Zhang
    Zongyan Wang
    Menglong Li
    Pei Gao
    [J]. Signal, Image and Video Processing, 2024, 18 : 863 - 876
  • [25] SP-YOLO: an end-to-end lightweight network for real-time human pose estimation
    Zhang, Yuting
    Wang, Zongyan
    Li, Menglong
    Gao, Pei
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 863 - 876
  • [26] Real-time vision-based relative aircraft navigation
    Georgia Institute of Technology, Atlanta, GA 30332-0150
    Unknown
    [J]. J. Aerosp. Comput. Inf. Commun., 2007, 4 (707-738):
  • [27] Real-time vision-based detection of waiting pedestrians
    Kehtarnavaz, N
    Rajkotwala, F
    [J]. REAL-TIME IMAGING, 1997, 3 (06) : 433 - 440
  • [28] A Real-Time Monocular Vision-Based Obstacle Detection
    Wang, Szu-Hong
    Li, Xiang-Xuan
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 695 - 699
  • [29] A real-time vision-based human motion capturing
    Huang, CL
    Shen, BC
    Shih, HC
    [J]. Visual Communications and Image Processing 2005, Pts 1-4, 2005, 5960 : 917 - 928
  • [30] Vision-based real-time traffic accident detection
    Zu Hui
    Xie Yaohua
    Ma Lu
    Fu Jiansheng
    [J]. 2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1035 - 1038