A Real-time skeleton-based fall detection algorithm based on temporal convolutional networks and transformer encoder

被引:0
|
作者
Yu, Xiaoqun [1 ]
Wang, Chenfeng [1 ]
Wu, Wenyu [1 ]
Xiong, Shuping [2 ]
机构
[1] Southeast Univ, Sch Mech Engn, Dept Mech & Ind Design, Nanjing 211189, Peoples R China
[2] Korea Adv Inst Sci & Technol KAIST, Dept Ind & Syst Engn, Daejeon 34141, South Korea
基金
新加坡国家研究基金会;
关键词
Aging; Fall detection; Pose estimation; Temporal convolutional network; Transformer; Edge computing; RECOGNITION;
D O I
10.1016/j.pmcj.2025.102016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the population of older individuals living independently rises, coupled with the heightened risk of falls among this demographic, the need for automatic fall detection systems becomes increasingly urgent to ensure timely medical intervention. Computer vision (CV)-based methodologies have emerged as a preferred approach among researchers due to their contactless and pervasive nature. However, existing CV-based solutions often suffer from either poor robustness or prohibitively high computational requirements, impeding their practical implementation in elderly living environments. To address these challenges, we introduce TCNTE, a real-time skeleton-based fall detection algorithm that combines Temporal Convolutional Network (TCN) with Transformer Encoder (TE). We also successfully mitigate the severe class imbalance issue by implementing weighted focal loss. Cross-validation on multiple publicly available vision-based fall datasets demonstrates TCNTE's superiority over individual models (TCN and TE) and existing state-of-the-art fall detection algorithms, achieving remarkable accuracies (front view of UPFall: 99.58 %; side view of UP-Fall: 98.75 %; Le2i: 97.01 %; GMDCSA-24: 92.99 %) alongside practical viability. Visualizations using t-distributed stochastic neighbor embedding (t-SNE) reveal TCNTE's superior separation margin and cohesive clustering between fall and non-fall classes compared to TCN and TE. Crucially, TCNTE is designed for pervasive deployment in mobile and resource-constrained environments. Integrated with YOLOv8 pose estimation and BoT-SORT human tracking, the algorithm operates on NVIDIA Jetson Orin NX edge device, achieving an average frame rate of 19 fps for single-person and 17 fps for two-person scenarios. With its validated accuracy and impressive real-time performance, TCNTE holds significant promise for practical fall detection applications in older adult care settings.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Skeleton-Based Fall Detection with Multiple Inertial Sensors Using Spatial-Temporal Graph Convolutional Networks
    Yan, Jianjun
    Wang, Xueqiang
    Shi, Jiangtao
    Hu, Shuai
    SENSORS, 2023, 23 (04)
  • [2] A Lightweight Skeleton-Based 3D-CNN for Real-Time Fall Detection and Action Recognition
    Noor, Nadhira
    Park, In Kyu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2171 - 2180
  • [3] Skeleton-based action recognition via spatial and temporal transformer networks
    Plizzari, Chiara
    Cannici, Marco
    Matteucci, Matteo
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 208 (208-209)
  • [4] Temporal segment graph convolutional networks for skeleton-based action recognition
    Ding, Chongyang
    Wen, Shan
    Ding, Wenwen
    Liu, Kai
    Belyaev, Evgeny
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 110
  • [5] Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
    Yan, Sijie
    Xiong, Yuanjun
    Lin, Dahua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7444 - 7452
  • [6] Real-time Skeleton-Based Indoor Activity Recognition
    Han Yun
    Chung Sheng-Luen
    Yeh Jeng-Sheng
    Chen Qi-Jun
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3965 - 3970
  • [7] Involving Distinguished Temporal Graph Convolutional Networks for Skeleton-Based Temporal Action Segmentation
    Li, Yun-Heng
    Liu, Kai-Yuan
    Liu, Sheng-Lan
    Feng, Lin
    Qiao, Hong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 647 - 660
  • [8] Deep Residual Temporal Convolutional Networks for Skeleton-Based Human Action Recognition
    Khamsehashari, R.
    Gadzicki, K.
    Zetzsche, C.
    COMPUTER VISION SYSTEMS (ICVS 2019), 2019, 11754 : 376 - 385
  • [9] Real-time fall detection algorithm based on pose estimation
    Yu N.-G.
    Bai D.-G.
    Kongzhi yu Juece/Control and Decision, 2020, 35 (11): : 2761 - 2766
  • [10] Real-time fall attitude detection algorithm based on iRMB
    Xie, Xudong
    Xu, Bing
    Chen, Zhifei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)