Automatic Evaluation Method for Functional Movement Screening Based on Multi-Scale Lightweight 3D Convolution and an Encoder-Decoder

被引:0
|
作者
Lin, Xiuchun [1 ]
Liu, Yichao [2 ]
Feng, Chen [3 ]
Chen, Zhide [2 ]
Yang, Xu [4 ]
Cui, Hui [5 ]
机构
[1] Fujian Inst Educ, Fuzhou 350025, Peoples R China
[2] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350117, Peoples R China
[3] Fuzhou Polytech, Dept Informat Engn, Fuzhou 350003, Peoples R China
[4] Minjiang Univ, Coll Comp & Data Sci, Fuzhou 350108, Peoples R China
[5] Monash Univ, Dept Software Syst & Cybersecur, Melbourne, Vic 3800, Australia
基金
中国国家自然科学基金;
关键词
functional movement screening; human movement feature; 3D convolution; encoder-decoder; automatic evaluation method;
D O I
10.3390/electronics13101813
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Functional Movement Screening (FMS) is a test used to evaluate fundamental movement patterns in the human body and identify functional limitations. However, the challenge of carrying out an automated assessment of FMS is that complex human movements are difficult to model accurately and efficiently. To address this challenge, this paper proposes an automatic evaluation method for FMS based on a multi-scale lightweight 3D convolution encoder-decoder (ML3D-ED) architecture. This method adopts a self-built multi-scale lightweight 3D convolution architecture to extract features from videos. The extracted features are then processed using an encoder-decoder architecture and probabilistic integration technique to effectively predict the final score distribution. This architecture, compared with the traditional Two-Stream Inflated 3D ConvNet (I3D) network, offers a better performance and accuracy in capturing advanced human movement features in temporal and spatial dimensions. Specifically, the ML3D-ED backbone network reduces the number of parameters by 59.5% and the computational cost by 77.7% when compared to I3D. Experiments have shown that ML3D-ED achieves an accuracy of 93.33% on public datasets, demonstrating an improvement of approximately 9% over the best existing method. This outcome demonstrates the effectiveness of and advancements made by the ML3D-ED architecture and probabilistic integration technique in extracting advanced human movement features and evaluating functional movements.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Multi-scale Edge Detection Method Based on Encoder-Decoder
    Tian, An-Lin
    Lei, Wei-Min
    Zhang, Peng
    Zhang, Wei
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (07): : 936 - 943
  • [2] Segmentation of pancreatic tumors based on multi-scale convolution and channel attention mechanism in the encoder-decoder scheme
    Du, Yue
    Zuo, Xiaoying
    Liu, Shidong
    Cheng, Dai
    Li, Jie
    Sun, Mingzhu
    Zhao, Xin
    Ding, Hui
    Hu, Yabin
    MEDICAL PHYSICS, 2023, 50 (12) : 7764 - 7778
  • [3] An encoder-decoder network for crowd counting based on multi-scale attention mechanism
    Chuang, Hao-Hsiang
    Chen, Yi-Cheng
    Lin, Chang Hong
    Multimedia Tools and Applications, 2025, 84 (03) : 1187 - 1210
  • [4] Highly efficient encoder-decoder network based on multi-scale edge enhancement and dilated convolution for LDCT image denoising
    Jia, Lina
    He, Xu
    Huang, Aimin
    Jia, Beibei
    Wang, Xinfeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (8-9) : 6081 - 6091
  • [5] A Traffic Surveillance Multi-Scale Vehicle Detection Object Method Base on Encoder-Decoder
    Hong, Feng
    Lu, Chang-Hua
    Liu, Chun
    Liu, Ru-Ru
    Wei, Ju
    IEEE ACCESS, 2020, 8 : 47664 - 47674
  • [6] MULTI-STEP CHORD SEQUENCE PREDICTION BASED ON AGGREGATED MULTI-SCALE ENCODER-DECODER NETWORKS
    Carsault, Tristan
    McLeod, Andrew
    Esling, Philippe
    Nika, Jerome
    Nakamura, Eita
    Yoshii, Kazuyoshi
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [7] PointAtrousGraph: Deep Hierarchical Encoder-Decoder with Point Atrous Convolution for Unorganized 3D Points
    Pan, Liang
    Chew, Chee-Meng
    Lee, Gim Hee
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1113 - 1120
  • [8] PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation
    Zhu, Yiran
    Xu, Xing
    Shen, Fumin
    Ji, Yanli
    Gao, Lianli
    Shen, Heng Tao
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1359 - 1365
  • [9] Automatic 3D Landmark Extraction System Based on an Encoder-Decoder Using Fusion of Vision and LiDAR
    Kwak, Jeonghoon
    Sung, Yunsick
    REMOTE SENSING, 2020, 12 (07)
  • [10] Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks
    Sun, Mengyuan
    Guo, Runhua
    Zhu, Jinhui
    Fan, Wenhui
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 869 - 874