GaitMGL: Multi-Scale Temporal Dimension and Global-Local Feature Fusion for Gait Recognition

被引:5
|
作者
Zhang, Zhipeng [1 ]
Wei, Siwei [2 ,3 ]
Xi, Liya [4 ]
Wang, Chunzhi [1 ]
机构
[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
[2] CCCC Second Highway Consultants Co Ltd, Wuhan 430056, Peoples R China
[3] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Wuhan 430070, Peoples R China
[4] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China
基金
中国国家自然科学基金;
关键词
model-based gait recognition methods; GaitMGL; GLGCN; MTCN;
D O I
10.3390/electronics13020257
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gait recognition has received widespread attention due to its non-intrusive recognition mechanism. Currently, most gait recognition methods use appearance-based recognition methods, and such methods are easily affected by occlusions when facing complex environments, which in turn affects the recognition accuracy. With the maturity of pose estimation techniques, model-based gait recognition methods have received more and more attention due to their robustness in complex environments. However, the current model-based gait recognition methods mainly focus on modeling the global feature information in the spatial dimension, ignoring the importance of local features and their influence on recognition accuracy. Meanwhile, in the temporal dimension, these methods usually use single-scale temporal information extraction, which does not take into account the inconsistency of the motion cycles of the limbs when a human body is walking (e.g., arm swing and leg pace), leading to the loss of some limb temporal information. To solve these problems, we propose a gait recognition network based on a Global-Local Graph Convolutional Network, called GaitMGL. Specifically, we introduce a new spatio-temporal feature extraction module, MGL (Multi-scale Temporal and Global-Local Spatial Extraction Module), which consists of GLGCN (Global-Local Graph Convolutional Network) and MTCN (Multi-scale Temporal Convolutional Network). GLGCN models both global and local features, and extracts global-local motion information. MTCN, on the other hand, takes into account the inconsistency of local limb motion cycles, and facilitates multi-scale temporal convolution to capture the temporal information of limb motion. In short, our GaitMGL solves the problems of loss of local information and loss of temporal information at a single scale that exist in existing model-based gait recognition networks. We evaluated our method on three publicly available datasets, CASIA-B, Gait3D, and GREW, and the experimental results show that our method demonstrates surprising performance and achieves an accuracy of 63.12% in the dataset GREW, exceeding all existing model-based gait recognition networks.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Speech emotion recognition based on multi-dimensional feature extraction and multi-scale feature fusion
    Yu, Lingli
    Xu, Fengjun
    Qu, Yundong
    Zhou, Kaijun
    APPLIED ACOUSTICS, 2024, 216
  • [42] Scene categorization based on local-global feature fusion and multi-scale multi-spatial resolution encoding
    Qin, Jianzhao
    Deng, Fuqin
    Yung, Nelson H. C.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2014, 8 : S145 - S154
  • [43] Multi-scale Temporal Feature Fusion for Time-Limited Order Prediction*
    Wang, Jun
    Zhou, Xiaolei
    Liu, Yaochang
    Zhang, Xinrui
    Wang, Shuai
    WIRELESS SENSOR NETWORKS, CWSN 2022, 2022, 1715 : 132 - 144
  • [44] EEG emotion recognition approach using multi-scale convolution and feature fusion
    Zhang, Yong
    Shan, Qingguo
    Chen, Wenyun
    Liu, Wenzhe
    VISUAL COMPUTER, 2024, : 4157 - 4169
  • [45] Research on Bone Stick Text Recognition Method with Multi-Scale Feature Fusion
    Du, Mengxiu
    Wang, Huiqin
    Liu, Rui
    Wang, Ke
    Wang, Zhan
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [46] A multi-scale feature fusion convolutional neural network for facial expression recognition
    Zhang, Xiufeng
    Fu, Xingkui
    Qi, Guobin
    Zhang, Ning
    EXPERT SYSTEMS, 2024, 41 (04)
  • [47] Recognition of abnormal car door noise based on multi-scale feature fusion
    Wang, Xiaolan
    Song, Yongchao
    Su, Lili
    Wang, Yansong
    Pan, Zuofeng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 237 (06) : 1353 - 1364
  • [48] A deep-shallow and global-local multi-feature fusion network for photometric stereo
    Liu, Yanru
    Ju, Yakun
    Jian, Muwei
    Gao, Feng
    Rao, Yuan
    Hu, Yeqi
    Dong, Junyu
    IMAGE AND VISION COMPUTING, 2022, 118
  • [49] GaitASMS: gait recognition by adaptive structured spatial representation and multi-scale temporal aggregation
    Sun, Yan
    Long, Hu
    Feng, Xueling
    Nixon, Mark
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7057 - 7069
  • [50] Gaitts: indoor gait recognition with multi-scale temporal-spatial information aggregation
    Zhang, Langwen
    Men, Zihan
    Xie, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)