GaitMGL: Multi-Scale Temporal Dimension and Global-Local Feature Fusion for Gait Recognition

被引:5
|
作者
Zhang, Zhipeng [1 ]
Wei, Siwei [2 ,3 ]
Xi, Liya [4 ]
Wang, Chunzhi [1 ]
机构
[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
[2] CCCC Second Highway Consultants Co Ltd, Wuhan 430056, Peoples R China
[3] Wuhan Univ Technol, Sch Comp & Artificial Intelligence, Wuhan 430070, Peoples R China
[4] Wuchang Shouyi Univ, Coll Informat Sci & Engn, Wuhan 430064, Peoples R China
基金
中国国家自然科学基金;
关键词
model-based gait recognition methods; GaitMGL; GLGCN; MTCN;
D O I
10.3390/electronics13020257
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gait recognition has received widespread attention due to its non-intrusive recognition mechanism. Currently, most gait recognition methods use appearance-based recognition methods, and such methods are easily affected by occlusions when facing complex environments, which in turn affects the recognition accuracy. With the maturity of pose estimation techniques, model-based gait recognition methods have received more and more attention due to their robustness in complex environments. However, the current model-based gait recognition methods mainly focus on modeling the global feature information in the spatial dimension, ignoring the importance of local features and their influence on recognition accuracy. Meanwhile, in the temporal dimension, these methods usually use single-scale temporal information extraction, which does not take into account the inconsistency of the motion cycles of the limbs when a human body is walking (e.g., arm swing and leg pace), leading to the loss of some limb temporal information. To solve these problems, we propose a gait recognition network based on a Global-Local Graph Convolutional Network, called GaitMGL. Specifically, we introduce a new spatio-temporal feature extraction module, MGL (Multi-scale Temporal and Global-Local Spatial Extraction Module), which consists of GLGCN (Global-Local Graph Convolutional Network) and MTCN (Multi-scale Temporal Convolutional Network). GLGCN models both global and local features, and extracts global-local motion information. MTCN, on the other hand, takes into account the inconsistency of local limb motion cycles, and facilitates multi-scale temporal convolution to capture the temporal information of limb motion. In short, our GaitMGL solves the problems of loss of local information and loss of temporal information at a single scale that exist in existing model-based gait recognition networks. We evaluated our method on three publicly available datasets, CASIA-B, Gait3D, and GREW, and the experimental results show that our method demonstrates surprising performance and achieves an accuracy of 63.12% in the dataset GREW, exceeding all existing model-based gait recognition networks.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Lightweight silkworm recognition based on Multi-scale feature fusion
    Wen, Chunming
    Wen, Jie
    Li, Jianheng
    Luo, Yunyun
    Chen, Minbo
    Xiao, Zhanpeng
    Xu, Qing
    Liang, Xiang
    An, Hui
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 200
  • [22] GMSN: An efficient multi-scale feature extraction network for gait recognition
    Wei, Tuanjie
    Liu, Mengchi
    Zhao, Huimin
    Li, Huakang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [23] Multi-Scale Feature For Recognition
    Lei, Songze
    Hao, Chongyang
    Qi, Min
    ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 277 - 280
  • [24] Optimally-Weighted Multi-Scale Local Feature Fusion Network for Driver Distraction Recognition
    Fan, Li Shao
    Gao, Shangbing
    IEEE ACCESS, 2022, 10 : 128554 - 128561
  • [25] Optimally-Weighted Multi-Scale Local Feature Fusion Network for Driver Distraction Recognition
    Fan, Li Shao
    Shangbing, Gao
    IEEE Access, 2022, 10 : 128554 - 128561
  • [26] Global and Local Multi-scale Feature Fusion Enhancement for Brain Tumor Segmentation and Pancreas Segmentation
    Wang, Huan
    Wang, Guotai
    Liu, Zijian
    Zhang, Shaoting
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT I, 2020, 11992 : 80 - 88
  • [27] Feature fusion of multi-granularity and multi-scale for facial expression recognition
    Xia, Haiying
    Lu, Lidan
    Song, Shuxiang
    VISUAL COMPUTER, 2024, 40 (03): : 2035 - 2047
  • [28] Feature fusion of multi-granularity and multi-scale for facial expression recognition
    Haiying Xia
    Lidan Lu
    Shuxiang Song
    The Visual Computer, 2024, 40 : 2035 - 2047
  • [29] Scene categorization based on local–global feature fusion and multi-scale multi-spatial resolution encoding
    Jianzhao Qin
    Fuqin Deng
    Nelson H. C. Yung
    Signal, Image and Video Processing, 2014, 8 : 145 - 154
  • [30] Video anomaly detection with multi-scale feature and temporal information fusion
    Cai, Yiheng
    Liu, Jiaqi
    Guo, Yajun
    Hu, Shaobin
    Lang, Shinan
    NEUROCOMPUTING, 2021, 423 : 264 - 273