Different gait combinations based on multi-modal deep CNN architectures

Authors
Yaprak, Busranur [1]
Gedikli, Eyup [2]
Affiliations
[1] Gumushane Univ, Dept Software Engn, TR-29100 Gumushane, Turkiye
[2] Karadeniz Tech Univ, Dept Software Engn, TR-61080 Trabzon, Turkiye
Keywords
Gait recognition; Multi-modal deep CNN; Gait Combination; GEI; Silhouette; RECOGNITION; FUSION; MOTION; IMAGE
DOI
10.1007/s11042-024-18859-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Gait recognition is the process of identifying a person from a distance based on their walking pattern. However, the recognition rate drops significantly under cross-view and appearance-based variations. This study uses deep learning to investigate how effectively the most well-known gait representations address this problem. To this end, a comprehensive performance evaluation is carried out by combining the Gait Energy Image (GEI) with each of three other modalities: silhouettes, optical flows, and a concatenated image of the GEI head and leg regions. The evaluation spans different multimodal deep convolutional neural network (CNN) architectures, namely fine-tuned EfficientNet-B0, MobileNet-V1, and ConvNeXt-base models. The models are trained separately on GEIs, silhouettes, optical flows, and the concatenated image of the GEI head and leg regions; the extracted GEI features are then fused pairwise with the features extracted from each other modality to find the most effective gait combination. Experimental results on two datasets, CASIA-B and Outdoor-Gait, show that the concatenated image of the GEI head and leg regions significantly increases the recognition rate of the networks compared to the other modalities. Moreover, this modality is more robust under varied carrying (BG) and clothing (CL) conditions than optical flows (OF) and silhouettes (SF). Code is available at https://github.com/busrakckugurlu/Different-gait-combinations-based-on-multi-modal-deep-CNN-architectures.git
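The representations named in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the authors' repository. The function names, the crop fractions `head_frac` and `leg_frac`, and the use of plain concatenation for pairwise fusion are all assumptions made for illustration; the abstract does not state how the head and leg regions are delimited or how the features are fused.

```python
import numpy as np


def gait_energy_image(silhouettes: np.ndarray) -> np.ndarray:
    """Average aligned binary silhouettes (T, H, W) over time into a GEI (H, W)."""
    frames = (silhouettes > 0).astype(np.float32)
    return frames.mean(axis=0)


def head_leg_image(gei: np.ndarray, head_frac: float = 0.2,
                   leg_frac: float = 0.4) -> np.ndarray:
    """Stack the top (head) rows and bottom (leg) rows of a GEI vertically.

    head_frac and leg_frac are hypothetical values, not taken from the paper.
    """
    height = gei.shape[0]
    head_rows = int(round(height * head_frac))
    leg_rows = int(round(height * leg_frac))
    head = gei[:head_rows]
    legs = gei[height - leg_rows:]
    return np.concatenate([head, legs], axis=0)


def fuse_features(gei_feat: np.ndarray, other_feat: np.ndarray) -> np.ndarray:
    """Fuse a GEI feature vector with one other modality's feature vector.

    Concatenation is an assumed fusion rule, chosen only for illustration.
    """
    return np.concatenate([gei_feat, other_feat], axis=-1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic stand-in for one gait cycle of 30 aligned 64x44 silhouettes.
    sils = rng.integers(0, 2, size=(30, 64, 44))
    gei = gait_energy_image(sils)
    combined = head_leg_image(gei)
    print(gei.shape, combined.shape)
```

In a pipeline like the one the abstract describes, `gei_feat` and `other_feat` would be the embeddings produced by the separately trained CNN backbones; here they are left as generic arrays.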
Pages: 83403-83425
Number of pages: 23