Different gait combinations based on multi-modal deep CNN architectures

被引:0
|
作者
Yaprak, Busranur [1 ]
Gedikli, Eyup [2 ]
机构
[1] Gumushane Univ, Dept Software Engn, TR-29100 Gumushane, Turkiye
[2] Karadeniz Tech Univ, Dept Software Engn, TR-61080 Trabzon, Turkiye
关键词
Gait recognition; Multi-modal deep CNN; Gait Combination; GEI; Silhouette; RECOGNITION; FUSION; MOTION; IMAGE;
D O I
10.1007/s11042-024-18859-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gait recognition is the process of identifying a person from a distance based on their walking patterns. However, the recognition rate drops significantly under cross-view angle and appearance-based variations. In this study, the effectiveness of the most well-known gait representations in solving this problem is investigated based on deep learning. For this purpose, a comprehensive performance evaluation is performed by combining different modalities, including silhouettes, optical flows, and concatenated image of the Gait Energy Image (GEI) head and leg region, with GEI itself. This evaluation is carried out across different multimodal deep convolutional neural network (CNN) architectures, namely fine-tuned EfficientNet-B0, MobileNet-V1, and ConvNeXt-base models. These models are trained separately on GEIs, silhouettes, optical flows, and concatenated image of GEI head and leg regions, and then extracted GEI features are fused in pairs with other extracted modality features to find the most effective gait combination. Experimental results on the two different datasets CASIA-B and Outdoor-Gait show that the concatenated image of GEI head and leg regions significantly increased the recognition rate of the networks compared to other modalities. Moreover, this modality demonstrates greater robustness under varied carrying (BG) and clothing (CL) conditions compared to optical flows (OF) and silhouettes (SF). Codes available at https://github.com/busrakckugurlu/Different-gait-combinations-based-on-multi-modal-deep-CNN-architectures.git
引用
收藏
页码:83403 / 83425
页数:23
相关论文
共 50 条
  • [41] Deep Object Tracking with Multi-modal Data
    Zhang, Xuezhi
    Yuan, Yuan
    Lu, Xiaoqiang
    2016 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2016, : 161 - 165
  • [42] A comparative review on multi-modal sensors fusion based on deep learning
    Tang, Qin
    Liang, Jing
    Zhu, Fangqi
    SIGNAL PROCESSING, 2023, 213
  • [43] Heterogeneous structural responses recovery based on multi-modal deep learning
    Du, Bowen
    Wu, Liyu
    Sun, Leilei
    Xu, Fei
    Li, Linchao
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (02): : 799 - 813
  • [44] Deep Multi-Modal Network Based Automated Depression Severity Estimation
    Uddin, Md Azher
    Joolee, Joolekha Bibi
    Sohn, Kyung-Ah
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2153 - 2167
  • [45] Multi-modal deep fusion based fake news detection method
    Jing Q.
    Fan X.
    Wang B.
    Bi J.
    Tan H.
    High Technology Letters, 2022, 32 (04) : 392 - 403
  • [46] Deep Learning Based Multi-modal Cardiac MR Image Segmentation
    Zheng, Rencheng
    Zhao, Xingzhong
    Zhao, Xingming
    Wang, He
    STATISTICAL ATLASES AND COMPUTATIONAL MODELS OF THE HEART: MULTI-SEQUENCE CMR SEGMENTATION, CRT-EPIGGY AND LV FULL QUANTIFICATION CHALLENGES, 2020, 12009 : 263 - 270
  • [47] Applying deep learning-based multi-modal for detection of coronavirus
    Rani, Geeta
    Oza, Meet Ganpatlal
    Dhaka, Vijaypal Singh
    Pradhan, Nitesh
    Verma, Sahil
    Rodrigues, Joel J. P. C.
    MULTIMEDIA SYSTEMS, 2022, 28 (04) : 1251 - 1262
  • [48] DEEP SEMANTIC SEGMENTATION OF AERIAL IMAGERY BASED ON MULTI-MODAL DATA
    Chen, Kaiqiang
    Fu, Kun
    Sun, Xian
    Weinmann, Michael
    Hinz, Stefan
    Jutzi, Boris
    Weinmann, Martin
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6219 - 6222
  • [49] Applying deep learning-based multi-modal for detection of coronavirus
    Geeta Rani
    Meet Ganpatlal Oza
    Vijaypal Singh Dhaka
    Nitesh Pradhan
    Sahil Verma
    Joel J. P. C. Rodrigues
    Multimedia Systems, 2022, 28 : 1251 - 1262
  • [50] Deep Learning based Multi-modal Ultrasound-Photoacoustic Imaging
    Halder, Sumana
    Patidar, Sankalp
    Chaudhury, Koel
    Mandal, Subhamoy
    PROCEEDINGS OF THE 2024 IEEE SOUTH ASIAN ULTRASONICS SYMPOSIUM, SAUS 2024, 2024,