Automated efficient traffic gesture recognition using swin transformer-based multi-input deep network with radar images

被引:0
|
作者
Firat, Huseyin [1 ]
Uzen, Huseyin [2 ]
Atila, Orhan [3 ]
Sengur, Abdulkadir [4 ]
机构
[1] Dicle Univ, Fac Engn, Dept Comp Engn, Diyarbakir, Turkiye
[2] Bingol Univ, Fac Engn & Architecture, Dept Comp Engn, Bingol, Turkiye
[3] Firat Univ, Technol Fac, Elect Elect Engn Dept, Elazig, Turkiye
[4] Firat Univ, Fac Technol, Dept Elect & Elect Engn, Elazig, Turkiye
关键词
Deep learning; Radar images; Swin transformers; Traffic hand gesture;
D O I
10.1007/s11760-024-03664-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Radar-based artificial intelligence (AI) applications have gained significant attention recently, spanning from fall detection to gesture recognition. The growing interest in this field has led to a shift towards deep convolutional networks, and transformers have emerged to address limitations in convolutional neural network methods, becoming increasingly popular in the AI community. In this paper, we present a novel hybrid approach for radar-based traffic hand gesture classification using transformers. Traffic hand gesture recognition (HGR) holds importance in AI applications, and our proposed three-phase approach addresses the efficiency and effectiveness of traffic HGR. In the initial phase, feature vectors are extracted from input radar images using the pre-trained DenseNet-121 model. These features are then consolidated by concatenating them to gather information from diverse radar sensors, followed by a patch extraction operation. The concatenated features from all inputs are processed in the Swin transformer block to facilitate further HGR. The classification stage involves sequential application of global average pooling, Dense, and Softmax layers. To assess the effectiveness of our method on ULM university radar dataset, we employ various performance metrics, including accuracy, precision, recall, and F1-score, achieving an average accuracy score of 90.54%. We compare this score with existing approaches to demonstrate the competitiveness of our proposed method.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Skeleton-Based Hand Gesture Recognition by Using Multi-input Fusion Lightweight Network
    Hu, Qihao
    Gao, Qing
    Gao, Hongwei
    Ju, Zhaojie
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I, 2022, 13455 : 24 - 34
  • [2] A Transformer-Based Network for Dynamic Hand Gesture Recognition
    D'Eusanio, Andrea
    Simoni, Alessandro
    Pini, Stefano
    Borghi, Guido
    Vezzani, Roberto
    Cucchiara, Rita
    2020 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2020), 2020, : 623 - 632
  • [3] Malware Detection for Portable Executables Using a Multi-input Transformer-based Approach
    Huoh, Ting-Li
    Miskell, Timothy
    Barut, Onur
    Luo, Yan
    Li, Peilong
    Zhang, Tong
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 778 - 782
  • [4] MULTI-VIEW BISTATIC SYNTHETIC APERTURE RADAR TARGET RECOGNITION BASED ON MULTI-INPUT DEEP CONVOLUTIONAL NEURAL NETWORK
    Pei, Jifang
    Huo, Weibo
    Zhang, Qianghui
    Huang, Yulin
    Miao, Yuxuan
    Zhang, Yin
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2314 - 2317
  • [5] SwinFMCW: A Joint Swin Transformer and LSTM Method for Gesture and Identity Recognition Using FMCW Radar
    Sun, Beichen
    Xu, Zhimeng
    Wu, Zhenbin
    Zhang, Shanshan
    2022 CROSS STRAIT RADIO SCIENCE & WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC, 2022,
  • [6] Korean Sign Language Recognition Using Transformer-Based Deep Neural Network
    Shin, Jungpil
    Musa Miah, Abu Saleh
    Hasan, Md. Al Mehedi
    Hirooka, Koki
    Suzuki, Kota
    Lee, Hyoun-Sup
    Jang, Si-Woong
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [7] Deep learning based automated vein recognition using swin transformer and super graph glue model
    Bhushan, Kavi
    Singh, Surendra
    Kumar, Kamal
    Kumar, Parveen
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [8] Multi-Input Deep Learning Based FMCW Radar Signal Classification
    Cha, Daewoong
    Jeong, Sohee
    Yoo, Minwoo
    Oh, Jiyong
    Han, Dongseog
    ELECTRONICS, 2021, 10 (10)
  • [9] Transformer-based deep reverse attention network for multi-sensory human activity recognition
    Pramanik, Rishav
    Sikdar, Ritodeep
    Sarkar, Ram
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [10] WAVEGLOVE: TRANSFORMER-BASED HAND GESTURE RECOGNITION USING MULTIPLE INERTIAL SENSORS
    Kralik, Matej
    Suppa, Marek
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1576 - 1580