Appearance-posture fusion network for distracted driving behavior recognition

Authors
Yang, Xiaohui [1]
Qiao, Yu [1]
Han, Shiyuan [1, 2]
Feng, Zhen [3]
Chen, Yuehui [1]
Affiliations
[1] Univ Jinan, Sch Informat Sci & Engn, Jinan 250022, Peoples R China
[2] Shandong Womens Univ, Sch Artificial Intelligence, Jinan, Peoples R China
[3] Jinan Inspur Data Technol Co Ltd, Jinan 250101, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Driver distraction detection; Human posture estimation; Graph convolutional networks; Convolutional neural networks
DOI
10.1016/j.eswa.2024.124883
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, detection techniques based on computer vision and deep learning have shown promise for assessing driver distraction. This paper proposes a fusion network that combines a Spatial-Temporal Graph Convolutional Network (ST-GCN) with a hybrid convolutional network to integrate multimodal input data for recognizing distracted driving behavior. First, to address the limitations of ST-GCN in modeling long-distance joint interactions and its inadequate temporal feature extraction, we design the Spatially Symmetric Configuration Partitioning Graph Convolutional Network (SSCP-GCN), which models the relative motion information of symmetric relationships between limbs. We also use densely connected blocks to process multi-scale temporal information across consecutive frames, which enhances the reuse of low-level features, and we introduce a channel attention mechanism to strengthen the expression of important temporal information. Second, Mixed Convolution (MC), which combines 3D convolution with 2D convolution, cannot extract higher-order temporal information and is limited in modeling global dependencies. We compensate for the former with the Time Shift Module (TSM), which captures higher-order temporal semantic information without consuming additional computational resources. We address the latter with a 3D Multi-Head Self-Attention mechanism (3D MHSA) that integrates the global spatial-temporal information of high-level features, avoiding the growth in model complexity caused by deeply stacked Convolutional Neural Networks (CNNs). Finally, we introduce a multistream network framework that integrates driver posture and appearance features, so that the complementary modalities together yield better model performance. Experimental results show that the proposed network reaches accuracies of 95.6% and 94.3% on the ASU dataset and the NTU-RGB+D dataset, respectively. The small size of the model makes practical application of the algorithm possible.
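The abstract states that the Time Shift Module adds temporal modeling without extra computation. The sketch below illustrates the general idea behind a temporal shift operation, in which a fraction of the feature channels is moved one frame backward or forward so that each frame sees information from its neighbors. It is a generic illustration and not the paper's implementation: the function name `temporal_shift`, the `(T, C, H, W)` layout, the `shift_div` parameter, and the zero padding at the clip boundaries are all assumptions.

```python
import numpy as np


def temporal_shift(x: np.ndarray, shift_div: int = 8) -> np.ndarray:
    """Shift a fraction of channels along the time axis (hypothetical helper).

    x is assumed to have shape (T, C, H, W): frames, channels, height, width.
    One fold of C // shift_div channels is shifted backward in time, a second
    fold is shifted forward, and the remaining channels are left unchanged.
    Positions with no neighboring frame are zero-padded.
    """
    num_channels = x.shape[1]
    fold = num_channels // shift_div
    out = np.zeros_like(x)

    # Fold 1: frame t receives the features of frame t + 1.
    out[:-1, :fold] = x[1:, :fold]
    # Fold 2: frame t receives the features of frame t - 1.
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]
    # Remaining channels are copied through without any shift.
    out[:, 2 * fold:] = x[:, 2 * fold:]
    return out


# Tiny clip: 3 frames, 8 channels, 1x1 spatial size, with value t * 8 + c.
clip = np.arange(3 * 8, dtype=np.float32).reshape(3, 8, 1, 1)
shifted = temporal_shift(clip, shift_div=4)
```

The shift only moves data and involves no multiplications or learned parameters, which is why this kind of operation can mix information across frames at essentially no arithmetic cost. A subsequent 2D convolution over the shifted tensor then combines features from adjacent frames.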
Pages: 11