Multi-level Attention Fusion for Multimodal Driving Maneuver Recognition

Authors
Liu, Jing [1]
Liu, Yang [1]
Tian, Chengwen [1]
Zhao, Mengyang [1]
Zeng, Xinhua [1]
Song, Liang [1]
Affiliations
[1] Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China
Keywords
Driving maneuver recognition; multimodal sensing signals; attention; convolutional neural network; gated recurrent unit network; IDENTIFICATION
DOI
10.1109/ISCAS48785.2022.9937710
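Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract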
Sensor-based driving maneuver recognition (DMR) is a fundamental and challenging task in ubiquitous computing that uses multimodal signals from embedded sensors, such as accelerometers and gyroscopes, to recognize driving maneuvers. However, the spatial-temporal features extracted by neural networks are often treated equally, which may limit the model's performance in predicting maneuvers. In this paper, we propose a novel hybrid neural network model based on multi-level attention fusion for multimodal DMR. The proposed model uses convolutional neural networks and gated recurrent units to extract spatial-temporal features from multimodal sensing signals, and applies multi-level attention fusion to explore the significant patterns over local and global periods. In addition, we design three different levels of fusion (early, late, and full fusion) to explore the effects of different attention fusions on the model. Extensive experiments on a real-world dataset show that the proposed model outperforms the baseline methods, and that multi-level attention fusion brings a 6.17% gain in F1-score.
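The abstract contrasts treating all spatial-temporal features equally with weighting them by attention. The following is a minimal sketch of that contrast only, not the authors' architecture: the abstract does not specify the attention formulation, so the dot-product scoring, the fixed `query` vector, and the toy `features` values are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features, query):
    """Weight each time step by softmax(dot(step, query)) and sum.

    features: list of T feature vectors (each a list of D floats),
              standing in for CNN/GRU outputs (assumed, not from the paper).
    query:    hypothetical scoring vector of length D; in a trained model
              this would be learned, here it is fixed by hand.
    """
    scores = [sum(f * q for f, q in zip(step, query)) for step in features]
    weights = softmax(scores)
    dim = len(features[0])
    pooled = [
        sum(w * step[d] for w, step in zip(weights, features))
        for d in range(dim)
    ]
    return pooled, weights


def mean_pool(features):
    """Baseline: every time step contributes equally."""
    t = len(features)
    dim = len(features[0])
    return [sum(step[d] for step in features) / t for d in range(dim)]


# Toy input: 4 time steps, 2 features; the third step has a strong response.
features = [[0.1, 0.0], [0.2, 0.1], [2.0, 1.5], [0.1, 0.2]]
query = [1.0, 1.0]

pooled, weights = attention_pool(features, query)
baseline = mean_pool(features)
```

In this toy input the attention weights concentrate on the third time step, so the pooled vector stays close to that step's features, whereas mean pooling dilutes it with the three weak steps. How the paper combines such weights at the early, late, and full fusion levels is not described in the abstract and is not modeled here.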
Pages: 2609-2613
Number of pages: 5