A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks

被引:92
|
作者
Reza, Selim [1 ]
Ferreira, Marta Campos [1 ]
Machado, J. J. M. [2 ]
Tavares, Joao Manuel R. S. [1 ,2 ]
机构
[1] Univ Porto, Fac Engn, Rua Dr Roberto Frias,S-N, P-4200465 Porto, Portugal
[2] Univ Porto, Fac Engn, Dept Engn Mecan, Rua Dr Roberto Frias,S-N, P-4200465 Porto, Portugal
关键词
Intelligent transportation system; Time-series forecasting; Deep learning; Long short-term memory; Gated recurrent unit; PeMS; PREDICTION;
D O I
10.1016/j.eswa.2022.117275
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traffic flow forecasting is an essential component of an intelligent transportation system to mitigate congestion. Recurrent neural networks, particularly gated recurrent units and long short-term memory, have been the stateof-the-art traffic flow forecasting models for the last few years. However, a more sophisticated and resilient model is necessary to effectively acquire long-range correlations in the time-series data sequence under analysis. The dominant performance of transformers by overcoming the drawbacks of recurrent neural networks in natural language processing might tackle this need and lead to successful time-series forecasting. This article presents a multi-head attention based transformer model for traffic flow forecasting with a comparative analysis between a gated recurrent unit and a long-short term memory-based model on PeMS dataset in this context. The model uses 5 heads with 5 identical layers of encoder and decoder and relies on Square Subsequent Masking techniques. The results demonstrate the promising performance of the transform-based model in predicting long-term traffic flow patterns effectively after feeding it with substantial amount of data. It also demonstrates its worthiness by increasing the mean squared errors and mean absolute percentage errors by (1.25 - 47.8)% and (32.4 - 83.8)%, respectively, concerning the current baselines.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A traffic flow-forecasting model based on multi-head spatio-temporal attention and adaptive graph convolutional networks
    Zhang, Hong
    Kan, Sunan
    Cao, Jie
    Chen, Linlong
    Zhao, Tianxin
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2022, 33 (10):
  • [2] Self Multi-Head Attention-based Convolutional Neural Networks for fake news detection
    Fang, Yong
    Gao, Jian
    Huang, Cheng
    Peng, Hua
    Wu, Runpu
    [J]. PLOS ONE, 2019, 14 (09):
  • [3] Multi-head Attention-Based Masked Sequence Model for Mapping Functional Brain Networks
    He, Mengshen
    Hou, Xiangyu
    Wang, Zhenwei
    Kang, Zili
    Zhang, Xin
    Qiang, Ning
    Ge, Bao
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT I, 2022, 13431 : 295 - 304
  • [4] Wind Power Forecasting Using Attention-Based Recurrent Neural Networks: A Comparative Study
    Huang, Bin
    Liang, Yuying
    Qiu, Xiaolin
    [J]. IEEE ACCESS, 2021, 9 : 40432 - 40444
  • [5] Multi-head attention-based masked sequence model for mapping functional brain networks
    He, Mengshen
    Hou, Xiangyu
    Ge, Enjie
    Wang, Zhenwei
    Kang, Zili
    Qiang, Ning
    Zhang, Xin
    Ge, Bao
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [6] Attention-based Recurrent Neural Network for Traffic Flow Prediction
    Chen, Qi
    Wang, Wei
    Huang, Xin
    Liang, Hai-ning
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (03): : 831 - 839
  • [7] Multiscaled Multi-Head Attention-Based Video Transformer Network for Hand Gesture Recognition
    Garg, Mallika
    Ghosh, Debashis
    Pradhan, Pyari Mohan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 80 - 84
  • [8] Attention-based spatial-temporal graph transformer for traffic flow forecasting
    Qingyong Zhang
    Wanfeng Chang
    Changwu Li
    Conghui Yin
    Yixin Su
    Peng Xiao
    [J]. Neural Computing and Applications, 2023, 35 : 21827 - 21839
  • [9] Attention-based spatial-temporal graph transformer for traffic flow forecasting
    Zhang, Qingyong
    Chang, Wanfeng
    Li, Changwu
    Yin, Conghui
    Su, Yixin
    Xiao, Peng
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29): : 21827 - 21839
  • [10] Multi-Head Attention-Based Spectrum Sensing for Radio
    Devarakonda, B. V. Ravisankar
    Nandanavam, Venkateswararao
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (02) : 135 - 143