Hybrid deep learning of social media big data for predicting the evolution of COVID-19 transmission

被引:23
|
作者
Chew, Alvin Wei Ze [1 ]
Pan, Yue [2 ]
Wang, Ying [3 ]
Zhang, Limao [3 ]
机构
[1] Bentley Syst Res Off, 1 Harbourfront Pl,HarbourFront Tower One, Singapore 098633, Singapore
[2] Shanghai Jiao Tong Univ, Dept Civil Engn, Shanghai Key Lab Digital Maintenance Bldg & Infra, 800 Dongchuan Rd, Shanghai, Peoples R China
[3] Nanyang Technol Univ, Sch Civil & Environm Engn, 50 Nanyang Ave, Singapore 639798, Singapore
关键词
COVID-19; Natural language processing; Time-series prediction; Deep learning; Big-data;
D O I
10.1016/j.knosys.2021.107417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, a hybrid deep-learning model termed as ODANN, built upon neural networks (NN) coupled with data assimilation and natural language processing (NLP) features extraction methods, has been constructed to concurrently process daily COVID-19 time-series records and large volumes of COVID-19 related Twitter data, as representative of the global community's aggregated emotional responses towards the current pandemic, to model the growth rate in the number of confirmed COVID-19 cases globally via a proposed G parameter. Overall, there were 3 key components to ODANN's development phase, namely: (i) data hydration and pre-processing were performed on COVID-19 related Twitter data ranging between 23 January 2020 and 10 May 2020, which amounted to over 100 million Tweets written in English language; (ii) multiple NLP features extraction methods were subsequently leveraged to encode the hydrated Twitter data into useful semantic word vectors for training ODANN under an optimal set of hyperparameters; and (iii) historical time-series data of defined characteristics were also assimilated into ODANN's selected hidden layer(s) to model the G parameter daily with a lead-time of 1 day. By far, our experimental results demonstrated that by adopting a rolling time-window size of 5 days, with respect to the number of historical time-series records for assimilating different data features, enabled ODANN to outperform other traditional time -series models and recent studies, in terms of the computed RMSE and MAE scores attained from the model's testing step. Overall, the summarized results from ODANN demonstrated its competitive edge in modelling and forecasting the growth rate in the number of COVID-19 cases globally. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A Hybrid Deep Learning Model to Predict the Impact of COVID-19 on Mental Health From Social Media Big Data
    Al Banna, Md. Hasan
    Ghosh, Tapotosh
    Al Nahian, Md. Jaber
    Kaiser, M. Shamim
    Mahmud, Mufti
    Abu Taher, Kazi
    Hossain, Mohammad Shahadat
    Andersson, Karl
    [J]. IEEE ACCESS, 2023, 11 : 77009 - 77022
  • [2] Hybrid Deep-Learning and Machine-Learning Models for Predicting COVID-19
    Qaid, Talal S.
    Mazaar, Hussein
    Al-Shamri, Mohammad Yahya H.
    Alqahtani, Mohammed S.
    Raweh, Abeer A.
    Alakwaa, Wafaa
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [3] Hybrid Deep Learning Techniques for Predicting Complex Phenomena: A Review on COVID-19
    Jamshidi, Mohammad Behdad
    Roshani, Sobhan
    Daneshfar, Fatemeh
    Lalbakhsh, Ali
    Roshani, Saeed
    Parandin, Fariborz
    Malek, Zahra
    Talla, Jakub
    Peroutka, Zdenek
    Jamshidi, Alireza
    Hadjilooei, Farimah
    Lalbakhsh, Pedram
    [J]. AI, 2022, 3 (02) : 416 - 433
  • [4] Predicting COVID-19 pandemic waves including vaccination data with deep learning
    Begga, Ahmed
    Garibo-i-Orts, Oscar
    de Maria-Garcia, Sergi
    Escolano, Francisco
    Lozano, Miguel A.
    Oliver, Nuria
    Conejero, J. Alberto
    [J]. FRONTIERS IN PUBLIC HEALTH, 2023, 11
  • [5] COVID-19 vaccine hesitancy: a social media analysis using deep learning
    Nyawa, Serge
    Tchuente, Dieudonne
    Fosso-Wamba, Samuel
    [J]. ANNALS OF OPERATIONS RESEARCH, 2024, 339 (1-2) : 477 - 515
  • [6] Deep learning for predicting COVID-19 malignant progression
    Fang, Cong
    Bai, Song
    Chen, Qianlan
    Zhou, Yu
    Xia, Liming
    Qin, Lixin
    Gong, Shi
    Xie, Xudong
    Zhou, Chunhua
    Tu, Dandan
    Zhang, Changzheng
    Liu, Xiaowu
    Chen, Weiwei
    Bai, Xiang
    Torr, Philip H. S.
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 72
  • [7] Forecasting the Spread of COVID-19 Using Deep Learning and Big Data Analytics Methods
    Kiganda C.
    Akcayol M.A.
    [J]. SN Computer Science, 4 (4)
  • [8] Machine Learning and OLAP on Big COVID-19 Data
    Leung, Carson K.
    Chen, Yubo
    Hoi, Calvin S. H.
    Shang, Siyuan
    Cuzzocrea, Alfredo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5118 - 5127
  • [9] The Role of Big Data and Machine Learning in COVID-19
    Ababneh, Mustafa
    Aljarrah, Aayat
    Karagozlu, Damla
    [J]. BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2020, 11 (02) : 1 - 20
  • [10] Sentiment analysis of COVID-19 social media data through machine learning
    Dharmendra Dangi
    Dheeraj K. Dixit
    Amit Bhagat
    [J]. Multimedia Tools and Applications, 2022, 81 : 42261 - 42283