Time series prediction of COVID-19 by mutation rate analysis using recurrent neural network-based LSTM model

被引:60
|
作者
Pathan, Refat Khan [1 ]
Biswas, Munmun [1 ]
Khandaker, Mayeen Uddin [2 ]
机构
[1] BGC Trust Univ Bangladesh, Dept Comp Sci & Engn, Chittagong 4381, Bangladesh
[2] Sunway Univ, Sch Healthcare & Med Sci, Ctr Biomed Phys, Bandar Sunway 47500, Selangor, Malaysia
关键词
SARS-Cov-2; Gene sequence; Mutation rate; Neural Network; LSTM model; SPIKE;
D O I
10.1016/j.chaos.2020.110018
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
SARS-CoV-2, a novel coronavirus mostly known as COVID-19 has created a global pandemic. The world is now immobilized by this infectious RNA virus. As of June 15, already more than 7.9 million people have been infected and 432k people died. This RNA virus has the ability to do the mutation in the human body. Accurate determination of mutation rates is essential to comprehend the evolution of this virus and to determine the risk of emergent infectious disease. This study explores the mutation rate of the whole genomic sequence gathered from the patient's dataset of different countries. The collected dataset is processed to determine the nucleotide mutation and codon mutation separately. Furthermore, based on the size of the dataset, the determined mutation rate is categorized for four different regions: China, Australia, the United States, and the rest of the World. It has been found that a huge amount of Thymine (T) and Adenine ( A) are mutated to other nucleotides for all regions, but codons are not frequently mutating like nucleotides. A recurrent neural network-based Long Short Term Memory (LSTM) model has been applied to predict the future mutation rate of this virus. The LSTM model gives Root Mean Square Error (RMSE) of 0.06 in testing and 0.04 in training, which is an optimized value. Using this train and testing process, the nucleotide mutation rate of 400th patient in future time has been predicted. About 0.1% increment in mutation rate is found for mutating of nucleotides from T to C and G, C to G and G to T. While a decrement of 0.1% is seen for mutating of T to A, and A to C. It is found that this model can be used to predict day basis mutation rates if more patient data is available in updated time. (C) 2020 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页数:7
相关论文
共 50 条
  • [1] Analysis and Prediction of COVID-19 by using Recurrent LSTM Neural Network Model in Machine Learning
    Dharani, N. P.
    Bojja, Polaiah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 171 - 178
  • [2] Time Series Prediction Method Based on Variant LSTM Recurrent Neural Network
    Hu, Jiaojiao
    Wang, Xiaofeng
    Zhang, Ying
    Zhang, Depeng
    Zhang, Meng
    Xue, Jianru
    NEURAL PROCESSING LETTERS, 2020, 52 (02) : 1485 - 1500
  • [3] Time Series Prediction Method Based on Variant LSTM Recurrent Neural Network
    Jiaojiao Hu
    Xiaofeng Wang
    Ying Zhang
    Depeng Zhang
    Meng Zhang
    Jianru Xue
    Neural Processing Letters, 2020, 52 : 1485 - 1500
  • [4] A Comparison: Prediction of Death and Infected COVID-19 Cases in Indonesia Using Time Series Smoothing and LSTM Neural Network
    Rasjid, Zulfany Erlisa
    Setiawan, Reina
    Effendi, Andy
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 982 - 988
  • [5] Analysis and prediction of worldwide novel coronavirus (COVID-19) infections, using neural network-based techniques
    Sachin Kamley
    R. S. Thakur
    Iran Journal of Computer Science, 2022, 5 (2) : 99 - 107
  • [6] Recurrent Neural Network and Reinforcement Learning Model for COVID-19 Prediction
    Kumar, R. Lakshmana
    Khan, Firoz
    Din, Sadia
    Band, Shahab S.
    Mosavi, Amir
    Ibeke, Ebuka
    FRONTIERS IN PUBLIC HEALTH, 2021, 9
  • [7] FINANCIAL TIME SERIES PREDICTION MODEL BASED RECURRENT NEURAL NETWORK
    Cheng Chaozhi
    Gao Yachun
    Ni Jingwei
    2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 33 - 38
  • [8] COVID-19 Outbreak: An Epidemic Analysis using Time Series Prediction Model
    Kumar, Raghavendra
    Jain, Anjali
    Tripathi, Arun Kumar
    Tyagi, Shaifali
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 1090 - 1094
  • [9] Outbreak prediction of COVID-19 using Recurrent neural network with Gated Recurrent Units
    Natarajan S.
    Kumar M.
    Gadde S.K.K.
    Venugopal V.
    Materials Today: Proceedings, 2023, 80 : 3433 - 3437
  • [10] Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms
    Luo, Junling
    Zhang, Zhongliang
    Fu, Yao
    Rao, Feng
    RESULTS IN PHYSICS, 2021, 27