Exploratory Data Analysis and Artificial Neural Network for Prediction of Leptospirosis Occurrence in Seremban, Malaysia Based on Meteorological Data

被引:4
|
作者
Rahmat, Fariq [1 ]
Zulkafli, Zed [2 ]
Ishak, Asnor Juraiza [1 ]
Noor, Samsul Bahari Mohd [1 ]
Yahaya, Hazlina [3 ]
Masrani, Afiqah [3 ]
机构
[1] Univ Putra Malaysia, Dept Elect & Elect Engn, Serdang, Malaysia
[2] Univ Puta Malaysia, Dept Civil Engn, Serdang, Malaysia
[3] Negeri Sembilan State Dept Hlth, Seremban, Malaysia
关键词
artificial neural network; exploratory data analysis; predictive modeling; leptospirosis; meteorological data; PATHOGENIC LEPTOSPIRA; RISK-FACTORS; TEMPERATURE; SURVIVAL; HEALTH; AREAS;
D O I
10.3389/feart.2020.00377
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Leptospirosis outbreaks in various parts of the world have been linked to changes in the weather. Furthermore, the effects have been shown to occur at different lags of up to 10 months, affecting the performance of simulation models that predict leptospirosis occurrence. In Malaysia, the link between different weather parameters, at different time lags, has yet to be established despite an increasing number of cases in recent years. In this study, a combination of data mining and machine learning is used to analyze, capture, and predict the relation between leptospirosis occurrence and temperature, rainfall, and relative humidity using the Seremban district in Malaysia as a case study. First, the optimal time lags for rainfall were determined using graphical exploratory data analysis (EDA) while non-graphical EDA was used for temperature. Then, an artificial neural network (ANN) model is developed to classify the combination of selected features into disease occurrence and non-occurrence using back-propagation training, optimizing the number of hidden layers and hidden nodes. The success is measured using accuracy, sensitivity, and specificity of each model. EDA has shown that leptospirosis occurrence in Seremban is highly correlated with weekly average temperature at lag 16 weeks and weekly rainfall amount at lag 12-20 weeks. Using these selected features, the ANN model achieved the highest accuracy, sensitivity, and specificity at 84.00, 86.44, and 79.33%, respectively. Overall, the EDA approach has increased the accuracy of the predictive model by 13.30-31.26% from the baseline models.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Artificial neural network model with different backpropagation algorithms and meteorological data for solar radiation prediction
    Heng, Seah Yi
    Ridwan, Wanie M.
    Kumar, Pavitra
    Ahmed, Ali Najah
    Fai, Chow Ming
    Birima, Ahmed Hussein
    El-Shafie, Ahmed
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [12] Prediction of Parent Data of Silkworm Breeding Based on Artificial Neural Network
    KwangGyun Sin
    MyongGuk Kim
    JinMyong Cha
    MyongIl Jin
    YongSik Choe
    National Academy Science Letters, 2023, 46 : 209 - 212
  • [13] Prediction of Parent Data of Silkworm Breeding Based on Artificial Neural Network
    Sin, KwangGyun
    Kim, MyongGuk
    Cha, JinMyong
    Jin, MyongIl
    Choe, YongSik
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2023, 46 (03): : 209 - 212
  • [14] Exploratory Data Analysis using Artificial Neural Networks
    Sriram, D.
    Kalaivani, K.
    Ulagapriya, K.
    Saritha, A.
    Sajeevram, A.
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCES AND DEVELOPMENTS IN ELECTRICAL AND ELECTRONICS ENGINEERING (ICADEE), 2020, : 186 - 195
  • [15] A neural network based method for classification of meteorological data
    Kaminski, K
    Kaminski, W
    Strumillo, P
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 616 - 621
  • [16] Big Data Analysis and User Behavior Prediction of Social Networks Based on Artificial Neural Network
    Liu Z.
    Song T.
    Journal of Computing and Information Technology, 2023, 31 (03) : 185 - 201
  • [17] Artificial neural network prediction for coke strength after reaction and data analysis
    Maharana, Sulata
    Biswas, B.
    Ganguly, Adity
    Kumar, Ashok
    World Academy of Science, Engineering and Technology, 2010, 45 : 652 - 656
  • [18] Healthcare Big Data Analysis with Artificial Neural Network for Cardiac Disease Prediction
    Mohapatra, Sulagna
    Sahoo, Prasan Kumar
    Mohapatra, Suvendu Kumar
    ELECTRONICS, 2024, 13 (01)
  • [19] Sensitivity analysis of artificial neural network for chlorophyll prediction using hyperspectral data
    Srivastava, Prashant K.
    Gupta, Manika
    Singh, Ujjwal
    Prasad, Rajendra
    Pandey, Prem Chandra
    Raghubanshi, A. S.
    Petropoulos, George P.
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2021, 23 (04) : 5504 - 5519
  • [20] Artificial neural network prediction for Coke strength after reaction and data analysis
    Maharana, Sulata
    Biswas, B.
    Ganguly, Adity
    Kumar, Ashok
    World Academy of Science, Engineering and Technology, 2010, 70 : 652 - 656