Exploratory Data Analysis and Artificial Neural Network for Prediction of Leptospirosis Occurrence in Seremban, Malaysia Based on Meteorological Data

被引:4
|
作者
Rahmat, Fariq [1 ]
Zulkafli, Zed [2 ]
Ishak, Asnor Juraiza [1 ]
Noor, Samsul Bahari Mohd [1 ]
Yahaya, Hazlina [3 ]
Masrani, Afiqah [3 ]
机构
[1] Univ Putra Malaysia, Dept Elect & Elect Engn, Serdang, Malaysia
[2] Univ Puta Malaysia, Dept Civil Engn, Serdang, Malaysia
[3] Negeri Sembilan State Dept Hlth, Seremban, Malaysia
关键词
artificial neural network; exploratory data analysis; predictive modeling; leptospirosis; meteorological data; PATHOGENIC LEPTOSPIRA; RISK-FACTORS; TEMPERATURE; SURVIVAL; HEALTH; AREAS;
D O I
10.3389/feart.2020.00377
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Leptospirosis outbreaks in various parts of the world have been linked to changes in the weather. Furthermore, the effects have been shown to occur at different lags of up to 10 months, affecting the performance of simulation models that predict leptospirosis occurrence. In Malaysia, the link between different weather parameters, at different time lags, has yet to be established despite an increasing number of cases in recent years. In this study, a combination of data mining and machine learning is used to analyze, capture, and predict the relation between leptospirosis occurrence and temperature, rainfall, and relative humidity using the Seremban district in Malaysia as a case study. First, the optimal time lags for rainfall were determined using graphical exploratory data analysis (EDA) while non-graphical EDA was used for temperature. Then, an artificial neural network (ANN) model is developed to classify the combination of selected features into disease occurrence and non-occurrence using back-propagation training, optimizing the number of hidden layers and hidden nodes. The success is measured using accuracy, sensitivity, and specificity of each model. EDA has shown that leptospirosis occurrence in Seremban is highly correlated with weekly average temperature at lag 16 weeks and weekly rainfall amount at lag 12-20 weeks. Using these selected features, the ANN model achieved the highest accuracy, sensitivity, and specificity at 84.00, 86.44, and 79.33%, respectively. Overall, the EDA approach has increased the accuracy of the predictive model by 13.30-31.26% from the baseline models.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A neural network-based local rainfall prediction system using meteorological data on the Internet: A case study using data from the Japan Meteorological Agency
    Kashiwao, Tomoaki
    Nakayama, Koichi
    Ando, Shin
    Ikeda, Kenji
    Lee, Moonyong
    Bahadori, Alireza
    APPLIED SOFT COMPUTING, 2017, 56 : 317 - 330
  • [42] Prediction of environmental effects in received signal strength in FM/TV station based on meteorological parameters using artificial neural network and data mining
    Mirbagheri, Seyed Ahmad
    Mohammadi, Mostafa
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 250
  • [43] Artificial neural network data analysis for classification of soils based on their radionuclide content
    Dragovic, S.
    Onjia, A.
    RUSSIAN JOURNAL OF PHYSICAL CHEMISTRY A, 2007, 81 (09) : 1477 - 1481
  • [44] Artificial neural network data analysis for classification of soils based on their radionuclide content
    S. Dragović
    A. Onjia
    Russian Journal of Physical Chemistry A, 2007, 81 : 1477 - 1481
  • [45] Comparison of Kriging and artificial neural network models for the prediction of spatial data
    Tavassoli, Abbas
    Waghei, Yadollah
    Nazemi, Alireza
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (02) : 352 - 369
  • [46] Solar radiation forecasting based on meteorological data using artificial neural networks
    Ghanbarzadeh, A.
    Noghrehabadi, A. R.
    Assareh, E.
    Behrang, M. A.
    2009 7TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS, VOLS 1 AND 2, 2009, : 227 - +
  • [47] Consequences of data uncertainty and data precision in artificial neural network sugar cane yield prediction
    Satizabal M., Hector F.
    Jimenez R., Daniel R.
    Perez-Uribe, Andres
    COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 1147 - +
  • [48] Improved artificial neural network for data analysis and property prediction in slag class-ceramic
    Wen, QY
    Zhang, HW
    Zhang, PX
    Jiang, XD
    JOURNAL OF THE AMERICAN CERAMIC SOCIETY, 2005, 88 (07) : 1765 - 1769
  • [50] Big Data Analysis and Prediction System Based on Improved Convolutional Neural Network
    Du, Xuegong
    Cao, Xiaojun
    Zhang, Rui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022