Exploratory Data Analysis and Artificial Neural Network for Prediction of Leptospirosis Occurrence in Seremban, Malaysia Based on Meteorological Data

Cited by: 4
Authors
Rahmat, Fariq [1 ]
Zulkafli, Zed [2 ]
Ishak, Asnor Juraiza [1 ]
Noor, Samsul Bahari Mohd [1 ]
Yahaya, Hazlina [3 ]
Masrani, Afiqah [3 ]
Affiliations
[1] Univ Putra Malaysia, Dept Elect & Elect Engn, Serdang, Malaysia
[2] Univ Putra Malaysia, Dept Civil Engn, Serdang, Malaysia
[3] Negeri Sembilan State Dept Hlth, Seremban, Malaysia
Keywords
artificial neural network; exploratory data analysis; predictive modeling; leptospirosis; meteorological data; PATHOGENIC LEPTOSPIRA; RISK-FACTORS; TEMPERATURE; SURVIVAL; HEALTH; AREAS
DOI
10.3389/feart.2020.00377
Chinese Library Classification (CLC)
P [Astronomy, Earth Sciences]
Discipline classification code
07
Abstract
Leptospirosis outbreaks in various parts of the world have been linked to changes in the weather. These effects have been shown to occur at different lags of up to 10 months, which affects the performance of simulation models that predict leptospirosis occurrence. In Malaysia, the link between leptospirosis and different weather parameters at different time lags has yet to be established, despite an increasing number of cases in recent years. In this study, a combination of data mining and machine learning is used to analyze, capture, and predict the relation between leptospirosis occurrence and temperature, rainfall, and relative humidity, using the Seremban district in Malaysia as a case study. First, the optimal time lags for rainfall were determined using graphical exploratory data analysis (EDA), while non-graphical EDA was used for temperature. Then, an artificial neural network (ANN) model was developed to classify combinations of the selected features into disease occurrence and non-occurrence using back-propagation training, with the number of hidden layers and hidden nodes optimized. Each model was evaluated by its accuracy, sensitivity, and specificity. The EDA showed that leptospirosis occurrence in Seremban is highly correlated with weekly average temperature at a lag of 16 weeks and with weekly rainfall amount at lags of 12-20 weeks. Using these selected features, the ANN model achieved its highest accuracy, sensitivity, and specificity of 84.00%, 86.44%, and 79.33%, respectively. Overall, the EDA approach increased the accuracy of the predictive model by 13.30-31.26% over the baseline models.
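The two quantitative ideas in the abstract, scanning time lags between a weather series and case counts and scoring a binary classifier by accuracy, sensitivity, and specificity, can be illustrated with a minimal sketch. This is not the authors' code: the function names (`lagged_correlation`, `best_lag`, `classification_metrics`) are hypothetical, Pearson correlation is an assumed stand-in for the graphical and non-graphical EDA the study describes, and the data in the usage note is synthetic.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)


def lagged_correlation(weather, cases, lag):
    """Correlate weather at week t - lag with cases at week t."""
    if lag == 0:
        return pearson(weather, cases)
    return pearson(weather[:-lag], cases[lag:])


def best_lag(weather, cases, max_lag):
    """Return (lag with the largest absolute correlation, all scores by lag)."""
    scores = {lag: lagged_correlation(weather, cases, lag)
              for lag in range(max_lag + 1)}
    return max(scores, key=lambda k: abs(scores[k])), scores


def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity, and specificity for labels 1 (occurrence) and 0."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # true positive rate
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # true negative rate
    return accuracy, sensitivity, specificity
```

As a usage illustration on synthetic data, a case series built by delaying a weather series by three weeks, such as `weather = [(i * 7) % 11 for i in range(40)]` and `cases = [0, 0, 0] + weather[:-3]`, gives `best_lag(weather, cases, 8)[0] == 3`. In a study like this one, the lags selected this way would define the input features passed to the classifier.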
Pages: 14