Fuzzy association rule mining and classification for the prediction of malaria in South Korea

被引:21
|
作者
Buczak, Anna L. [1 ]
Baugher, Benjamin [1 ]
Guven, Erhan [1 ]
Ramac-Thomas, Liane C. [1 ]
Elbert, Yevgeniy [1 ]
Babin, Steven M. [1 ]
Lewis, Sheri H. [1 ]
机构
[1] Johns Hopkins Univ, Appl Phys Lab, Laurel, MD 20723 USA
关键词
Malaria; Prediction; Association rule mining; Fuzzy logic; Classification; Environmental data; Socio-economic data; Epidemiological data; TRANSMISSION; BURDEN; DENGUE; MAPS;
D O I
10.1186/s12911-015-0170-6
中图分类号
R-058 [];
学科分类号
摘要
Background: Malaria is the world's most prevalent vector-borne disease. Accurate prediction of malaria outbreaks may lead to public health interventions that mitigate disease morbidity and mortality. Methods: We describe an application of a method for creating prediction models utilizing Fuzzy Association Rule Mining to extract relationships between epidemiological, meteorological, climatic, and socio-economic data from Korea. These relationships are in the form of rules, from which the best set of rules is automatically chosen and forms a classifier. Two classifiers have been built and their results fused to become a malaria prediction model. Future malaria cases are predicted as LOW, MEDIUM or HIGH, where these classes are defined as a total of 0-2, 3-16, and above 17 cases, respectively, for a region in South Korea during a two-week period. Based on user recommendations, HIGH is considered an outbreak. Results: Model accuracy is described by Positive Predictive Value (PPV), Sensitivity, and F-score for each class, computed on test data not previously used to develop the model. For predictions made 7-8 weeks in advance, model PPV and Sensitivity are 0.842 and 0.681, respectively, for the HIGH classes. The F0.5 and F3 scores (which combine PPV and Sensitivity) are 0.804 and 0.694, respectively, for the HIGH classes. The overall FARM results (as measured by F-scores) are significantly better than those obtained by Decision Tree, Random Forest, Support Vector Machine, and Holt-Winters methods for the HIGH class. For the MEDIUM class, Random Forest and FARM obtain comparable results, with FARM being better at F0.5, and Random Forest obtaining a higher F3. Conclusions: A previously described method for creating disease prediction models has been modified and extended to build models for predicting malaria. In addition, some new input variables were used, including indicators of intervention measures. The South Korea malaria prediction models predict LOW, MEDIUM or HIGH cases 7-8 weeks in the future. This paper demonstrates that our data driven approach can be used for the prediction of different diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Fuzzy association rule mining and classification for the prediction of malaria in South Korea
    Anna L. Buczak
    Benjamin Baugher
    Erhan Guven
    Liane C. Ramac-Thomas
    Yevgeniy Elbert
    Steven M. Babin
    Sheri H. Lewis
    BMC Medical Informatics and Decision Making, 15
  • [2] Using fuzzy association rule mining in cancer classification
    Hamid Mahmoodian
    M. Hamiruce Marhaban
    Raha Abdulrahim
    Rozita Rosli
    Iqbal Saripan
    Australasian Physical & Engineering Sciences in Medicine, 2011, 34 : 41 - 54
  • [3] Incremental Fuzzy Association Rule Mining for Classification and Regression
    Wang, Ling
    Ma, Qian
    Meng, Jianyao
    IEEE ACCESS, 2019, 7 : 121095 - 121110
  • [4] From Fuzzy Association Rule Mining to Effective Classification Framework
    Alhawsawi, Osama
    AL-Saidi, Mayad
    Phi, Michael
    Jarada, Tamer N.
    Khabbaz, Mohammad
    Koockakzadeh, Negar
    Kianmehr, Keivan
    Alhajj, Reda
    Rokne, Jon
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2011, 2011, 6936 : 413 - +
  • [5] ds Using fuzzy association rule mining in cancer classification
    Mahmoodian, Hamid
    Marhaban, M. Hamiruce
    Abdulrahim, Raha
    Rosli, Rozita
    Saripan, Iqbal
    AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2011, 34 (01) : 41 - 54
  • [6] Fuzzy association rule mining approaches for enhancing prediction performance
    Sowan, Bilal
    Dahal, Keshav
    Hossain, M. A.
    Zhang, Li
    Spencer, Linda
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (17) : 6928 - 6937
  • [7] Adaptive fuzzy-evidential classification based on association rule mining
    Geng, Xiaojiao
    Sun, Qingxue
    Zhou, Zhi-Jie
    Jiao, Lianmeng
    Ma, Zongfang
    INFORMATION SCIENCES, 2024, 665
  • [8] Fuzzy association rule mining framework and its application to effective fuzzy associative classification
    Kianmehr, Keivan
    Kaya, Mehmet
    ElSheikh, Abdallah M.
    Jida, Jamal
    Alhajj, Reda
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (06) : 477 - 495
  • [9] MBFP Generalized Association Rule Mining and Classification in Traffic Volume Prediction
    Zhou, Huiyu
    Mabu, Shingo
    Shimada, Kaoru
    Hirasawa, Kotaro
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2011, 6 (05) : 457 - 467
  • [10] A Survey on Fuzzy Association Rule Mining
    Kalia, Harihar
    Dehuri, Satchidananda
    Ghosh, Ashish
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2013, 9 (01) : 1 - 27