Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors Epidemic disease prediction modelling

被引:5
|
作者
Fusco, Terence [1 ]
Bi, Yaxin [1 ]
Wang, Haiying [1 ]
Browne, Fiona [1 ]
机构
[1] Univ Ulster, Fac Comp & Engn, Newtownabbey, North Ireland
关键词
Disease prediction modelling; Data imputation; Synthetic data simulation; Schistosomiasis; SMOTE; Incremental transductive approaches; SPATIAL-ANALYSIS; CLIMATE-CHANGE; CLASSIFICATION; PERFORMANCE; IMPUTATION;
D O I
10.1007/s13042-019-01029-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised labelling and synthetic instance simulation when using sparse training data. Innovative semi-supervised ensemble learning paradigms are proposed focusing on labelling threshold selection and stringency of classification confidence levels. A regression-correlation combination (RCC) data imputation method is also introduced for handling of partially complete training data. Results presented in this work show data imputation precision improvement over benchmark value replacement using proposed RCC on 70% of test cases. Proposed novel incremental transductive models such as ITSVM have provided interesting findings based on threshold constraints outperforming standard SVM application on 21% of test cases and can be applied with alternative environment-based epidemic disease domains. The proposed incremental transductive ensemble approach model enables the combination of complimentary algorithms to provide labelling for unlabelled vector density instances. Liberal (LTA) and strict training approaches provided varied results with LTA outperforming Stacking ensemble on 29.1% of test cases. Proposed novel synthetic minority over-sampling technique (SMOTE) equilibrium approach has yielded subtle classification performance increases which can be further interrogated to assess classification performance and efficiency relationships with synthetic instance generation.
引用
收藏
页码:1159 / 1178
页数:20
相关论文
共 50 条
  • [41] Machine learning approaches for asthma disease prediction among adults in Sri Lanka
    Gunawardana, J. R. N. A.
    Viswakula, S. D.
    Rannan-Eliya, Ravindra P.
    Wijemunige, Nilmini
    [J]. HEALTH INFORMATICS JOURNAL, 2024, 30 (03)
  • [42] Diabetes Disease Prediction using Machine Learning on Big Data of Healthcare
    Mir, Ayman
    Dhage, Sudhir N.
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [43] Performance Comparison of Machine Learning Approaches on Hepatitis C Prediction Employing Data Mining Techniques
    Alizargar, Azadeh
    Chang, Yang-Lang
    Tan, Tan-Hsu
    [J]. BIOENGINEERING-BASEL, 2023, 10 (04):
  • [44] Knowledge discovery in open data for epidemic disease prediction
    Wu, ChienHsing
    Kao, Shu-Chen
    [J]. HEALTH POLICY AND TECHNOLOGY, 2021, 10 (01) : 126 - 134
  • [45] Schistosomiasis transmission in Zimbabwe: Modelling based on machine learning
    Li, Hong-Mei
    Zheng, Jin-Xin
    Midzi, Nicholas
    Mutsaka-Makuvaza, Masceline Jenipher
    Lv, Shan
    Xia, Shang
    Qian, Ying-jun
    Xiao, Ning
    Berguist, Robert
    Zhou, Xiao-Nong
    [J]. INFECTIOUS DISEASE MODELLING, 2024, 9 (04) : 1081 - 1094
  • [46] Novel Feature Reduction (NFR) Model With Machine Learning and Data Mining Algorithms for Effective Disease Risk Prediction
    Pasha, Syed Javeed
    Mohamed, E. Syed
    [J]. IEEE ACCESS, 2020, 8 : 184087 - 184108
  • [47] Machine learning based cardiovascular disease prediction
    Chinnasamy, P.
    Kumar, S. Arun
    Navya, V
    Priya, K. Lakshmi
    Boddu, Siva Sruthi
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 459 - 463
  • [48] Machine learning techniques for dental disease prediction
    Iffat Firozy Rimi
    Md. Ariful Islam Arif
    Sharmin Akter
    Md. Riazur Rahman
    A. H. M. Saiful Islam
    Md. Tarek Habib
    [J]. Iran Journal of Computer Science, 2022, 5 (3) : 187 - 195
  • [49] Machine learning based cardiovascular disease prediction
    Chinnasamy, P.
    Kumar, S. Arun
    Navya, V.
    Priya, K. Lakshmi
    Boddu, Siva Sruthi
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 459 - 463
  • [50] Prediction of Heart Disease Using Machine Learning
    Begum, M. Asma
    Abirami, S.
    Anandhi, R.
    Dhivyadharshini, K.
    Devi, R. Ganga
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (04): : 39 - 42