Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors Epidemic disease prediction modelling

被引:5
|
作者
Fusco, Terence [1 ]
Bi, Yaxin [1 ]
Wang, Haiying [1 ]
Browne, Fiona [1 ]
机构
[1] Univ Ulster, Fac Comp & Engn, Newtownabbey, North Ireland
关键词
Disease prediction modelling; Data imputation; Synthetic data simulation; Schistosomiasis; SMOTE; Incremental transductive approaches; SPATIAL-ANALYSIS; CLIMATE-CHANGE; CLASSIFICATION; PERFORMANCE; IMPUTATION;
D O I
10.1007/s13042-019-01029-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised labelling and synthetic instance simulation when using sparse training data. Innovative semi-supervised ensemble learning paradigms are proposed focusing on labelling threshold selection and stringency of classification confidence levels. A regression-correlation combination (RCC) data imputation method is also introduced for handling of partially complete training data. Results presented in this work show data imputation precision improvement over benchmark value replacement using proposed RCC on 70% of test cases. Proposed novel incremental transductive models such as ITSVM have provided interesting findings based on threshold constraints outperforming standard SVM application on 21% of test cases and can be applied with alternative environment-based epidemic disease domains. The proposed incremental transductive ensemble approach model enables the combination of complimentary algorithms to provide labelling for unlabelled vector density instances. Liberal (LTA) and strict training approaches provided varied results with LTA outperforming Stacking ensemble on 29.1% of test cases. Proposed novel synthetic minority over-sampling technique (SMOTE) equilibrium approach has yielded subtle classification performance increases which can be further interrogated to assess classification performance and efficiency relationships with synthetic instance generation.
引用
收藏
页码:1159 / 1178
页数:20
相关论文
共 50 条
  • [1] Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectorsEpidemic disease prediction modelling
    Terence Fusco
    Yaxin Bi
    Haiying Wang
    Fiona Browne
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 1159 - 1178
  • [2] A Review: Machine Learning and Data Mining Approaches for Cardiovascular Disease Diagnosis and Prediction
    Rao, Gorapalli Srinivasa
    Muneeswari, G.
    [J]. EAI Endorsed Transactions on Pervasive Health and Technology, 2024, 10
  • [3] Prediction of atherosclerotic disease progression combining computational modelling with machine learning
    Sakellarios, Antonis I.
    Pezoulas, Vasileios C.
    Bourantas, Christos
    Naka, Katerina K.
    Michalis, Lampros K.
    Serruys, Patrick W.
    Stone, Gregg
    Garcia-Garcia, Hector M.
    Fotiadis, Dimitrios I.
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 2760 - 2763
  • [4] An Outcome Based Analysis on Heart Disease Prediction using Machine Learning Algorithms and Data Mining Approaches
    Deb, Aushtmi
    Koli, Mst Sadia Akter
    Akter, Sheikh Beauty
    Chowdhury, Adil Ahmed
    [J]. 2022 IEEE WORLD AI IOT CONGRESS (AIIOT), 2022, : 418 - 424
  • [5] Machine learning in coronary heart disease prediction: Structural equation modelling approach
    Rodrigues, Lewlyn L. R.
    Shetty, Dasharathraj K.
    Naik, Nithesh
    Maddodi, Chethana Balakrishna
    Rao, Anuradha
    Shetty, Ajith Kumar
    Bhat, Rama
    Hameed, Zeeshan
    [J]. COGENT ENGINEERING, 2020, 7 (01):
  • [6] Thyroid Disease Prediction Using Machine Learning Approaches
    Gyanendra Chaubey
    Dhananjay Bisen
    Siddharth Arjaria
    Vibhash Yadav
    [J]. National Academy Science Letters, 2021, 44 : 233 - 238
  • [7] Thyroid Disease Prediction Using Machine Learning Approaches
    Chaubey, Gyanendra
    Bisen, Dhananjay
    Arjaria, Siddharth
    Yadav, Vibhash
    [J]. NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2021, 44 (03): : 233 - 238
  • [8] Thyroid Disease Treatment prediction with machine learning approaches
    Aversano, Lerina
    Bernardi, Mario Luca
    Cimitile, Marta
    Iammarino, Martina
    Macchia, Paolo Emidio
    Nettore, Immacolata Cristina
    Verdone, Chiara
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 1031 - 1040
  • [9] Statistical Machine Learning Approaches to Liver Disease Prediction
    Mostafa, Fahad
    Hasan, Easin
    Williamson, Morgan
    Khan, Hafiz
    [J]. LIVERS, 2021, 1 (04): : 294 - 312
  • [10] Challenges and promises of machine learning-based risk prediction modelling in cardiovascular disease
    Gonzalez-Del-Hoyo, Maribel
    Rossello, Xavier
    [J]. EUROPEAN HEART JOURNAL-ACUTE CARDIOVASCULAR CARE, 2021, 10 (08) : 866 - 868