Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors Epidemic disease prediction modelling

被引:5
|
作者
Fusco, Terence [1 ]
Bi, Yaxin [1 ]
Wang, Haiying [1 ]
Browne, Fiona [1 ]
机构
[1] Univ Ulster, Fac Comp & Engn, Newtownabbey, North Ireland
关键词
Disease prediction modelling; Data imputation; Synthetic data simulation; Schistosomiasis; SMOTE; Incremental transductive approaches; SPATIAL-ANALYSIS; CLIMATE-CHANGE; CLASSIFICATION; PERFORMANCE; IMPUTATION;
D O I
10.1007/s13042-019-01029-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. Novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised labelling and synthetic instance simulation when using sparse training data. Innovative semi-supervised ensemble learning paradigms are proposed focusing on labelling threshold selection and stringency of classification confidence levels. A regression-correlation combination (RCC) data imputation method is also introduced for handling of partially complete training data. Results presented in this work show data imputation precision improvement over benchmark value replacement using proposed RCC on 70% of test cases. Proposed novel incremental transductive models such as ITSVM have provided interesting findings based on threshold constraints outperforming standard SVM application on 21% of test cases and can be applied with alternative environment-based epidemic disease domains. The proposed incremental transductive ensemble approach model enables the combination of complimentary algorithms to provide labelling for unlabelled vector density instances. Liberal (LTA) and strict training approaches provided varied results with LTA outperforming Stacking ensemble on 29.1% of test cases. Proposed novel synthetic minority over-sampling technique (SMOTE) equilibrium approach has yielded subtle classification performance increases which can be further interrogated to assess classification performance and efficiency relationships with synthetic instance generation.
引用
收藏
页码:1159 / 1178
页数:20
相关论文
共 50 条
  • [31] Modelling and prediction of GNSS time series using GBDT, LSTM and SVM machine learning approaches
    Gao, Wenzong
    Li, Zhao
    Chen, Qusen
    Jiang, Weiping
    Feng, Yanming
    [J]. JOURNAL OF GEODESY, 2022, 96 (10)
  • [32] Prediction of local scour depth around bridge piers: modelling based on machine learning approaches
    Kumar, Virendra
    Baranwal, Anubhav
    Das, Bhabani Shankar
    [J]. ENGINEERING RESEARCH EXPRESS, 2024, 6 (01):
  • [33] Modelling and prediction of GNSS time series using GBDT, LSTM and SVM machine learning approaches
    Wenzong Gao
    Zhao Li
    Qusen Chen
    Weiping Jiang
    Yanming Feng
    [J]. Journal of Geodesy, 2022, 96
  • [34] Disease Prediction Using Graph Machine Learning Based on Electronic Health Data: A Review of Approaches and Trends
    Lu, Haohui
    Uddin, Shahadat
    [J]. HEALTHCARE, 2023, 11 (07)
  • [35] Prediction of an epidemic with Machine Learning and Covid-19 Data
    Fang Wenhui
    Wang Yihui
    Lu Zhipeng
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL SCIENCE (AEECS 2021), 2021, 245
  • [36] Diabetes Disease Prediction Using Data Mining
    Shetty, Deeraj
    Rit, Kishor
    Shaikh, Sohail
    Patil, Nikita
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [37] Cassava Disease Prediction Using Data Mining
    Anand, Amal
    Joseph, Merin
    Sreelakshmi, S. K.
    Sreenu, G.
    [J]. SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2019, 2020, 39 : 679 - 686
  • [38] A NEW COMPUTATIONAL MODELLING FOR PREDICTION OF COVID-19 POPULATION AND TO APPROXIMATE EPIDEMIC EVOLUTION OF THE DISEASE
    Choukhan, C. F.
    Lemnaouar, M. R.
    Elhatimi
    Zine, R.
    Ibrihich, O.
    Esghir, M.
    [J]. COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2023,
  • [39] Evaluation based Approaches for Liver Disease Prediction using Machine Learning Algorithms
    Geetha, C.
    Arunachalam, A. R.
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [40] Machine learning approaches for asthma disease prediction among adults in Sri Lanka
    Gunawardana, J. R. N. A.
    Viswakula, S. D.
    Rannan-Eliya, Ravindra P.
    Wijemunige, Nilmini
    [J]. HEALTH INFORMATICS JOURNAL, 2024, 30 (03)