Impact of selected pre-processing techniques on prediction of risk of early readmission for diabetic patients in India

被引:9
|
作者
Duggal, Reena [1 ]
Shukla, Suren [2 ]
Chandra, Sarika [3 ]
Shukla, Balvinder [4 ]
Khatri, Sunil Kumar [1 ]
机构
[1] Amity Univ Uttar Pradesh, Amity Inst Informat Technol, Noida, India
[2] OHUM Healthcare Solut Private Ltd, Noida, India
[3] Kailash Hosp, Noida, India
[4] Amity Univ Uttar Pradesh, Noida, India
关键词
Data mining; Diabetes; Feature selection; Missing value imputation; Predicting readmission rates; Pre-processing; VALIDATION;
D O I
10.1007/s13410-016-0495-4
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Diabetes is associated with increased risk of hospital readmission. Predicting risk of readmission of diabetic patients can facilitate implementing appropriate plans to prevent these readmissions. But the real-world medical data is noisy, inconsistent, and incomplete. So before building the prediction model, it is essential to pre-process the data efficiently and make it appropriate for predictive modelling. The objective of this study is to assess the impact of selected pre-processing techniques on the prediction of risk of 30-day readmission among patients with diabetes in India. De-identified electronic medical records data was used from a reputed hospital in the National Capital Region in India and included diabetes patients ae<yen>18 years old discharged from hospital in 2012 to 2015 (n = 9381). This paper focused on data pre-processing steps to improve readmission prediction outcomes. The impact of different pre-processing choices including feature selection, missing value imputation and data balancing on the classifier performance of logistic regression, Na < ve Bayes, and decision tree was assessed on various performance metrics such as area under curve, precision, recall, and accuracy. This comprehensive experimental study, first time done from Indian healthcare perspective, offered empirical evidence that most proposed models with pre-processing techniques significantly outperform the baseline methods (without any pre-processing) with respect to selected evaluation criteria. Area under curve (AUC) was highly increased with the use of oversampling technique as data is skewed on class label Readmission. Recall was the biggest gainer with range increasing from 0.02-0.23 to 0.78-0.85, and there was also an increase in AUC from range 0.56-0.68 to 0.83-0.86 by using pre-processing approach. Data pre-processing has a significant effect on hospital readmission predictive accuracy for patients with diabetes, with certain schemes proving inferior to competitive approaches. In addition, it is found that the impact of pre-processing schemes varies by technique, signifying formulation of different best practices to aid better results of a specific technique.
引用
收藏
页码:469 / 476
页数:8
相关论文
共 45 条
  • [31] Prediction of scour hole characteristics caused by water jets using metaheuristic artificial bee colony-optimized neural network and pre-processing techniques
    Kartal, Veysi
    Emiroglu, Muhammet Emin
    Katipoglu, Okan Mert
    Karakoyun, Erkan
    JOURNAL OF HYDROINFORMATICS, 2023, 25 (06) : 2427 - 2443
  • [32] Early Prediction of a Pre-Symptomatic Neurodegeneration Disorder by Measuring Macrophage Inhibitory Factor Level in Diabetic Patients
    Khalil, Rania M.
    Alaa, Shereen
    Eissa, Hanan
    Youssef, Ibrahim
    JOURNAL OF ALZHEIMERS DISEASE, 2022, 88 (03) : 1167 - 1177
  • [33] Case Study: Risk Assessment of Indian Pulse processing firms using FMEA Techniques-Evidence from selected states of India
    Sahu, Mandavi
    Arora, Dr Sapna
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 2624 - 2636
  • [34] In-vivo survival prediction of glioma patients from [11C]MET-PET using advanced data pre-processing and machine learning
    Krajnc, D.
    Papp, L.
    Spielvogel, C. P.
    Grahovac, M.
    Beyer, T.
    Hacker, M.
    Traub-Weidinger, T.
    EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2020, 47 (SUPPL 1) : S275 - S276
  • [35] Prediction of Rapid Early Progression and Survival Risk with Pre-Radiation MRI in WHO Grade 4 Glioma Patients
    Farzana, Walia
    Basree, Mustafa M.
    Diawara, Norou
    Shboul, Zeina A.
    Dubey, Sagel
    Lockhart, Marie M.
    Hamza, Mohamed
    Palmer, Joshua D.
    Iftekharuddin, Khan M.
    CANCERS, 2023, 15 (18)
  • [36] Using Machine Learning Techniques to Develop Risk Prediction Models for the Risk of Incident Diabetic Retinopathy Among Patients With Type 2 Diabetes Mellitus: A Cohort Study
    Zhao, Yuedong
    Li, Xinyu
    Li, Shen
    Dong, Mengxing
    Yu, Han
    Zhang, Mengxian
    Chen, Weidao
    Li, Peihua
    Yu, Qing
    Liu, Xuhan
    Gao, Zhengnan
    FRONTIERS IN ENDOCRINOLOGY, 2022, 13
  • [37] Impact of gender and dialysis modality on early mortality risk in diabetic ESRD patients: data from a large single center cohort
    C. Serafinceanu
    C. Neculaescu
    D. Cimponeriu
    R. Timar
    A. C. Covic
    International Urology and Nephrology, 2014, 46 : 607 - 614
  • [38] Impact of gender and dialysis modality on early mortality risk in diabetic ESRD patients: data from a large single center cohort
    Serafinceanu, C.
    Neculaescu, C.
    Cimponeriu, D.
    Timar, R.
    Covic, A. C.
    INTERNATIONAL UROLOGY AND NEPHROLOGY, 2014, 46 (03) : 607 - 614
  • [39] Positive impact of pre-Ramadan education on glycemic control and reducing risk of hypoglycemia in type 2 diabetic elderly patients during COVID 19 pandemic
    El Toony, Lobna F.
    Elghazally, Shimaa A.
    Hamad, Dina Ali
    PRIMARY CARE DIABETES, 2022, 16 (04) : 581 - 587
  • [40] Estimated Creatinine Clearance, Homocysteine and High Sensitivity-C-Reactive Protein Levels Determination for Early Prediction of Nephropathy and Atherosclerosis Risk In Type 2 Diabetic Patients
    Deebukkhum, Suwipar
    Pingmuangkaew, Patchanrin
    Tangvarasittichai, Orathai
    Tangvarasittichai, Surapon
    INDIAN JOURNAL OF CLINICAL BIOCHEMISTRY, 2012, 27 (03) : 239 - 245