A cluster-based ensemble approach for congenital heart disease prediction

被引:6
|
作者
Kaur, Ishleen [1 ]
Ahmad, Tanvir [2 ]
机构
[1] Univ Delhi, Sri Guru Tegh Bahadur Khalsa Coll, Delhi, India
[2] Jamia Millia Islamia, Dept Comp Engn, New Delhi, India
关键词
Congenital heart disease; DBSCAN; Ensemble; Machine learning; Random forest; DIAGNOSIS; DEFECTS; TRENDS;
D O I
10.1016/j.cmpb.2023.107922
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: One of the most prevalent birth disorders is congenital heart diseases (CHD). Although CHD risk factors have been the subject of numerous studies, their propensity to cause CHD has not been tested. Particularly few research has attempted to forecast CHD risk using population-based cross-sectional data, which is inherently imbalanced. Objective: The main goals of this study are to create a reliable data analysis model that can help with (i) a better understanding of congenital heart disease prediction in the presence of missing and unbalanced data and (ii) creating cohorts of expectant mothers with similar lifestyle characteristics. Methods: Clusters of patient cohorts are produced using the unsupervised data mining technique density-based spatial clustering of applications with noise (DBSCAN). For more accurate CHD prediction, a random forest model was trained using these clusters and their corresponding patterns. This study uses a dataset of 33,831 expectant mothers to make its prediction. Missing data were handled using the k-NN imputation approach, while extremely unbalanced data were balanced using SMOTE. These techniques are all data-driven and need little to no user or expert involvement. Results and Conclusion: Using DBSCAN, three cohorts were found. The cluster information enhanced the random forest-based CHD prediction and revealed intricate factors that influence prediction accuracy. The proposed approach gave the highest results with 99 % accuracy and 0.91 AUC and performed better than the state-of-theart methodologies. Hence, the suggested method using unsupervised learning can provide intricate information to the classifier and further enhance the performance of the classification.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Modeling the hydrodynamics of downers by cluster-based approach
    Karimipour, Shayan
    Mostoufi, Navid
    Sotudeh-Gharebagh, Rahmat
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2006, 45 (21) : 7204 - 7209
  • [32] RACC: An approach to cluster-based Web servers
    Zhang, XL
    Shanmugan, R
    Barrientos, M
    Chen, JB
    PROCEEDINGS OF THE 2ND USENIX WINDOWS NT SYMPOSIUM, 1998, : 167 - 167
  • [33] Cluster-based approach for routing in dynamic networks
    Krishna, P.
    Vaidya, N.H.
    Chatterjee, M.
    Pradhan, D.K.
    Computer Communication Review, 1997, 27 (02): : 49 - 64
  • [34] A Cluster-Based Machine Learning Ensemble Approach for Geospatial Data: Estimation of Health Insurance Status in Missouri
    Mueller, Erik
    Sandoval, J. S. Onesimo
    Mudigonda, Srikanth
    Elliott, Michael
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (01)
  • [35] Cluster-Based Memetic Approach of Image Alignment
    Cocianu, Catalina-Lucia
    Uscatu, Cristian Razvan
    ELECTRONICS, 2021, 10 (21)
  • [36] Analysis of SPI index trend variations in the United Kingdom - A cluster-based and bayesian ensemble algorithms approach
    Di Nunno, Fabio
    de Marinis, Giovanni
    Granata, Francesco
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2024, 52
  • [37] Feature Selection and Ensemble Hierarchical Cluster-based Under-sampling Approach for Extremely Imbalanced Datasets
    Soltani, Sima
    Sadri, Javad
    Torshizi, Hassan Ahmadi
    2011 1ST INTERNATIONAL ECONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2011, : 166 - 171
  • [38] A cluster-based approach for the innovation assessment of countries
    Onsel, Sule
    Ulengin, Fusun
    Kabak, Ozgur
    IEMC - EUROPE 2008: INTERNATIONAL ENGINEERING MANAGEMENT CONFERENCE, EUROPE, CONFERENCE PROCEEDINGS: MANAGING ENGINEERING, TECHNOLOGY AND INNOVATION FOR GROWTH, 2008, : 293 - 297
  • [39] Boron cluster-based approach to nucleophilic borylation
    Spokoyny, Alexander
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [40] CLUSTER-BASED APPROACH: A TOOL TO ENTER INTO THE MARKET
    Kassalis, Ivars
    6TH INTERNATIONAL SCIENTIFIC CONFERENCE BUSINESS AND MANAGEMENT 2010, VOLS I AND II, 2010, : 635 - 642