Classifying Familial Hypercholesterolaemia: A Tree-based Machine Learning Approach

被引:0
|
作者
Rosli, Marshima Mohd [1 ,2 ]
Edward, Jafhate [1 ]
Onn, Marcella [1 ]
Chua, Yung-An [2 ]
Kasim, Noor Alicezah Mohd [2 ,3 ]
Nawawi, Hapizah [2 ,3 ]
机构
[1] Univ Teknol MARA, Fac Comp & Math Sci, UiTM, Shah Alam 40450, Selangor, Malaysia
[2] Univ Teknol MARA, Inst Pathol, Lab & Forens Med PPerForM, UiTM, Sungai Buloh 47000, Selangor, Malaysia
[3] Univ Teknol MARA, Fac Med, UiTM, Sungai Buloh 47000, Selangor, Malaysia
关键词
Familial hypercholesterolaemia; predicting FH; machine learning algorithms; tree-based classifier; PREVALENCE; GUIDANCE; CARE; FH;
D O I
10.14569/IJACSA.2021.0120908
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Familial hypercholesterolaemia is the most common and serious form of inherited hyperlipidaemia. It has an autosomal dominant mode of inheritance, and is characterised by severely elevated low-density lipoprotein cholesterol levels. Familial hypercholesterolaemia is an important cause of premature coronary heart disease, but is potentially treatable. However, the majority of familial hypercholesterolaemia individuals are under-diagnosed and under-treated, resulting in lost opportunities for premature coronary heart disease prevention. This study aims to assess performance of machine learning algorithms for enhancing familial hypercholesterolaemia detection within the Malaysian population. We applied three machine learning algorithms (random forest, gradient boosting and decision tree) to classify familial hypercholesterolaemia among Malaysian patients and to identify relevant features from four well-known diagnostic instruments: Simon Broome, Dutch Lipid Clinic Criteria, US Make Early Diagnosis to Prevent Early Deaths and Japanese FH Management Criteria. The performance of these classifiers was compared using various measurements for accuracy, precision, sensitivity and specificity. Our results indicated that the decision tree classifier had the best performance, with an accuracy of 99.72%, followed by the gradient boosting and random forest classifiers, with accuracies of 99.54% and 99.52% respectively. The three classifiers with Recursive Feature Elimination method selected six common features of familial hypercholesterolaemia diagnostic criteria (family history of coronary heart disease, low-density lipoprotein cholesterol levels, presence of tendon xanthomata and/or corneal arcus, family hypercholesterolaemia, and family history of familial hypercholesterolaemia) that generate the highest accuracy in predicting familial hypercholesterolaemia. We anticipate machine learning algorithms will enhance rapid diagnosis of familial hypercholesterolaemia by providing the tools to develop a virtual screening test for familial hypercholesterolaemia.
引用
收藏
页码:66 / 73
页数:8
相关论文
共 50 条
  • [1] Protein pKa Prediction by Tree-Based Machine Learning
    Chen, Ada Y.
    Lee, Juyong
    Damjanovic, Ana
    Brooks, Bernard R.
    [J]. JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, 18 (04) : 2673 - 2686
  • [2] Runtime Optimizations for Tree-based Machine Learning Models
    Asadi, Nima
    Lin, Jimmy
    de Vries, Arjen P.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2281 - 2292
  • [3] Tree-based interpretable machine learning of the thermodynamic phases
    Yang, Jintao
    Cao, Junpeng
    [J]. PHYSICS LETTERS A, 2021, 412
  • [4] Cosmic string detection with tree-based machine learning
    Sadr, A. Vafaei
    Farhang, M.
    Movahed, S. M. S.
    Bassett, B.
    Kunz, M.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2018, 478 (01) : 1132 - 1140
  • [5] Tree-based Machine Learning Methods for Survey Research
    Kern, Christoph
    Klausch, Thomas
    Kreuter, Frauke
    [J]. SURVEY RESEARCH METHODS, 2019, 13 (01): : 73 - 93
  • [6] Finding Gene Regulatory Networks in Psoriasis: Application of a Tree-Based Machine Learning Approach
    Deng, Jingwen
    Schieler, Carlotta
    Borghans, Jose A. M.
    Lu, Chuanjian
    Pandit, Aridaman
    [J]. FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [7] Tree-based methods for classifying software failures
    Francis, P
    Leon, D
    Minch, M
    Podgurski, A
    [J]. 15TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 2004, : 451 - 462
  • [8] TREE-BASED MACHINE LEARNING METHODS FOR MODELING AND FORECASTING MORTALITY
    Bjerre, Dorethe Skovgaard
    [J]. ASTIN BULLETIN-THE JOURNAL OF THE INTERNATIONAL ACTUARIAL ASSOCIATION, 2022, 52 (03) : 765 - 787
  • [9] Tree-based machine learning approaches for equity market predictions
    Wolff, Dominik
    Neugebauer, Ulrich
    [J]. JOURNAL OF ASSET MANAGEMENT, 2019, 20 (04) : 273 - 288
  • [10] Tree-based machine learning approaches for equity market predictions
    Dominik Wolff
    Ulrich Neugebauer
    [J]. Journal of Asset Management, 2019, 20 : 273 - 288