Comparing linear discriminant analysis and supervised learning algorithms for binary classification-A method comparison study

被引:19
|
作者
Graf, Ricarda [1 ]
Zeldovich, Marina [2 ]
Friedrich, Sarah [1 ,3 ]
机构
[1] Univ Augsburg, Dept Math, Univ Str 14, D-86159 Augsburg, Germany
[2] Univ Med Ctr Gottingen, Inst Med Psychol & Med Sociol, Gottingen, Germany
[3] Univ Augsburg, Ctr Adv Analyt & Predict Sci CAAPS, Augsburg, Germany
关键词
binary classification; linear discriminant analysis; multivariate normality; simulation study; supervised learning; REGULARIZATION PATH; PREDICTION; MODELS; VALIDATION; SEARCH;
D O I
10.1002/bimj.202200098
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In psychology, linear discriminant analysis (LDA) is the method of choice for two-group classification tasks based on questionnaire data. In this study, we present a comparison of LDA with several supervised learning algorithms. In particular, we examine to what extent the predictive performance of LDA relies on the multivariate normality assumption. As nonparametric alternatives, the linear support vector machine (SVM), classification and regression tree (CART), random forest (RF), probabilistic neural network (PNN), and the ensemble k conditional nearest neighbor (EkCNN) algorithms are applied. Predictive performance is determined using measures of overall performance, discrimination, and calibration, and is compared in two reference data sets as well as in a simulation study. The reference data are Likert-type data, and comprise 5 and 10 predictor variables, respectively. Simulations are based on the reference data and are done for a balanced and an unbalanced scenario in each case. In order to compare the algorithms' performance, data are simulated from multivariate distributions with differing degrees of nonnormality. Results differ depending on the specific performance measure. The main finding is that LDA is always outperformed by RF in the bimodal data with respect to overall performance. Discriminative ability of the RF algorithm is often higher compared to LDA, but its model calibration is usually worse. Still LDA mostly ranges second in cases it is outperformed by another algorithm, or the differences are only marginal. In consequence, we still recommend LDA for this type of application.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Linear Discriminant Analysis for Face Recognition - Comparison of Subspace Approach with Regularization Method
    Grzywczak, Daniel
    Skarbek, Wladyslaw
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2013, 2013, 8903
  • [42] A Novel Hippocampus Classification Method Based on the Subfields Selection and Sparse Linear Discriminant Analysis
    Wang, Xiangying
    Wang, Shiyuan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 2109 - 2114
  • [43] Reconstructive discriminant analysis: A feature extraction method induced from linear regression classification
    Chen, Yi
    Jin, Zhong
    NEUROCOMPUTING, 2012, 87 : 41 - 50
  • [44] A 12-lead Clinical ECG Classification Method Based On Semi-supervised Discriminant Analysis
    Zhang, Hanlin
    Huang, Kai
    Li, Dong
    Zhang, Liqing
    PROCEEDINGS OF THE 2013 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2013), VOLS 1 AND 2, 2013, : 177 - 181
  • [45] Comparison between linear and nonlinear machine-learning algorithms for the classification of thyroid nodules
    Ouyang, Fu-sheng
    Guo, Bao-liang
    Ouyang, Li-zhu
    Liu, Zi-wei
    Lin, Shao-jia
    Meng, Wei
    Huang, Xi-yi
    Chen, Hai-xiong
    Hu, Qiu-gen
    Yang, Shao-ming
    EUROPEAN JOURNAL OF RADIOLOGY, 2019, 113 : 251 - 257
  • [46] Comparison of variable selection methods prior to linear discriminant analysis classification of synthetic phenethylamines and tryptamines
    Setser, Amanda L.
    Smith, Ruth Waddell
    FORENSIC CHEMISTRY, 2018, 11 : 77 - 86
  • [47] Classification and staging of dementia of the Alzheimer type - A comparison between neural networks and linear discriminant analysis
    French, BM
    Dawson, MRW
    Dobbs, AR
    ARCHIVES OF NEUROLOGY, 1997, 54 (08) : 1001 - 1009
  • [48] Comparison of different supervised learning algorithms for position analysis of the slider-crank mechanism
    Denizhan, Onur
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 92 : 39 - 49
  • [49] Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison
    Ali, Md Mamun
    Paul, Bikash Kumar
    Ahmed, Kawsar
    Bui, Francis M.
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [50] Discriminant analysis and machine learning approach for evaluating and improving the performance of immunohistochemical algorithms for COO classification of DLBCL
    Yocanxóchitl Perfecto-Avalos
    Alejandro Garcia-Gonzalez
    Ana Hernandez-Reynoso
    Gildardo Sánchez-Ante
    Carlos Ortiz-Hidalgo
    Sean-Patrick Scott
    Rita Q. Fuentes-Aguilar
    Ricardo Diaz-Dominguez
    Grettel León-Martínez
    Verónica Velasco-Vales
    Mara A. Cárdenas-Escudero
    José A. Hernández-Hernández
    Arturo Santos
    José R. Borbolla-Escoboza
    Luis Villela
    Journal of Translational Medicine, 17