PERFORMANCE OF MACHINE LEARNING METHODS IN CLASSIFICATION MODELS WITH HIGH-DIMENSIONAL DATA

被引:0
|
作者
Zekic-Susac, Marijana [1 ]
Pfeifer, Sanja [1 ]
Sarlija, Natasa [1 ]
机构
[1] Univ Josip Juraj Strossmayer Osijek, Fac Econ, Gajev Trg 7, Osijek 31000, Croatia
关键词
machine learning; support vector machines; artificial neural networks; CART classification trees; k-nearest neighbour; large-dimensional data; cross-validation; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; DECISION TREES;
D O I
暂无
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
The paper investigates the performance of four machine learning methods: artificial neural networks, classification trees, support vector machines, and k-nearest neighbour in classification type of problem by using a real dataset on entrepreneurial intentions of students. The aim is to fmd out which of the machine learning methods is more efficient in modelling high-dimensional data in the sense of the average classification rate obtained in a 10-fold cross-validation procedure. In addition, sensitivity and specificity is also observed. The results show that the accuracy of artificial neural networks is significantly higher than the accuracy of k-nearest neighbour, but the difference among other methods is not statistically significant.
引用
收藏
页码:219 / 224
页数:6
相关论文
共 50 条
  • [1] A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem
    Zekic-Susac, Marijana
    Pfeifer, Sanja
    Sarlija, Natasa
    [J]. BUSINESS SYSTEMS RESEARCH JOURNAL, 2014, 5 (03): : 82 - 96
  • [2] Accurate classification of depression through optimized machine learning models on high-dimensional noisy data
    Fang, Xingang
    Klawohn, Julia
    De Sabatino, Alexander
    Kundnani, Harsh
    Ryan, Jonathan
    Yu, Weikuan
    Hajcak, Greg
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 71
  • [3] Novel machine learning approach for classification of high-dimensional microarray data
    Rabia Aziz Musheer
    C. K. Verma
    Namita Srivastava
    [J]. Soft Computing, 2019, 23 : 13409 - 13421
  • [4] Novel machine learning approach for classification of high-dimensional microarray data
    Musheer, Rabia Aziz
    Verma, C. K.
    Srivastava, Namita
    [J]. SOFT COMPUTING, 2019, 23 (24) : 13409 - 13421
  • [5] Classification methods for high-dimensional genetic data
    Kalina, Jan
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2014, 34 (01) : 10 - 18
  • [6] Learning from models: high-dimensional analyses on the performance of machine learning interatomic potentials
    Liu, Yunsheng
    Mo, Yifei
    [J]. NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [7] Learning distance to subspace for the nearest subspace methods in high-dimensional data classification
    Zhu, Rui
    Dong, Mingzhi
    Xue, Jing-Hao
    [J]. INFORMATION SCIENCES, 2019, 481 : 69 - 80
  • [8] Morphological classification of brains via high-dimensional shape transformations and machine learning methods
    Lao, ZQ
    Shen, DG
    Xue, Z
    Karacali, B
    Resnick, SM
    Davatzikos, C
    [J]. NEUROIMAGE, 2004, 21 (01) : 46 - 57
  • [9] High-Dimensional Data Visualisation Methods Using Machine Learning and Their Use in Image Analysis
    Tian, Ying
    Ali, Majid Khan Majahar
    Wu, Lili
    Li, Tao
    [J]. TRAITEMENT DU SIGNAL, 2024, 41 (03) : 1355 - 1364
  • [10] Ensemble of penalized logistic models for classification of high-dimensional data
    Ijaz, Musarrat
    Asghar, Zahid
    Gul, Asma
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (07) : 2072 - 2088