Enhanced algorithm for high-dimensional data classification

被引:3
|
作者
Wang, Xiaoming [1 ]
Wang, Shitong [2 ]
机构
[1] Xihua Univ, Sch Comp & Software Engn, Chengdu 610039, Peoples R China
[2] Jiangnan Univ, Sch Digital Media, Wuxi 214122, Peoples R China
基金
美国国家科学基金会;
关键词
Machine learning; Supervised learning; Kernel methods; Support vector machine; DISCRIMINANT-ANALYSIS; FAST IMPLEMENTATION; SUPPORT; SELECTION;
D O I
10.1016/j.asoc.2015.10.051
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Minimum class variance support vector machine (MCVSVM) and large margin linear projection (LMLP) classifier, in contrast with traditional support vector machine (SVM), take the distribution information of the data into consideration and can obtain better performance. However, in the case of the singularity of the within-class scatter matrix, both MCVSVM and LMLP only exploit the discriminant information in a single subspace of the within-class scatter matrix and discard the discriminant information in the other subspace. In this paper, a so-called twin-space support vector machine (TSSVM) algorithm is proposed to deal with the high-dimensional data classification task where the within-class scatter matrix is singular. TSSVM is rooted in both the non-null space and the null space of the within-class scatter matrix, takes full advantage of the discriminant information in the two subspaces, and so can achieve better classification accuracy. In the paper, we first discuss the linear case of TSSVM, and then develop the nonlinear TSSVM. Experimental results on real datasets validate the effectiveness of TSSVM and indicate its superior performance over MCVSVM and LMLP.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 50 条
  • [1] A classification algorithm for high-dimensional data
    Roy, Asim
    [J]. INNS CONFERENCE ON BIG DATA 2015 PROGRAM, 2015, 53 : 345 - 355
  • [2] A training algorithm for classification of high-dimensional data
    Vieira, A
    Barradas, N
    [J]. NEUROCOMPUTING, 2003, 50 : 461 - 472
  • [3] Clonal Selection Classification Algorithm for High-Dimensional Data
    Liu, Ruochen
    Zhang, Ping
    Jiao, Licheng
    [J]. LIFE SYSTEM MODELING AND INTELLIGENT COMPUTING, PT II, 2010, 98 : 89 - 95
  • [4] On sparse linear discriminant analysis algorithm for high-dimensional data classification
    Ng, Michael K.
    Liao, Li-Zhi
    Zhang, Leihong
    [J]. NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (02) : 223 - 235
  • [5] The classification method based on evolutionary algorithm for high-dimensional imbalanced missing data
    Liu, Yi
    Li, Gengsong
    Li, Xiang
    Qin, Wei
    Zheng, Qibin
    Ren, Xiaoguang
    [J]. ELECTRONICS LETTERS, 2023, 59 (12)
  • [6] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar, Sandhya
    Aravindakshan Savithri, Akhil
    Kaimal, Ramachandra
    [J]. Turkish Journal of Electrical Engineering and Computer Sciences, 2019, 27 (06): : 4082 - 4101
  • [7] FACO: A Novel Hybrid Feature Selection Algorithm for High-Dimensional Data Classification
    Popoola, Gideon
    Oyeniran, Kayode
    [J]. SOUTHEASTCON 2024, 2024, : 61 - 68
  • [8] Classification algorithm for high-dimensional protein markers in time-course data
    Vishwakarma, Gajendra K.
    Bhattacharjee, Atanu
    Banerjee, Souvik
    Liquet, Benoit
    [J]. STATISTICS IN MEDICINE, 2020, 39 (28) : 4201 - 4217
  • [9] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar, Sandhya
    Aravindakshan Savithri, Akhil
    Kaimal, Ramachandra
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (06) : 4082 - 4101
  • [10] Defining and Evaluating Classification Algorithm for High-Dimensional Data Based on Latent Topics
    Luo, Le
    Li, Li
    [J]. PLOS ONE, 2014, 9 (01):