A method of dimensionality reduction by selection of components in principal component analysis for text classification

被引:5
|
作者
Zhang, Yangwu [1 ,2 ]
Li, Guohe [1 ,3 ]
Zong, Heng [2 ]
机构
[1] China Univ Petr, Coll Geophys & Informat Engn, Beijing, Peoples R China
[2] China Univ Polit Sci & Law, Dept Sci & Technol Teaching, Beijing, Peoples R China
[3] China Univ Petr, Beijing Key Lab Data Min Petr Data, Beijing, Peoples R China
关键词
Principal components analysis; Dimensionality reduction; Text classification;
D O I
10.2298/FIL1805499Z
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Dimensionality reduction, including feature extraction and selection, is one of the key points for text classification. In this paper, we propose a mixed method of dimensionality reduction constructed by principal components analysis and the selection of components. Principal components analysis is a method of feature extraction. Not all of the components in principal component analysis contribute to classification, because PCA objective is not a form of discriminant analysis (see, e.g. Jolliffe, 2002). In this context, we present a function of components selection, which returns the useful components for classification by the indicators of the performances on the different subsets of the components. Compared to traditional methods of feature selection, SVM classifiers trained on selected components show improved classification performance and a reduction in computational overhead.
引用
收藏
页码:1499 / 1506
页数:8
相关论文
共 50 条
  • [41] Dimension selection for feature selection and dimension reduction with principal and independent component analysis
    Koch, Inge
    Naito, Kanta
    NEURAL COMPUTATION, 2007, 19 (02) : 513 - 545
  • [42] An online incremental orthogonal component analysis method for dimensionality reduction
    Zhu, Tao
    Xu, Ye
    Shen, Furao
    Zhao, Jinxi
    NEURAL NETWORKS, 2017, 85 : 33 - 50
  • [43] An Evolutionary Orthogonal Component Analysis Method for Incremental Dimensionality Reduction
    Zhang, Tianyue
    Shen, Furao
    Zhu, Tao
    Zhao, Jian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 392 - 405
  • [44] COMPONENT SELECTION NORMS FOR PRINCIPAL COMPONENTS REGRESSION
    HILL, RC
    FOMBY, TB
    JOHNSON, SR
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1977, 6 (04): : 309 - 334
  • [45] A Pareto Corner Search Evolutionary Algorithm and Principal Component Analysis for Objective Dimensionality Reduction
    Xuan Hung Nguyen
    Lam Thu Bui
    Cao Truong Tran
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 25 - 30
  • [46] Kernel Principal Component Analysis for dimensionality reduction in fMRI-based diagnosis of ADHD
    Sidhu, Gagan S.
    Asgarian, Nasimeh
    Greiner, Russell
    Brown, Matthew R. G.
    FRONTIERS IN SYSTEMS NEUROSCIENCE, 2012, 6
  • [47] Spectral transformation based on nonlinear principal component analysis for dimensionality reduction of hyperspectral images
    Licciardi, Giorgio
    Chanussot, Jocelyn
    EUROPEAN JOURNAL OF REMOTE SENSING, 2018, 51 (01) : 375 - 390
  • [48] Dimensionality reduction of RKHS model using Reduced Kernel Principal Component Analysis (RKPCA)
    Ilyes, Elaissi
    Okba, Taouali
    Hassani, Messaoud
    18TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, 2010, : 951 - 956
  • [49] Energy Efficient Medical Data Dimensionality Reduction using Optimized Principal Component Analysis
    Sophia S.G.
    Thanammal K.K.
    Sujatha S.S.
    EAI Endorsed Transactions on Energy Web, 2022, 9 (37) : 1 - 7
  • [50] A Comparative Approach of Dimensionality Reduction Techniques in Text Classification
    Basha, Shaik Rahamat
    Rani, J. Keziya
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2019, 9 (06) : 4974 - 4979