Feature selection in sequential projection pursuit

被引:20
|
作者
Guo, Q
Wu, W
Massart, DL
Boucon, C
de Jong, S
机构
[1] Free Univ Brussels, Inst Pharmaceut, ChemoAC, B-1090 Brussels, Belgium
[2] GlaxoSmithKline, Safety Assessment, Welwyn Garden City AL6 9AR, Herts, England
[3] Unilever Res Labs Vlaardingen, NL-3133 AT Vlaardingen, Netherlands
关键词
feature selection; sequential projection pursuit; principal component analysis; genetic algorithm; generalised procrustes analysis; data mining;
D O I
10.1016/S0003-2670(01)01000-5
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
A feature selection method is proposed to select a subset of variables in sequential projection pursuit (SPP) analysis in order to preserve as much sample clustering information as possible. The inhomogeneity of the complete data is explored by SPP, and the retained inhomogeneity information of a candidate subset is measured by means of the percentage of consensus in generalised procrustes analysis. The best subset is obtained by applying a genetic algorithm (GA) which optimises the consensus between the subset and the complete data set. An improved algorithm is proposed which enables analysis of high-dimensional data. The method was studied on three high-dimensional industrial data sets. The results show that the proposed method successfully identified inhomogeneity-bearing variables and leads to better subsets of variables than the other studied feature selection methods in preserving interesting clustering information. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:85 / 96
页数:12
相关论文
共 50 条
  • [1] Sequential Projection Pursuit with Kernel Matrix Update and Symbolic Model Selection
    Rodriguez-Martinez, E.
    Mu, T.
    Goulermas, J. Y.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2458 - 2469
  • [2] Special issue linStat: sequential projection pursuit
    Franceschini, Cinzia
    Loperfido, Nicola
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023,
  • [3] A projection pursuit approach to variable selection
    Montanari, A
    Lizzani, L
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 35 (04) : 463 - 473
  • [4] Neural Greedy Pursuit for Feature Selection
    Das, Sandipan
    Javid, Alireza M.
    Gohain, Prakash Borpatra
    Eldar, Yonina C.
    Chatterjee, Saikat
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] Selection of Mine Site Based on Projection Pursuit
    Li, Yan-Cang
    Zhao, Li-Na
    Zhou, Shu-Jing
    [J]. EBM 2010: INTERNATIONAL CONFERENCE ON ENGINEERING AND BUSINESS MANAGEMENT, VOLS 1-8, 2010, : 1802 - 1804
  • [6] Sequential Feature Selection for Classification
    Rueckstiess, Thomas
    Osendorfer, Christian
    van der Smagt, Patrick
    [J]. AI 2011: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7106 : 132 - +
  • [7] Unsupervised Discriminative Projection for Feature Selection
    Wang, Rong
    Bian, Jintang
    Nie, Feiping
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 942 - 953
  • [8] Hierarchical Feature Selection for Random Projection
    Wang, Qi
    Wan, Jia
    Nie, Feiping
    Liu, Bo
    Yan, Chenggang
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1581 - 1586
  • [9] Towards an efficient genetic algorithm optimizer for sequential projection pursuit
    Espezua, Soledad
    Villanueva, Edwin
    Maciel, Carlos D.
    [J]. NEUROCOMPUTING, 2014, 123 : 40 - 48
  • [10] Structured orthogonal matching pursuit for feature selection
    Shi, Xiaoshuang
    Xing, Fuyong
    Guo, Zhenhua
    Su, Hai
    Liu, Fujun
    Yang, Lin
    [J]. NEUROCOMPUTING, 2019, 349 : 164 - 172