A new feature subset selection using bottom-up clustering

被引:0
|
作者
Zeinab Dehghan
Eghbal G. Mansoori
机构
[1] Shiraz University,School of Electrical and Computer Engineering
来源
关键词
Dimensionality reduction; Feature selection; Hierarchical clustering; Feature clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Feature subset selection and/or dimensionality reduction is an essential preprocess before performing any data mining task, especially when there are too many features in the problem space. In this paper, a clustering-based feature subset selection (CFSS) algorithm is proposed for discriminating more relevant features. In each level of agglomeration, it uses similarity measure among features to merge two most similar clusters of features. By gathering similar features into clusters and then introducing representative features of each cluster, it tries to remove some redundant features. To identify the representative features, a criterion based on mutual information is proposed. Since CFSS works in a filter manner in specifying the representatives, it is noticeably fast. As an advantage of hierarchical clustering, it does not need to determine the number of clusters in advance. In CFSS, the clustering process is repeated until all features are distributed in some clusters. However, to diffuse the features in a reasonable number of clusters, a recently proposed approach is used to obtain a suitable level for cutting the clustering tree. To assess the performance of CFSS, we have applied it on some valid UCI datasets and compared with some popular feature selection methods. The experimental results reveal the efficiency and fastness of our proposed method.
引用
收藏
页码:57 / 66
页数:9
相关论文
共 50 条
  • [11] Feature ranking based consensus clustering for feature subset selection
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    Krishna, G. Bala
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8154 - 8169
  • [12] Feature subset selection using a new definition of classifiability
    Dong, M
    Kothari, R
    PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1215 - 1225
  • [13] A Bottom-Up Approach for Licences Classification and Selection
    Daga, Enrico
    d'Aquin, Mathieu
    Motta, Enrico
    Gangemi, Aldo
    SEMANTIC WEB: ESWC 2015 SATELLITE EVENTS, 2015, 9341 : 257 - 267
  • [14] Bottom-Up Variable Selection in Cluster Analysis Using Bootstrapping: A Proposal
    Mucha, Hans-Joachim
    Bartel, Hans-Georg
    ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 125 - 135
  • [15] A BOTTOM-UP HIERARCHICAL CLUSTERING ALGORITHM WITH INTERSECTION POINTS
    Nazari, Zahra
    Nazari, Masooma
    Kang, Dongshik
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2019, 15 (01): : 291 - 304
  • [16] Bottom-up Framework for Robust Facial Feature Detectiond
    Erukhimov, Victor
    Lee, Kuang-chih
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 825 - +
  • [17] Bottom-Up Biases in Feature-Selective Attention
    Andersen, Soren K.
    Mueller, Matthias M.
    Martinovic, Jasna
    JOURNAL OF NEUROSCIENCE, 2012, 32 (47): : 16953 - 16958
  • [18] Scanpath Prediction through Convolutional Neural Network and Feature-Based Clustering: A Bottom-Up Approach
    George, Judy K.
    Sherly, Elizabeth
    2024 15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024, 2024,
  • [19] A new approach to feature subset selection
    Liu, DZ
    Feng, ZJ
    Wang, XZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1822 - 1825
  • [20] ProteaseGuru: A Tool for Protease Selection in Bottom-Up Proteomics
    Miller, Rachel M.
    Ibrahim, Khairina
    Smith, Lloyd M.
    JOURNAL OF PROTEOME RESEARCH, 2021, 20 (04) : 1936 - 1942