An efficient unsupervised feature selection procedure through feature clustering

被引:22
|
作者
Yan, Xuyang [1 ]
Nazmi, Shabnam [1 ]
Erol, Berat A. [1 ]
Homaifar, Abdollah [1 ]
Gebru, Biniam [1 ]
Tunstel, Edward [2 ]
机构
[1] North Carolina A&T State Univ, Autonomous Control & Informat Technol Inst ACIT, 1601 East Market St, Greensboro, NC 27401 USA
[2] United Technol Res Ctr, E Hartford, CT 06108 USA
关键词
Unsupervised feature selection; Feature clustering; Feature redundancy;
D O I
10.1016/j.patrec.2019.12.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the scarcity of readily available labels, unsupervised feature selection (UFS) methods are widely adopted in the analysis of high-dimensional data. However, most of the existing UFS methods primarily focus on the significance of features in maintaining the data structure while ignoring the redundancy among features. Moreover, the determination of the proper number of features is another challenge. In this paper, an efficient unsupervised feature selection method through feature clustering (EUFSFC) is proposed to address the redundancy among features, and to determine the size of the final feature subset. The proposed methodology is comprised of two steps: (a) feature cluster analysis, and (b) the selection of the representative features. An extended density-based clustering algorithm is proposed to separate features into an appropriate number of disjoint clusters with no requirement for predefined cluster numbers or radii. The selection of features is performed by choosing the most representative features from those feature clusters. Experiments are conducted to show the effectiveness of the proposed feature selection method. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:277 / 284
页数:8
相关论文
共 50 条
  • [1] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    [J]. 2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [2] Unsupervised feature selection for balanced clustering
    Zhou, Peng
    Chen, Jiangyong
    Fan, Mingyu
    Du, Liang
    Shen, Yi-Dong
    Li, Xuejun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [3] A Novel Unsupervised Feature Selection Method for Bioinformatics Data Sets through Feature Clustering
    Li, Guangrong
    Hu, Xiaohua
    Shen, Xiajiong
    Chen, Xin
    Li, Zhoujun
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 41 - +
  • [4] Unsupervised Feature Selection through Fitness Proportionate Sharing Clustering
    Yan, Xuyang
    Homaifar, Abdollah
    Awogbami, Gabriel
    Girma, Abenezer
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 1355 - 1360
  • [5] An Unsupervised Attribute Clustering Algorithm for Unsupervised Feature Selection
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    [J]. PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 710 - 716
  • [6] An efficient framework for unsupervised feature selection
    Zhang, Han
    Zhang, Rui
    Nie, Feiping
    Li, Xuelong
    [J]. NEUROCOMPUTING, 2019, 366 : 194 - 207
  • [7] Subspace clustering guided unsupervised feature selection
    Zhu, Pengfei
    Zhu, Wencheng
    Hu, Qinghua
    Zhang, Changqing
    Zuo, Wangmeng
    [J]. PATTERN RECOGNITION, 2017, 66 : 364 - 374
  • [8] Unsupervised Feature Selection with Joint Clustering Analysis
    An, Shuai
    Wang, Jun
    Wei, Jinmao
    Yang, Zhenglu
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1639 - 1648
  • [9] A unifying criterion for unsupervised clustering and feature selection
    Breaban, Mihaela
    Luchian, Henri
    [J]. PATTERN RECOGNITION, 2011, 44 (04) : 854 - 865
  • [10] On feature selection through clustering
    Butterworth, R
    Piatetsky-Shapiro, G
    Simovici, DA
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 581 - 584