A new unsupervised feature selection algorithm using similarity-based feature clustering

被引:32
|
作者
Zhu, Xiaoyan [1 ]
Wang, Yu [1 ]
Li, Yingbin [1 ]
Tan, Yonghui [1 ]
Wang, Guangtao [2 ]
Song, Qinbao [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian, Shaanxi, Peoples R China
[2] JD AI Res, Mountain View, CA USA
基金
中国国家自然科学基金;
关键词
clustering; feature selection; feature similarity; CLASSIFICATION;
D O I
10.1111/coin.12192
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised feature selection is an important problem, especially for high-dimensional data. However, until now, it has been scarcely studied and the existing algorithms cannot provide satisfying performance. Thus, in this paper, we propose a new unsupervised feature selection algorithm using similarity-based feature clustering, Feature Selection-based Feature Clustering (FSFC). FSFC removes redundant features according to the results of feature clustering based on feature similarity. First, it clusters the features according to their similarity. A new feature clustering algorithm is proposed, which overcomes the shortcomings of K-means. Second, it selects a representative feature from each cluster, which contains most interesting information of features in the cluster. The efficiency and effectiveness of FSFC are tested upon real-world data sets and compared with two representative unsupervised feature selection algorithms, Feature Selection Using Similarity (FSUS) and Multi-Cluster-based Feature Selection (MCFS) in terms of runtime, feature compression ratio, and the clustering results of K-means. The results show that FSFC can not only reduce the feature space in less time, but also significantly improve the clustering performance of K-means.
引用
下载
收藏
页码:2 / 22
页数:21
相关论文
共 50 条
  • [31] Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection
    Chen, Yifei
    Sun, Yuxing
    Han, Bing-Qing
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [32] Unsupervised Feature Selection Using Binary Bat Algorithm
    Rani, A. Sylvia Selva
    Rajalaxmi, R. R.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 451 - 456
  • [33] Feature Selection Using Differential Evolution for Unsupervised Image Clustering
    Gutoski, Matheus
    Ribeiro, Manasses
    Romero Aquino, Nelson Marcelo
    Hattori, Leandro Takeshi
    Lazzaretti, Andre Eugenio
    Lopes, Heitor Silverio
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2018, PT I, 2018, 10841 : 376 - 385
  • [34] Unsupervised Feature Selection Algorithm Based on Sparse Representation
    Cui, Guoqing
    Yang, Jie
    Zareapoor, Masoumeh
    Wang, Jiechen
    2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 1028 - 1033
  • [35] UNSUPERVISED FEATURE SELECTION BASED ON FEATURE RELEVANCE
    Zhang, Feng
    Zhao, Ya-Jun
    Chen, Jun-Fen
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 487 - +
  • [36] Unsupervised feature selection based extreme learning machine for clustering
    Jichao Chen
    Yijie Zeng
    Yue Li
    Guang-Bin Huang
    NEUROCOMPUTING, 2020, 386 : 198 - 207
  • [37] Similarity-Based Feature Selection for Learning from Examples with Continuous Values
    Li, Yun
    Hu, Su-Jun
    Yang, Wen-Jie
    Sun, Guo-Zi
    Yao, Fang-Wu
    Yang, Geng
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 957 - 964
  • [38] Feature reduction for imbalanced data classification using similarity-based feature clustering with adaptive weighted K-nearest neighbors
    Sun, Lin
    Zhang, Jiuxiao
    Ding, Weiping
    Xu, Jiucheng
    INFORMATION SCIENCES, 2022, 593 : 591 - 613
  • [39] Similarity-based online feature selection in content-based image retrieval
    Jiang, W
    Er, G
    Dai, QH
    Gu, JW
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (03) : 702 - 712
  • [40] Subspace clustering guided unsupervised feature selection
    Zhu, Pengfei
    Zhu, Wencheng
    Hu, Qinghua
    Zhang, Changqing
    Zuo, Wangmeng
    PATTERN RECOGNITION, 2017, 66 : 364 - 374