Unsupervised feature selection with ensemble learning

被引:52
|
作者
Elghazel, Haytham [1 ,2 ]
Aussem, Alex [1 ,2 ]
机构
[1] Univ Lyon, F-69622 Lyon, France
[2] Univ Lyon 1, LIRIS, UMR 5205, F-69622 Villeurbanne, France
关键词
Unsupervised learning; Feature selection; Ensemble methods; Random forest; CLUSTERING ENSEMBLES; FEATURE RANKING; CLASSIFICATION; DISCOVERY; CONSENSUS; GENES;
D O I
10.1007/s10994-013-5337-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we show that the way internal estimates are used to measure variable importance in Random Forests are also applicable to feature selection in unsupervised learning. We propose a new method called Random Cluster Ensemble (RCE for short), that estimates the out-of-bag feature importance from an ensemble of partitions. Each partition is constructed using a different bootstrap sample and a random subset of the features. We provide empirical results on nineteen benchmark data sets indicating that RCE, boosted with a recursive feature elimination scheme (RFE) (Guyon and Elisseeff, Journal of Machine Learning Research, 3:1157-1182, 2003), can lead to significant improvement in terms of clustering accuracy, over several state-of-the-art supervised and unsupervised algorithms, with a very limited subset of features. The method shows promise to deal with very large domains. All results, datasets and algorithms are available on line (http://perso.univ-lyon1.fr/haytham.elghazel/RCE.zip)
引用
收藏
页码:157 / 180
页数:24
相关论文
共 50 条
  • [1] Unsupervised feature selection with ensemble learning
    Haytham Elghazel
    Alex Aussem
    [J]. Machine Learning, 2015, 98 : 157 - 180
  • [2] Ensemble Method for Unsupervised Feature Selection
    Luo, Yihui
    Xiong, Shuchu
    [J]. ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL IV, PROCEEDINGS, 2009, : 513 - 516
  • [3] Unsupervised feature selection for ensemble of classifiers
    Morita, M
    Oliveira, LS
    Sabourin, R
    [J]. NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 81 - 86
  • [4] Feature selection for unsupervised learning
    Dy, JG
    Brodley, CE
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 845 - 889
  • [5] Feature Selection for Unsupervised Learning
    Adhikary, Jyoti Ranjan
    Murty, M. Narasimha
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 382 - 389
  • [6] Feature selection for unsupervised learning through local learning
    Yao, Jin
    Mao, Qi
    Goodison, Steve
    Mai, Volker
    Sun, Yijun
    [J]. PATTERN RECOGNITION LETTERS, 2015, 53 : 100 - 107
  • [7] Bi-level ensemble method for unsupervised feature selection
    Zhou, Peng
    Wang, Xia
    Du, Liang
    [J]. INFORMATION FUSION, 2023, 100
  • [8] UNSUPERVISED FEATURE SELECTION WITH LOCAL STRUCTURE LEARNING
    Yang, Sheng
    Nie, Feiping
    Li, Xuelong
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 3398 - 3402
  • [9] Joint Dictionary Learning for Unsupervised Feature Selection
    Fan, Yang
    Dai, Jianhua
    Zhang, Qilai
    Liu, Shuai
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: DEEP LEARNING, PT II, 2019, 11728 : 46 - 58
  • [10] UNSUPERVISED FEATURE SELECTION BY JOINT GRAPH LEARNING
    Zhang, Zhihong
    Xiahou, Jianbing
    Liang, Yuanheng
    Chen, Yuhan
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 554 - 558