Efficient greedy feature selection for unsupervised learning

Authors
Ahmed K. Farahat
Ali Ghodsi
Mohamed S. Kamel
Affiliations
[1] University of Waterloo, Department of Electrical and Computer Engineering
[2] University of Waterloo, Department of Statistics and Actuarial Science
Keywords
Feature selection; Greedy algorithms; Unsupervised learning
Abstract
Reducing the dimensionality of the data has been a challenging task in data mining and machine learning applications. In these applications, the existence of irrelevant and redundant features negatively affects the efficiency and effectiveness of different learning algorithms. Feature selection is one of the dimension reduction techniques, which has been used to allow a better understanding of data and improve the performance of other learning tasks. Although the selection of relevant features has been extensively studied in supervised learning, feature selection in the absence of class labels is still a challenging task. This paper proposes a novel method for unsupervised feature selection, which efficiently selects features in a greedy manner. The paper first defines an effective criterion for unsupervised feature selection that measures the reconstruction error of the data matrix based on the selected subset of features. The paper then presents a novel algorithm for greedily minimizing the reconstruction error based on the features selected so far. The greedy algorithm is based on an efficient recursive formula for calculating the reconstruction error. Experiments on real data sets demonstrate the effectiveness of the proposed algorithm in comparison with the state-of-the-art methods for unsupervised feature selection.
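The criterion described in the abstract can be illustrated with a small sketch. The code below is not the authors' implementation: it assumes the reconstruction error is the squared Frobenius norm of the data matrix minus its projection onto the span of the selected feature columns, and it uses a plain deflation step with a full Gram matrix rather than the paper's efficient recursive formula. The names `greedy_select` and `reconstruction_error` are hypothetical.

```python
import numpy as np

def reconstruction_error(A, selected):
    """Squared Frobenius error of approximating A from its selected columns.

    Assumption: the error is ||A - P_S A||_F^2, where P_S projects onto
    the span of the columns A[:, selected].
    """
    A = np.asarray(A, dtype=float)
    A_s = A[:, list(selected)]
    # Least-squares fit of every column of A from the selected columns.
    coef = np.linalg.lstsq(A_s, A, rcond=None)[0]
    return float(np.linalg.norm(A - A_s @ coef) ** 2)

def greedy_select(A, k, tol=1e-12):
    """Greedily pick up to k columns (features) of A.

    Illustrative sketch only: at each step it picks the column whose
    addition reduces the reconstruction error the most, then removes
    that column's direction from the residual matrix (deflation).
    """
    E = np.array(A, dtype=float)      # residual, starts as the data itself
    selected = []
    for _ in range(k):
        G = E.T @ E                   # Gram matrix of the residual columns
        norms = np.diag(G).copy()     # squared norm of each residual column
        gains = (G ** 2).sum(axis=0)  # ||E^T e_i||^2 for each column i
        valid = norms > tol
        valid[selected] = False       # never pick the same feature twice
        if not valid.any():
            break                     # residual is numerically zero
        scores = np.full(E.shape[1], -np.inf)
        scores[valid] = gains[valid] / norms[valid]
        i = int(np.argmax(scores))    # largest drop in reconstruction error
        e = E[:, i].copy()
        E -= np.outer(e, e @ E) / (e @ e)  # project out the chosen direction
        selected.append(i)
    return selected

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data = rng.standard_normal((30, 8))   # 30 instances, 8 features
    chosen = greedy_select(data, 3)
    print(chosen, reconstruction_error(data, chosen))
```

Each step of this sketch forms the full Gram matrix of the residual, so it is only suitable for a small number of features; the abstract states that the proposed algorithm relies on an efficient recursive formula instead.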
Pages: 285 - 310
Page count: 25
Related papers
50 in total
  • [21] Feature selection method based on unsupervised learning
    Zhang Li
    Sun Gang
    Guo Jun
    [J]. PROCEEDINGS OF 2004 CHINESE CONTROL AND DECISION CONFERENCE, 2004, : 218 - 220
  • [22] Robust Spectral Learning for Unsupervised Feature Selection
    Shi, Lei
    Du, Liang
    Shen, Yi-Dong
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 977 - 982
  • [23] Greedy-Based Feature Selection for Efficient LiDAR SLAM
    Jiao, Jianhao
    Zhu, Yilong
    Ye, Haoyang
    Huang, Huaiyang
    Yun, Peng
    Jiang, Linxin
    Wang, Lujia
    Liu, Ming
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5222 - 5228
  • [24] UFSSF - An Efficient Unsupervised Feature Selection for Streaming Features
    Almusallam, Naif
    Tari, Zahir
    Chan, Jeffrey
    AlHarthi, Adil
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 493 - 505
  • [25] Efficient information-theoretic unsupervised feature selection
    Lee, J.
    Seo, W.
    Kim, D. -W.
    [J]. ELECTRONICS LETTERS, 2018, 54 (02) : 76 - 77
  • [26] Unsupervised Feature Selection via Collaborative Embedding Learning
    Li, Junyu
    Qi, Fei
    Sun, Xin
    Zhang, Bin
    Xu, Xiangmin
    Cai, Hongmin
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (03) : 2529 - 2540
  • [27] Local and Global Discriminative Learning for Unsupervised Feature Selection
    Du, Liang
    Shen, Zhiyong
    Li, Xuan
    Zhou, Peng
    Shen, Yi-Dong
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 131 - 140
  • [28] Joint dictionary and graph learning for unsupervised feature selection
    Deqiong Ding
    Fei Xia
    Xiaogao Yang
    Chang Tang
    [J]. Applied Intelligence, 2020, 50 : 1379 - 1397
  • [29] UNIFIED APPROACH TO FEATURE SELECTION AND LEARNING IN UNSUPERVISED ENVIRONMENTS
    LAKSHMINARASIMHAN, AL
    DASARATHY, BV
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1975, 24 (09) : 948 - 952
  • [30] Unsupervised feature selection with adaptive multiple graph learning
    Zhou, Peng
    Du, Liang
    Li, Xuejun
    Shen, Yi-Dong
    Qian, Yuhua
    [J]. PATTERN RECOGNITION, 2020, 105