Iterative column subset selection

被引:7
|
作者
Ordozgoiti, Bruno [1 ]
Gomez Canaval, Sandra [1 ]
Mozo, Alberto [1 ]
机构
[1] Univ Politecn Madrid, Dept Comp Syst, Madrid, Spain
基金
欧盟地平线“2020”;
关键词
Column subset selection; Unsupervised feature selection; Dimensionality reduction; Machine learning; Data mining; UNSUPERVISED FEATURE-SELECTION; FACE RECOGNITION; RANK; DECOMPOSITION; RELEVANCE;
D O I
10.1007/s10115-017-1115-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction is often a crucial step for the successful application of machine learning and data mining methods. One way to achieve said reduction is feature selection. Due to the impossibility of labelling many data sets, unsupervised approaches are frequently the only option. The column subset selection problem translates naturally to this purpose and has received considerable attention over the last few years, as it provides simple linear models for low-rank data reconstruction. Recently, it was empirically shown that an iterative algorithm, which can be implemented efficiently, provides better subsets than other state-of-the-art methods. In this paper, we describe this algorithm and provide a more in-depth analysis. We carry out numerous experiments to gain insights on its behaviour and derive a simple bound for the norm recovered by the resulting matrix. To the best of our knowledge, this is the first theoretical result of this kind for this algorithm.
引用
收藏
页码:65 / 94
页数:30
相关论文
共 50 条
  • [31] Unsupervised Band Selection Method Based on Importance-Assisted Column Subset Selection
    Luo, Xiaoyan
    Shen, Zhiqi
    Xue, Rui
    Wan, Han
    IEEE ACCESS, 2019, 7 : 517 - 527
  • [32] Addressing Feature Drift in Data Streams Using Iterative Subset Selection
    Yuan, Lanqin
    Pfahringer, Bernhard
    Barddal, Jean Paul
    APPLIED COMPUTING REVIEW, 2019, 19 (01): : 20 - 33
  • [33] Gene subset selection using an iterative approach based on genetic algorithms
    Mohamad M.S.
    Omatu S.
    Deris S.
    Yoshioka M.
    Artif. Life Rob., 2009, 1 (12-15): : 12 - 15
  • [34] An Explicit Sampling Dependent Spectral Error Bound for Column Subset Selection
    Yang, Tianbao
    Zhang, Lijun
    Jin, Rong
    Zhu, Shenghuo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 135 - 143
  • [35] Greedy column subset selection for large-scale data sets
    Ahmed K. Farahat
    Ahmed Elgohary
    Ali Ghodsi
    Mohamed S. Kamel
    Knowledge and Information Systems, 2015, 45 : 1 - 34
  • [36] Greedy column subset selection for large-scale data sets
    Farahat, Ahmed K.
    Elgohary, Ahmed
    Ghodsi, Ali
    Kamel, Mohamed S.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (01) : 1 - 34
  • [37] A Comparison of Differential Evolution and Genetic Algorithms for the Column Subset Selection Problem
    Kromer, Pavel
    Platos, Jan
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 223 - 232
  • [38] On a new method for controlling the entire spectrum in the problem of column subset selection
    Chretien, Stephan
    Darses, Sebastien
    EXPOSITIONES MATHEMATICAE, 2019, 37 (03) : 314 - 321
  • [39] Evaluating column subset selection methods for endmember extraction in hyperspectral unmixing
    Aldeghlawi, Maher
    Alkhatib, Mohammed Q.
    Velez-Reyes, Miguel
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY XXVI, 2020, 11392
  • [40] Using a column subset selection method for endmember extraction in hyperspectral unmixing
    Velez-Reyes, Miguel
    Aldeghlawi, Maher
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XXIV, 2018, 10644