Iterative column subset selection

被引:7
|
作者
Ordozgoiti, Bruno [1 ]
Gomez Canaval, Sandra [1 ]
Mozo, Alberto [1 ]
机构
[1] Univ Politecn Madrid, Dept Comp Syst, Madrid, Spain
基金
欧盟地平线“2020”;
关键词
Column subset selection; Unsupervised feature selection; Dimensionality reduction; Machine learning; Data mining; UNSUPERVISED FEATURE-SELECTION; FACE RECOGNITION; RANK; DECOMPOSITION; RELEVANCE;
D O I
10.1007/s10115-017-1115-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction is often a crucial step for the successful application of machine learning and data mining methods. One way to achieve said reduction is feature selection. Due to the impossibility of labelling many data sets, unsupervised approaches are frequently the only option. The column subset selection problem translates naturally to this purpose and has received considerable attention over the last few years, as it provides simple linear models for low-rank data reconstruction. Recently, it was empirically shown that an iterative algorithm, which can be implemented efficiently, provides better subsets than other state-of-the-art methods. In this paper, we describe this algorithm and provide a more in-depth analysis. We carry out numerous experiments to gain insights on its behaviour and derive a simple bound for the norm recovered by the resulting matrix. To the best of our knowledge, this is the first theoretical result of this kind for this algorithm.
引用
收藏
页码:65 / 94
页数:30
相关论文
共 50 条
  • [1] Iterative column subset selection
    Bruno Ordozgoiti
    Sandra Gómez Canaval
    Alberto Mozo
    Knowledge and Information Systems, 2018, 54 : 65 - 94
  • [2] A Note on Column Subset Selection
    Youssef, Pierre
    INTERNATIONAL MATHEMATICS RESEARCH NOTICES, 2014, 2014 (23) : 6431 - 6447
  • [3] Regularized greedy column subset selection
    Ordozgoiti, Bruno
    Mozo, Alberto
    Garcia Lopez de Lacalle, Jesus
    INFORMATION SCIENCES, 2019, 486 : 393 - 418
  • [4] Distributed Column Subset Selection on MapReduce
    Farahat, Ahmed K.
    Elgohary, Ahmed
    Ghodsi, Ali
    Kamel, Mohamed S.
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 171 - 180
  • [5] A Comparison of Column Subset Selection Methods for Unsupervised Band Subset Selection in Hyperspectral Imagery
    Aldeghlawi, Maher
    Velez-Reyes, Miguel
    2018 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI), 2018, : 57 - 60
  • [6] Deterministic and iterative solutions to subset selection problems
    Nafie, M
    Tewfik, AH
    Ali, M
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (07) : 1591 - 1601
  • [7] A determinantal point process for column subset selection
    Belhadji, Ayoub
    Bardenet, Rémi
    Chainais, Pierre
    Journal of Machine Learning Research, 2020, 21
  • [8] A determinantal point process for column subset selection
    Belhadji, Ayoub
    Bardenet, Remi
    Chainais, Pierre
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [9] Genetic Algorithm for the Column Subset Selection Problem
    Kroemer, Pavel
    Platos, Jan
    Snasel, Vaclav
    2014 EIGHTH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS),, 2014, : 16 - 22
  • [10] Column subset selection is NP-complete
    Shitov, Yaroslav
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2021, 610 : 52 - 58