Iterative column subset selection

被引:7
|
作者
Ordozgoiti, Bruno [1 ]
Gomez Canaval, Sandra [1 ]
Mozo, Alberto [1 ]
机构
[1] Univ Politecn Madrid, Dept Comp Syst, Madrid, Spain
基金
欧盟地平线“2020”;
关键词
Column subset selection; Unsupervised feature selection; Dimensionality reduction; Machine learning; Data mining; UNSUPERVISED FEATURE-SELECTION; FACE RECOGNITION; RANK; DECOMPOSITION; RELEVANCE;
D O I
10.1007/s10115-017-1115-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dimensionality reduction is often a crucial step for the successful application of machine learning and data mining methods. One way to achieve said reduction is feature selection. Due to the impossibility of labelling many data sets, unsupervised approaches are frequently the only option. The column subset selection problem translates naturally to this purpose and has received considerable attention over the last few years, as it provides simple linear models for low-rank data reconstruction. Recently, it was empirically shown that an iterative algorithm, which can be implemented efficiently, provides better subsets than other state-of-the-art methods. In this paper, we describe this algorithm and provide a more in-depth analysis. We carry out numerous experiments to gain insights on its behaviour and derive a simple bound for the norm recovered by the resulting matrix. To the best of our knowledge, this is the first theoretical result of this kind for this algorithm.
引用
收藏
页码:65 / 94
页数:30
相关论文
共 50 条
  • [21] Equity Factor Analysis via Column Subset Selection
    Boutsidis, Christos
    Malioutov, Dmitry
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 1131 - 1131
  • [22] Interlacing Polynomial Method for the Column Subset Selection Problem
    Cai, Jian-Feng
    Xu, Zhiqiang
    Xu, Zili
    INTERNATIONAL MATHEMATICS RESEARCH NOTICES, 2024, 2024 (09) : 7798 - 7819
  • [23] An Improved Approximation Algorithm for the Column Subset Selection Problem
    Boutsidis, Christos
    Mahoney, Michael W.
    Drineas, Petros
    PROCEEDINGS OF THE TWENTIETH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2009, : 968 - +
  • [24] Column Subset Selection Problem is UG-hard
    Civril, A.
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (04) : 849 - 859
  • [25] An Empirical Comparison of Sampling Techniques for Matrix Column Subset Selection
    Wang, Yining
    Singh, Aarti
    2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2015, : 1069 - 1074
  • [26] Greedy Column Subset Selection: New Bounds and Distributed Algorithms
    Altschuler, Jason
    Bhaskara, Aditya
    Fu, Gang
    Mirrokni, Vahab
    Rostamizadeh, Afshin
    Zadimoghaddam, Morteza
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [27] Optimal column subset selection for image classification by genetic algorithms
    Pavel Krömer
    Jan Platoš
    Jana Nowaková
    Václav Snášel
    Annals of Operations Research, 2018, 265 : 205 - 222
  • [28] Towards a Zero-One Law for Column Subset Selection
    Song, Zhao
    Woodruff, David P.
    Zhong, Peilin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [29] Optimal column subset selection for image classification by genetic algorithms
    Kroemer, Pavel
    Platos, Jan
    Nowakova, Jana
    Snasel, Vaclav
    ANNALS OF OPERATIONS RESEARCH, 2018, 265 (02) : 205 - 222
  • [30] Column Subset Selection with Missing Data via Active Sampling
    Wang, Yining
    Singh, Aarti
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 1033 - 1041