Large-Scale Nonlinear Variable Selection via Kernel Random Features

被引:0
|
作者
Gregorova, Magda [1 ,2 ]
Ramapuram, Jason [1 ,2 ]
Kalousis, Alexandros [1 ,2 ]
Marchand-Maillet, Stephane [2 ]
机构
[1] HES SO, Geneva Sch Business Adm, Geneva, Switzerland
[2] Univ Geneva, Geneva, Switzerland
关键词
SPARSITY;
D O I
10.1007/978-3-030-10928-8_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new method for input variable selection in nonlinear regression. The method is embedded into a kernel regression machine that can model general nonlinear functions, not being a priori limited to additive models. This is the first kernel-based variable selection method applicable to large datasets. It sidesteps the typical poor scaling properties of kernel methods by mapping the inputs into a relatively low-dimensional space of random features. The algorithm discovers the variables relevant for the regression task together with learning the prediction model through learning the appropriate nonlinear random feature maps. We demonstrate the outstanding performance of our method on a set of large-scale synthetic and real datasets. Code related to this paper is available at: https://bitbucket.org/dmmlgeneva/srff_pytorch.
引用
收藏
页码:177 / 192
页数:16
相关论文
共 50 条
  • [1] LARGE-SCALE RANDOM FEATURES FOR KERNEL REGRESSION
    Laparra, Valero
    Gonzalez, Diego Marcos
    Tuia, Devis
    Camps-Valls, Gustau
    [J]. 2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 17 - 20
  • [2] Online Adaptive Kernel Learning with Random Features for Large-scale Nonlinear Classification
    Chen, Yingying
    Yang, Xiaowei
    [J]. PATTERN RECOGNITION, 2022, 131
  • [3] Data-dependent compression of random features for large-scale kernel approximation
    Agrawal, Raj
    Campbell, Trevor
    Huggins, Jonathan
    Broderick, Tamara
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [4] KERNEL COMPUTATIONS FROM LARGE-SCALE RANDOM FEATURES OBTAINED BY OPTICAL PROCESSING UNITS
    Ohana, Ruben
    Wacker, Jonas
    Dong, Jonathan
    Marmin, Sebastien
    Krzakala, Florent
    Filippone, Maurizio
    Daudet, Laurent
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 9294 - 9298
  • [5] Large-scale Online Kernel Learning with Random Feature Reparameterization
    Tu Dinh Nguyen
    Le, Trung
    Bui, Hung
    Phung, Dinh
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2543 - 2549
  • [6] Variable selection in latent variable models via knockoffs: an application to international large-scale assessment in education
    Xie, Zilong
    Chen, Yunxiao
    von Davier, Matthias
    Weng, Haolei
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2023, 187 (03) : 723 - 747
  • [7] Use of Wrapper Algorithms Coupled with a Random Forests Classifier for Variable Selection in Large-Scale Genomic Association Studies
    Rodin, Andrei S.
    Litvinenko, Anatoliy
    Klos, Kathy
    Morrison, Alanna C.
    Woodage, Trevor
    Coresh, Josef
    Boerwinkle, Eric
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (12) : 1705 - 1718
  • [8] MCMC methods for bayesian variable selection in large-scale genomic applications
    Zucknick, Manuela
    Holmes, Chris
    Richardson, Sylvia
    [J]. ANNALS OF HUMAN GENETICS, 2007, 71 : 558 - 559
  • [9] LARGE-SCALE REGULARIZED PORTFOLIO SELECTION VIA CONVEX OPTIMIZATION
    Zhao, Ziping
    Palomar, Daniel P.
    [J]. 2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [10] A Multiblock Kernel Dynamic Latent Variable Model for Large-Scale Industrial Process Monitoring
    Li, Chen
    Li, Gang
    Chen, Xu
    Zhou, Ping
    He, Xiao
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71