Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression

Cited by: 12
Authors
Parodi, Alice [1 ]
Reimherr, Matthew [2 ]
Affiliations
[1] Politecn Milan, MOX Dept Math, Milan, Italy
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
Source
ELECTRONIC JOURNAL OF STATISTICS | 2018, Vol. 12, No. 2
Keywords
Nonlinear regression; variable selection; functional data analysis; reproducing kernel Hilbert space; minimax convergence; VARYING-COEFFICIENT MODELS; ADAPTIVE LASSO; CHILDHOOD;
DOI
10.1214/18-EJS1509
Chinese Library Classification
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
We present a new methodology, called FLAME, which simultaneously selects important predictors and produces smooth estimates in a function-on-scalar linear model with a large number of scalar predictors. Our framework applies quite generally by viewing the functional outcomes as elements of an arbitrary real separable Hilbert space. To select important predictors while also producing smooth parameter estimates, we utilize operators to define subspaces that are imbued with certain desirable properties as determined by the practitioner and the setting, such as smoothness or periodicity. In special cases one can show that these subspaces correspond to Reproducing Kernel Hilbert Spaces, however our methodology applies more broadly. We provide a very fast algorithm for computing the estimators, which is based on a functional coordinate descent, and an R package, flm, whose backend is written in C++. Asymptotic properties of the estimators are developed and simulations are provided to illustrate the advantages of FLAME over existing methods, both in terms of statistical performance and computational efficiency. We conclude with an application to childhood asthma, where we find a potentially important genetic mutation that was not selected by previous functional data based methods.
Pages: 4602-4639
Number of pages: 38