Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression

Cited by: 12
Authors
Parodi, Alice [1 ]
Reimherr, Matthew [2 ]
Affiliations
[1] Politecn Milan, MOX Dept Math, Milan, Italy
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
Source
ELECTRONIC JOURNAL OF STATISTICS | 2018 / Vol. 12 / Issue 02
Keywords
Nonlinear regression; variable selection; functional data analysis; reproducing kernel Hilbert space; minimax convergence; VARYING-COEFFICIENT MODELS; ADAPTIVE LASSO; CHILDHOOD;
DOI
10.1214/18-EJS1509
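Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes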
020208 ; 070103 ; 0714 ;
Abstract
We present a new methodology, called FLAME, which simultaneously selects important predictors and produces smooth estimates in a function-on-scalar linear model with a large number of scalar predictors. Our framework applies quite generally by viewing the functional outcomes as elements of an arbitrary real separable Hilbert space. To select important predictors while also producing smooth parameter estimates, we utilize operators to define subspaces that are imbued with certain desirable properties as determined by the practitioner and the setting, such as smoothness or periodicity. In special cases one can show that these subspaces correspond to Reproducing Kernel Hilbert Spaces; however, our methodology applies more broadly. We provide a very fast algorithm for computing the estimators, which is based on a functional coordinate descent, and an R package, flm, whose backend is written in C++. Asymptotic properties of the estimators are developed and simulations are provided to illustrate the advantages of FLAME over existing methods, both in terms of statistical performance and computational efficiency. We conclude with an application to childhood asthma, where we find a potentially important genetic mutation that was not selected by previous functional data based methods.
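To make the idea of a coordinate descent over coefficient functions concrete, the following is a minimal sketch and not the FLAME algorithm or the flm package. It assumes the functional outcomes are observed on a common grid of m points, replaces the operator-induced norm described in the abstract with a plain Euclidean group-lasso penalty (so it selects predictors but does no kernel-based smoothing), and uses hypothetical names (`group_coordinate_descent`, `lam`, `B_hat`) chosen only for illustration.

```python
import numpy as np


def group_coordinate_descent(X, Y, lam, n_iter=200, tol=1e-8):
    """Minimise (1/(2n)) * ||Y - X B||_F^2 + lam * sum_j ||B[j]||_2.

    X is (n, p): scalar predictors. Y is (n, m): outcome curves on a grid.
    Row B[j] is the discretised coefficient function of predictor j.
    Illustrative assumption: Euclidean row norm instead of an
    operator-induced (kernel) norm, so no smoothing is performed.
    """
    n, p = X.shape
    B = np.zeros((p, Y.shape[1]))
    scale = (X ** 2).sum(axis=0) / n   # x_j' x_j / n for every column
    R = Y.astype(float).copy()         # residual Y - X @ B (B starts at 0)
    for _ in range(n_iter):
        max_change = 0.0
        for j in range(p):
            if scale[j] == 0.0:
                continue               # an all-zero column carries no signal
            # Correlation of x_j with the partial residual (x_j's term removed).
            z = X[:, j] @ R / n + scale[j] * B[j]
            norm_z = np.linalg.norm(z)
            # Group soft-thresholding: the whole function is kept or zeroed.
            shrink = max(0.0, 1.0 - lam / norm_z) if norm_z > 0 else 0.0
            b_new = shrink * z / scale[j]
            delta = b_new - B[j]
            if np.any(delta != 0):
                R -= np.outer(X[:, j], delta)   # keep the residual in sync
                max_change = max(max_change, float(np.abs(delta).max()))
                B[j] = b_new
        if max_change < tol:
            break
    return B


# Small simulated example: only predictors 0 and 1 have nonzero effects.
rng = np.random.default_rng(0)
n, p, m = 100, 20, 30
t = np.linspace(0.0, 1.0, m)
X = rng.standard_normal((n, p))
B_true = np.zeros((p, m))
B_true[0] = np.sin(2 * np.pi * t)
B_true[1] = np.cos(2 * np.pi * t)
Y = X @ B_true + 0.1 * rng.standard_normal((n, m))

B_hat = group_coordinate_descent(X, Y, lam=0.5)
selected = [int(j) for j in np.flatnonzero(np.linalg.norm(B_hat, axis=1) > 0)]
```

Because the penalty acts on the norm of an entire row, each update either removes a predictor completely or retains its whole coefficient function, which is the selection behaviour the abstract refers to; the smoothing part of the method is deliberately left out of this sketch.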
Pages: 4602 / 4639
Number of pages: 38
Related papers
50 in total
  • [41] High-dimensional quantile regression: Convolution smoothing and concave regularization
    Tan, Kean Ming
    Wang, Lan
    Zhou, Wen-Xin
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2022, 84 (01) : 205 - 233
  • [42] Variable selection in function-on-scalar single-index model via the alternating direction method of multipliers
    Rahul Ghosal
    Arnab Maity
    TEST, 2024, 33 : 106 - 126
  • [43] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02) : 389 - 407
  • [44] Variable selection for high-dimensional incomplete data
    Liang, Lixing
    Zhuang, Yipeng
    Yu, Philip L. H.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 192
  • [45] Variable selection in high-dimensional partially linear additive models for composite quantile regression
    Guo, Jie
    Tang, Manlai
    Tian, Maozai
    Zhu, Kai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 65 : 56 - 67
  • [46] SCAD-penalized quantile regression for high-dimensional data analysis and variable selection
    Amin, Muhammad
    Song, Lixin
    Thorlie, Milton Abdul
    Wang, Xiaoguang
    STATISTICA NEERLANDICA, 2015, 69 (03) : 212 - 235
  • [47] High-dimensional graphs and variable selection with the Lasso
    Meinshausen, Nicolai
    Buehlmann, Peter
    ANNALS OF STATISTICS, 2006, 34 (03) : 1436 - 1462
  • [48] Sparse Bayesian variable selection in high-dimensional logistic regression models with correlated priors
    Ma, Zhuanzhuan
    Han, Zifei
    Ghosh, Souparno
    Wu, Liucang
    Wang, Min
    STATISTICAL ANALYSIS AND DATA MINING, 2024, 17 (01)
  • [49] Variable selection in the single-index quantile regression model with high-dimensional covariates
    Kuruwita, C. N.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (03) : 1120 - 1132
  • [50] Transfer learning for sparse variable selection in high-dimensional regression from quadratic measurement
    Shang, Qingxu
    Li, Jie
    Song, Yunquan
    KNOWLEDGE-BASED SYSTEMS, 2024, 300