Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression

Cited by: 12
Authors
Parodi, Alice [1]
Reimherr, Matthew [2]
Affiliations
[1] Politecn Milan, MOX Dept Math, Milan, Italy
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
Source
ELECTRONIC JOURNAL OF STATISTICS | 2018, Volume 12, Issue 2
Keywords
Nonlinear regression; variable selection; functional data analysis; reproducing kernel Hilbert space; minimax convergence; VARYING-COEFFICIENT MODELS; ADAPTIVE LASSO; CHILDHOOD;
DOI
10.1214/18-EJS1509
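Chinese Library Classification (CLC) number
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract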
We present a new methodology, called FLAME, which simultaneously selects important predictors and produces smooth estimates in a function-on-scalar linear model with a large number of scalar predictors. Our framework applies quite generally by viewing the functional outcomes as elements of an arbitrary real separable Hilbert space. To select important predictors while also producing smooth parameter estimates, we utilize operators to define subspaces that are imbued with certain desirable properties as determined by the practitioner and the setting, such as smoothness or periodicity. In special cases one can show that these subspaces correspond to Reproducing Kernel Hilbert Spaces; however, our methodology applies more broadly. We provide a very fast algorithm for computing the estimators, which is based on a functional coordinate descent, and an R package, flm, whose backend is written in C++. Asymptotic properties of the estimators are developed and simulations are provided to illustrate the advantages of FLAME over existing methods, both in terms of statistical performance and computational efficiency. We conclude with an application to childhood asthma, where we find a potentially important genetic mutation that was not selected by previous functional data based methods.
Pages: 4602-4639
Page count: 38
Related papers
  • [1] Simultaneous variable selection, clustering, and smoothing in function-on-scalar regression
    Mehrotra, Suchit
    Maity, Arnab
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (01): 180-199
  • [2] High-Dimensional Spatial Quantile Function-on-Scalar Regression
    Zhang, Zhengwu
    Wang, Xiao
    Kong, Linglong
    Zhu, Hongtu
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539): 1563-1578
  • [3] Variable selection in function-on-scalar regression
    Chen, Yakuan
    Goldsmith, Jeff
    Ogden, R. Todd
    STAT, 2016, 5 (01): 88-101
  • [4] Variable selection in nonlinear function-on-scalar regression
    Ghosal, Rahul
    Maity, Arnab
    BIOMETRICS, 2023, 79 (01): 292-303
  • [5] Robust estimation and variable selection for function-on-scalar regression
    Cai, Xiong
    Xue, Liugen
    Cao, Jiguo
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (01): 162-179
  • [6] Adaptive function-on-scalar regression with a smoothing elastic net
    Mirshani, Ardalan
    Reimherr, Matthew
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 185
  • [7] MCEN: a method of simultaneous variable selection and clustering for high-dimensional multinomial regression
    Ren, Sheng
    Kang, Emily L.
    Lu, Jason L.
    STATISTICS AND COMPUTING, 2020, 30 (02): 291-304
  • [8] SPATIAL BAYESIAN VARIABLE SELECTION AND GROUPING FOR HIGH-DIMENSIONAL SCALAR-ON-IMAGE REGRESSION
    Li, Fan
    Zhang, Tingting
    Wang, Quanli
    Gonzalez, Marlen Z.
    Maresh, Erin L.
    Coan, James A.
    ANNALS OF APPLIED STATISTICS, 2015, 9 (02): 687-713
  • [9] A stepwise regression algorithm for high-dimensional variable selection
    Hwang, Jing-Shiang
    Hu, Tsuey-Hwa
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (09): 1793-1806