A selective overview of feature screening for ultrahigh-dimensional data

被引:0
|
作者
LIU JingYuan [1 ,2 ,3 ]
ZHONG Wei [2 ,1 ,3 ]
LI RunZe [4 ]
机构
[1] Department of Statistics, School of Economics, Xiamen University
[2] Wang Yanan Institute for Studies in Economics, Xiamen University
[3] Fujian Key Laboratory of Statistical Science, Xiamen University
[4] Department of Statistics and The Methodology Center, Pennsylvania State University,University Park
基金
中央高校基本科研业务费专项资金资助; 美国国家卫生研究院; 美国国家科学基金会; 中国国家自然科学基金;
关键词
correlation learning; distance correlation; sure independence screening; sure joint screening; sure screening property; ultrahigh-dim;
D O I
暂无
中图分类号
O212.4 [多元分析];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High-dimensional data have frequently been collected in many scientific areas including genomewide association study, biomedical imaging, tomography, tumor classifications, and finance. Analysis of highdimensional data poses many challenges for statisticians. Feature selection and variable selection are fundamental for high-dimensional data analysis. The sparsity principle, which assumes that only a small number of predictors contribute to the response, is frequently adopted and deemed useful in the analysis of high-dimensional data.Following this general principle, a large number of variable selection approaches via penalized least squares or likelihood have been developed in the recent literature to estimate a sparse model and select significant variables simultaneously. While the penalized variable selection methods have been successfully applied in many highdimensional analyses, modern applications in areas such as genomics and proteomics push the dimensionality of data to an even larger scale, where the dimension of data may grow exponentially with the sample size. This has been called ultrahigh-dimensional data in the literature. This work aims to present a selective overview of feature screening procedures for ultrahigh-dimensional data. We focus on insights into how to construct marginal utilities for feature screening on specific models and motivation for the need of model-free feature screening procedures.
引用
收藏
页码:2033 / 2054
页数:22
相关论文
共 50 条
  • [31] Feature selection of ultrahigh-dimensional covariates with survival outcomes: a selective review
    Hong, Hyokyoung Grace
    Li, Yi
    [J]. APPLIED MATHEMATICS-A JOURNAL OF CHINESE UNIVERSITIES SERIES B, 2017, 32 (04): : 379 - 396
  • [32] Feature selection of ultrahigh-dimensional covariates with survival outcomes:a selective review
    HONG Hyokyoung Grace
    LI Yi
    [J]. Applied Mathematics:A Journal of Chinese Universities, 2017, 32 (04) : 379 - 396
  • [33] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Liu, Yi
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2019, 35 (04): : 845 - 861
  • [34] Feature screening for ultrahigh-dimensional binary classification via linear projection
    Lai, Peng
    Wang, Mingyue
    Song, Fengli
    Zhou, Yanqiu
    [J]. AIMS MATHEMATICS, 2023, 8 (06): : 14270 - 14287
  • [35] Feature screening in ultrahigh-dimensional varying-coefficient Cox model
    Yang, Guangren
    Zhang, Ling
    Li, Runze
    Huang, Yuan
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 284 - 297
  • [36] Feature selection of ultrahigh-dimensional covariates with survival outcomes: a selective review
    Hyokyoung Grace Hong
    Yi Li
    [J]. Applied Mathematics-A Journal of Chinese Universities, 2017, 32 : 379 - 396
  • [37] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
    Yi Liu
    [J]. Acta Mathematicae Applicatae Sinica, English Series, 2019, 35 : 845 - 861
  • [38] Unified mean-variance feature screening for ultrahigh-dimensional regression
    Liming Wang
    Xingxiang Li
    Xiaoqing Wang
    Peng Lai
    [J]. Computational Statistics, 2022, 37 : 1887 - 1918
  • [39] Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems
    Nandy, Debmalya
    Chiaromonte, Francesca
    Li, Runze
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1516 - 1529
  • [40] Unified mean-variance feature screening for ultrahigh-dimensional regression
    Wang, Liming
    Li, Xingxiang
    Wang, Xiaoqing
    Lai, Peng
    [J]. COMPUTATIONAL STATISTICS, 2022, 37 (04) : 1887 - 1918