A selective overview of feature screening for ultrahigh-dimensional data

被引：0

作者：

JingYuan Liu

Wei Zhong

RunZe Li

机构：

[1] Xiamen University,Department of Statistics, School of Economics

[2] Xiamen University,Wang Yanan Institute for Studies in Economics

[3] Xiamen University,Fujian Key Laboratory of Statistical Science

[4] Pennsylvania State University,Department of Statistics and The Methodology Center

来源：

Science China Mathematics | 2015年 / 58卷

关键词：

correlation learning; distance correlation; sure independence screening; sure joint screening; sure screening property; ultrahigh-dimensional data; 62H12; 62H20;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

High-dimensional data have frequently been collected in many scientific areas including genomewide association study, biomedical imaging, tomography, tumor classifications, and finance. Analysis of highdimensional data poses many challenges for statisticians. Feature selection and variable selection are fundamental for high-dimensional data analysis. The sparsity principle, which assumes that only a small number of predictors contribute to the response, is frequently adopted and deemed useful in the analysis of high-dimensional data. Following this general principle, a large number of variable selection approaches via penalized least squares or likelihood have been developed in the recent literature to estimate a sparse model and select significant variables simultaneously. While the penalized variable selection methods have been successfully applied in many highdimensional analyses, modern applications in areas such as genomics and proteomics push the dimensionality of data to an even larger scale, where the dimension of data may grow exponentially with the sample size. This has been called ultrahigh-dimensional data in the literature. This work aims to present a selective overview of feature screening procedures for ultrahigh-dimensional data. We focus on insights into how to construct marginal utilities for feature screening on specific models and motivation for the need of model-free feature screening procedures.

引用

页码：1 / 22

页数：21

共 50 条

[41] Covariate Information Number for Feature Screening in Ultrahigh-Dimensional Supervised Problems
Nandy, Debmalya
Chiaromonte, Francesca
Li, Runze
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1516 - 1529
[42] Unified mean-variance feature screening for ultrahigh-dimensional regression
Wang, Liming
Li, Xingxiang
Wang, Xiaoqing
Lai, Peng
COMPUTATIONAL STATISTICS, 2022, 37 (04) : 1887 - 1918
[43] Feature Screening for Ultrahigh-dimensional Censored Data with Varying Coefficient Single-index Model
Yi LIU
Acta Mathematicae Applicatae Sinica, 2019, 35 (04) : 845 - 861
[44] Feature screening based on distance correlation for ultrahigh-dimensional censored data with covariate measurement error
Chen, Li-Pang
COMPUTATIONAL STATISTICS, 2021, 36 (02) : 857 - 884
[45] Fast robust feature screening for ultrahigh-dimensional varying coefficient models
Ma, Xuejun
Chen, Xin
Zhang, Jingxiao
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2017, 87 (04) : 724 - 732
[46] Feature screening based on distance correlation for ultrahigh-dimensional censored data with covariate measurement error
Li-Pang Chen
Computational Statistics, 2021, 36 : 857 - 884
[47] Feature screening and FDR control with knockoff features for ultrahigh-dimensional right-censored data
Pan, Yingli
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 173
[48] Spearman Rank Correlation Screening for Ultrahigh-Dimensional Censored Data
Wang, Hongni
Yan, Jingxin
Yan, Xiaodong
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10104 - 10112
[49] Robust conditional nonparametric independence screening for ultrahigh-dimensional data
Zhang, Shucong
Pan, Jing
Zhou, Yong
STATISTICS & PROBABILITY LETTERS, 2018, 143 : 95 - 101
[50] A new nonparametric screening method for ultrahigh-dimensional survival data
Liu, Yanyan
Zhang, Jing
Zhao, Xingqiu
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 119 : 74 - 85

← 1 2 3 4 5 →