Model-Free Conditional Feature Screening with FDR Control

被引:8
|
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 50 条
  • [31] Model-free control
    Fliess, Michel
    Join, Cedric
    INTERNATIONAL JOURNAL OF CONTROL, 2013, 86 (12) : 2228 - 2252
  • [32] On model-free conditional coordinate tests for regressions
    Yu, Zhou
    Zhu, Lixing
    Wen, Xuerong Meggie
    JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 109 : 61 - 72
  • [33] Model-free feature screening based on Hellinger distance for ultrahigh dimensional data
    Jiujing Wu
    Hengjian Cui
    Statistical Papers, 2024, 65 (9) : 5903 - 5930
  • [34] A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data
    Xue, Jingnan
    Liang, Faming
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (04) : 803 - 813
  • [35] Scalable Model-Free Feature Screening via Sliced-Wasserstein Dependency
    Li, Tao
    Yu, Jun
    Meng, Cheng
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (04) : 1501 - 1511
  • [36] A Model-Free Feature Selection Technique of Feature Screening and Random Forest-Based Recursive Feature Elimination
    Xia, Siwei
    Yang, Yuehan
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [37] Model-free data screening and cleaning
    Tarter, Michael E.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (02): : 168 - 176
  • [38] Model-free variable selection for conditional mean in regression
    Dong, Yuexiao
    Yu, Zhou
    Zhu, Liping
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 152 (152)
  • [39] Model-free feature screening via distance correlation for ultrahigh dimensional survival data
    Jing Zhang
    Yanyan Liu
    Hengjian Cui
    Statistical Papers, 2021, 62 : 2711 - 2738
  • [40] Model-free feature screening via distance correlation for ultrahigh dimensional survival data
    Zhang, Jing
    Liu, Yanyan
    Cui, Hengjian
    STATISTICAL PAPERS, 2021, 62 (06) : 2711 - 2738