Derandomised knockoffs: leveraging e-values for false discovery rate control

Cited by: 11
Authors
Ren, Zhimei [1]
Barber, Rina Foygel [2]
Affiliations
[1] Univ Penn, Wharton Sch, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[2] Univ Chicago, Dept Stat, Chicago, IL USA
Funding
National Science Foundation (USA);
Keywords
false discovery rate; knockoffs; multiple hypothesis testing; stability; variable selection;
DOI
10.1093/jrsssb/qkad085
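Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract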
Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X knockoffs on the same dataset often result in different sets of selected variables, which is undesirable in practice. In this article, we introduce a methodology for derandomising model-X knockoffs with provable FDR control. The key insight of our proposed method lies in the discovery that the knockoffs procedure is in essence an e-BH procedure. We make use of this connection and derandomise model-X knockoffs by aggregating the e-values resulting from multiple knockoff realisations. We prove that the derandomised procedure controls the FDR at the desired level, without any additional conditions (in contrast, previously proposed methods for derandomisation are not able to guarantee FDR control). The proposed method is evaluated with numerical experiments, where we find that the derandomised procedure achieves comparable power and dramatically decreased selection variability when compared with model-X knockoffs.
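The aggregation step described in the abstract can be illustrated with a minimal sketch, assuming the standard e-BH rule (reject the k* hypotheses with the largest e-values, where k* is the largest k such that the k-th largest e-value is at least m / (alpha * k)) and simple averaging of e-values across runs. The function names `ebh` and `aggregate_e_values` are hypothetical, and the construction of the per-run knockoff e-values is not shown here; this is not the authors' implementation.

```python
import numpy as np


def ebh(e_values, alpha):
    """Sketch of the e-BH rule: return sorted indices of rejected hypotheses.

    Assumes the standard rule: k* = max{k : e_(k) >= m / (alpha * k)},
    where e_(k) is the k-th largest e-value among m hypotheses.
    """
    e = np.asarray(e_values, dtype=float)
    m = e.size
    order = np.argsort(-e)              # indices sorted by decreasing e-value
    sorted_e = e[order]
    ks = np.arange(1, m + 1)
    passing = np.nonzero(sorted_e >= m / (alpha * ks))[0]
    if passing.size == 0:
        return np.array([], dtype=int)  # no rejections
    k_star = passing[-1] + 1            # largest k that meets the threshold
    return np.sort(order[:k_star])


def aggregate_e_values(e_matrix):
    """Average e-values over runs (rows = runs, columns = hypotheses).

    An average of e-values is again an e-value, which is what makes
    this kind of aggregation compatible with e-BH.
    """
    return np.asarray(e_matrix, dtype=float).mean(axis=0)


# Illustrative, made-up e-values from three hypothetical knockoff runs.
e_runs = [
    [40.0, 0.0, 20.0, 0.0],
    [40.0, 0.0, 0.0, 0.0],
    [40.0, 0.0, 20.0, 5.0],
]
selected = ebh(aggregate_e_values(e_runs), alpha=0.2)
```

Because the rule is applied once to the averaged e-values, a variable that is selected in only some runs contributes a smaller average and is less likely to flip in and out of the final selection set.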
Pages: 122 - 154
Number of pages: 33
Related papers
50 in total
  • [11] Split Knockoffs for Multiple Comparisons: Controlling the Directional False Discovery Rate
    Cao, Yang
    Sun, Xinwei
    Yao, Yuan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 2822 - 2832
  • [12] False discovery rate control with multivariate p-values
    Chi, Zhiyi
    ELECTRONIC JOURNAL OF STATISTICS, 2008, 2 : 368 - 411
  • [13] ONLINE RULES FOR CONTROL OF FALSE DISCOVERY RATE AND FALSE DISCOVERY EXCEEDANCE
    Javanmard, Adel
    Montanari, Andrea
    ANNALS OF STATISTICS, 2018, 46 (02) : 526 - 554
  • [14] False Discovery Rate Control With Groups
    Hu, James X.
    Zhao, Hongyu
    Zhou, Harrison H.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (491) : 1215 - 1227
  • [15] False discovery rate control for multiple testing based on discrete p-values
    Chen, Xiongzhi
    BIOMETRICAL JOURNAL, 2020, 62 (04) : 1060 - 1079
  • [16] Model-free Knockoffs for SLOPE-Adaptive Variable Selection with Controlled False Discovery Rate
    Humayoo, Mahammad
    Cheng, Xueqi
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 302 - 307
  • [17] Derandomized novelty detection with FDR control via conformal e-values
    Bashari, Meshi
    Epstein, Amir
    Romano, Yaniv
    Sesia, Matteo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [18] Distributed False Discovery Rate Control with Quantization
    Xiang, Yu
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 246 - 249
  • [19] Optimal weighting for false discovery rate control
    Roquain, Etienne
    van de Wiel, Mark A.
    ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 : 678 - 711
  • [20] Contextual Online False Discovery Rate Control
    Chen, Shiyun
    Kasiviswanathan, Shiva Prasad
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 952 - 960