Derandomised knockoffs: leveraging e-values for false discovery rate control

被引:11
|
作者
Ren, Zhimei [1 ]
Barber, Rina Foygel [2 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[2] Univ Chicago, Dept Stat, Chicago, IL USA
基金
美国国家科学基金会;
关键词
false discovery rate; knockoffs; multiple hypothesis testing; stability; variable selection;
D O I
10.1093/jrsssb/qkad085
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X knockoffs on the same dataset often result in different sets of selected variables, which is undesirable in practice. In this article, we introduce a methodology for derandomising model-X knockoffs with provable FDR control. The key insight of our proposed method lies in the discovery that the knockoffs procedure is in essence an e-BH procedure. We make use of this connection and derandomise model-X knockoffs by aggregating the e-values resulting from multiple knockoff realisations. We prove that the derandomised procedure controls the FDR at the desired level, without any additional conditions (in contrast, previously proposed methods for derandomisation are not able to guarantee FDR control). The proposed method is evaluated with numerical experiments, where we find that the derandomised procedure achieves comparable power and dramatically decreased selection variability when compared with model-X knockoffs.
引用
收藏
页码:122 / 154
页数:33
相关论文
共 50 条
  • [31] Optimal false discovery rate control for dependent data
    Xie, Jichun
    Cai, T. Tony
    Maris, John
    Li, Hongzhe
    STATISTICS AND ITS INTERFACE, 2011, 4 (04) : 417 - 430
  • [32] PAPRIKA: Private Online False Discovery Rate Control
    Zhang, Wanrong
    Kamath, Gautam
    Cummings, Rachel
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [33] Sequential selection procedures and false discovery rate control
    G'Sell, Max Grazier
    Wager, Stefan
    Chouldechova, Alexandra
    Tibshirani, Robert
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2016, 78 (02) : 423 - 444
  • [34] False discovery rate control via debiased lasso
    Javanmard, Adel
    Javadi, Hamid
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (01): : 1212 - 1253
  • [35] Peak p-values and false discovery rate inference in neuroimaging
    Schwartzman, Armin
    Telschow, Fabian
    NEUROIMAGE, 2019, 197 : 402 - 413
  • [36] False Discovery Rate Control via Data Splitting
    Dai, Chenguang
    Lin, Buyu
    Xing, Xin
    Liu, Jun S.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (544) : 2503 - 2520
  • [37] On control of the false discovery rate under no assumption of dependency
    Guo, Wenge
    Rao, M. Bhaskara
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (10) : 3176 - 3188
  • [38] Online control of the false discovery rate with decaying memory
    Ramdas, Aaditya
    Yang, Fanny
    Wainwright, Martin J.
    Jordan, Michael, I
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [39] Conformal link prediction for false discovery rate control
    Marandon, Ariane
    TEST, 2024, 33 (04) : 1062 - 1083
  • [40] False discovery rate control under Archimedean copula
    Bodnar, Taras
    Dickhaus, Thorsten
    ELECTRONIC JOURNAL OF STATISTICS, 2014, 8 : 2207 - 2241