Derandomised knockoffs: leveraging e-values for false discovery rate control

被引:11
|
作者
Ren, Zhimei [1 ]
Barber, Rina Foygel [2 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[2] Univ Chicago, Dept Stat, Chicago, IL USA
基金
美国国家科学基金会;
关键词
false discovery rate; knockoffs; multiple hypothesis testing; stability; variable selection;
D O I
10.1093/jrsssb/qkad085
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X knockoffs on the same dataset often result in different sets of selected variables, which is undesirable in practice. In this article, we introduce a methodology for derandomising model-X knockoffs with provable FDR control. The key insight of our proposed method lies in the discovery that the knockoffs procedure is in essence an e-BH procedure. We make use of this connection and derandomise model-X knockoffs by aggregating the e-values resulting from multiple knockoff realisations. We prove that the derandomised procedure controls the FDR at the desired level, without any additional conditions (in contrast, previously proposed methods for derandomisation are not able to guarantee FDR control). The proposed method is evaluated with numerical experiments, where we find that the derandomised procedure achieves comparable power and dramatically decreased selection variability when compared with model-X knockoffs.
引用
收藏
页码:122 / 154
页数:33
相关论文
共 50 条
  • [41] Importance of presenting the variability of the false discovery rate control
    Lin, Yi-Ting
    Lee, Wen-Chung
    BMC GENETICS, 2015, 16
  • [42] A Fuzzy Permutation Method for False Discovery Rate Control
    Yang, Ya-Hui
    Lin, Wan-Yu
    Lee, Wen-Chung
    SCIENTIFIC REPORTS, 2016, 6
  • [43] ADAPTIVE FALSE DISCOVERY RATE CONTROL FOR HETEROGENEOUS DATA
    Habiger, Joshua D.
    STATISTICA SINICA, 2017, 27 (04) : 1731 - 1756
  • [44] Wavelet thresholding with Bayesian false discovery rate control
    Tadesse, MG
    Ibrahim, JG
    Vannucci, M
    Gentleman, R
    BIOMETRICS, 2005, 61 (01) : 25 - 35
  • [45] Testing Jumps via False Discovery Rate Control
    Yen, Yu-Min
    PLOS ONE, 2013, 8 (04):
  • [46] Dynamic adaptive procedures that control the false discovery rate
    MacDonald, Peter W.
    Liang, Kun
    Janssen, Arnold
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02): : 3009 - 3024
  • [47] Cellwise outlier detection with false discovery rate control
    Liu, Yanhong
    Ren, Haojie
    Guo, Xu
    Zhou, Qin
    Zou, Changliang
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (03): : 951 - 971
  • [48] Adaptive False Discovery Rate Control with Privacy Guarantee
    Xia, Xintao
    Cai, Zhanrui
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [49] False discovery rate control in cancer biomarker selection
    Li, Zhaoming
    GENES & DISEASES, 2023, 10 (04) : 1141 - 1142
  • [50] Importance of presenting the variability of the false discovery rate control
    Yi-Ting Lin
    Wen-Chung Lee
    BMC Genetics, 16