Directional false discovery rate control in large-scale multiple comparisons

被引:1
|
作者
Liang, Wenjuan [1 ,2 ]
Xiang, Dongdong [1 ]
Mei, Yajun [3 ]
Li, Wendong [1 ,4 ]
机构
[1] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai, Peoples R China
[2] Huangshan Univ, Sch Math & Stat, Huangshan, Peoples R China
[3] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA USA
[4] 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Gene expression; multiple testing; marginal FDR; separate control; data-driven; NULL; HYPOTHESES;
D O I
10.1080/02664763.2024.2344260
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The advance of high-throughput biomedical technology makes it possible to access massive measurements of gene expression levels. An important statistical issue is identifying both under-expressed and over-expressed genes for a disease. Most existing multiple-testing procedures focus on selecting only the non-null or significant genes without further identifying their expression type. Only limited methods are designed for the directional problem, and yet they fail to separately control the numbers of falsely discovered over-expressed and under-expressed genes with only a unified index combining all the false discoveries. In this paper, based on a three-classification multiple testing framework, we propose a practical data-driven procedure to control separately the two directions of false discoveries. The proposed procedure is theoretically valid and optimal in the sense that it maximizes the expected number of true discoveries while controlling the false discovery rates for under-expressed and over-expressed genes simultaneously. The procedure allows different nominal levels for the two directions, exhibiting high flexibility in practice. Extensive numerical results and analysis of two large-scale genomic datasets show the effectiveness of our procedure.
引用
收藏
页码:3195 / 3214
页数:20
相关论文
共 50 条
  • [1] Weighted False Discovery Rate Control in Large-Scale Multiple Testing
    Basu, Pallavi
    Cai, T. Tony
    Das, Kiranmoy
    Sun, Wenguang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (523) : 1172 - 1183
  • [2] Optimal Control of Directional False Discovery Rates in Large-Scale Testing
    Tang, Guozhu
    Kang, Yicheng
    Xiang, Dongdong
    STATISTICS IN MEDICINE, 2025, 44 (05)
  • [3] False discovery control in large-scale spatial multiple testing
    Sun, Wenguang
    Reich, Brian J.
    Cai, T. Tony
    Guindani, Michele
    Schwartzman, Armin
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2015, 77 (01) : 59 - 83
  • [4] Split Knockoffs for Multiple Comparisons: Controlling the Directional False Discovery Rate
    Cao, Yang
    Sun, Xinwei
    Yao, Yuan
    arXiv, 2023,
  • [5] Split Knockoffs for Multiple Comparisons: Controlling the Directional False Discovery Rate
    Cao, Yang
    Sun, Xinwei
    Yao, Yuan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (548) : 2822 - 2832
  • [6] False Discovery Rate Control for Fast Screening of Large-Scale Genomics Biobanks
    Machkour, Jasin
    Muma, Michael
    Palomar, Daniel P.
    2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 666 - 670
  • [7] Transfer Learning in Large-Scale Gaussian Graphical Models with False Discovery Rate Control
    Li, Sai
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 2171 - 2183
  • [8] OPTIMAL FALSE DISCOVERY RATE CONTROL FOR LARGE SCALE MULTIPLE TESTING WITH AUXILIARY INFORMATION
    Cao, Hongyuan
    Chen, Jun
    Zhang, Xianyang
    ANNALS OF STATISTICS, 2022, 50 (02): : 807 - 857
  • [9] Mixed directional false discovery rate control in multiple pairwise comparisons using weighted p-values
    Zhao, Haibing
    Peddada, Shyamal D.
    Cui, Xinping
    BIOMETRICAL JOURNAL, 2015, 57 (01) : 144 - 158
  • [10] Simultaneous control of all false discovery proportions in large-scale multiple hypothesis testing
    Goeman, Jelle
    Meijer, Rosa
    Krebs, Thijmen
    Solari, Aldo
    BIOMETRIKA, 2019, 106 (04) : 841 - 856