Directional false discovery rate control in large-scale multiple comparisons

被引:1
|
作者
Liang, Wenjuan [1 ,2 ]
Xiang, Dongdong [1 ]
Mei, Yajun [3 ]
Li, Wendong [1 ,4 ]
机构
[1] East China Normal Univ, Sch Stat, KLATASDS MOE, Shanghai, Peoples R China
[2] Huangshan Univ, Sch Math & Stat, Huangshan, Peoples R China
[3] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA USA
[4] 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Gene expression; multiple testing; marginal FDR; separate control; data-driven; NULL; HYPOTHESES;
D O I
10.1080/02664763.2024.2344260
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The advance of high-throughput biomedical technology makes it possible to access massive measurements of gene expression levels. An important statistical issue is identifying both under-expressed and over-expressed genes for a disease. Most existing multiple-testing procedures focus on selecting only the non-null or significant genes without further identifying their expression type. Only limited methods are designed for the directional problem, and yet they fail to separately control the numbers of falsely discovered over-expressed and under-expressed genes with only a unified index combining all the false discoveries. In this paper, based on a three-classification multiple testing framework, we propose a practical data-driven procedure to control separately the two directions of false discoveries. The proposed procedure is theoretically valid and optimal in the sense that it maximizes the expected number of true discoveries while controlling the false discovery rates for under-expressed and over-expressed genes simultaneously. The procedure allows different nominal levels for the two directions, exhibiting high flexibility in practice. Extensive numerical results and analysis of two large-scale genomic datasets show the effectiveness of our procedure.
引用
收藏
页码:3195 / 3214
页数:20
相关论文
共 50 条
  • [31] A Bayesian false discovery rate for multiple testing
    Whittemore, Alice S.
    JOURNAL OF APPLIED STATISTICS, 2007, 34 (01) : 1 - 9
  • [32] Using false discovery rates for multiple comparisons in ecology and evolution
    Pike, Nathan
    METHODS IN ECOLOGY AND EVOLUTION, 2011, 2 (03): : 278 - 282
  • [33] Can we estimate the local false discovery rate in large scale association studies?
    Fischer, C
    Beckmann, L
    Obreiter, M
    Kindler-Röhrborn, A
    Chang-Claude, J
    GENETIC EPIDEMIOLOGY, 2005, 29 (03) : 247 - 248
  • [34] Screening-Assisted Dynamic Multiple Testing with False Discovery Rate Control
    MUSHTAQ Iram
    ZHOU Qin
    ZI Xuemin
    JournalofSystemsScience&Complexity, 2023, 36 (02) : 716 - 754
  • [35] QuTE: decentralized multiple testing on sensor networks with false discovery rate control
    Ramdas, Aaditya
    Chen, Jianbo
    Wainwright, Martin J.
    Jordan, Michael I.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [36] Screening-Assisted Dynamic Multiple Testing with False Discovery Rate Control
    Mushtaq, Iram
    Zhou, Qin
    Zi, Xuemin
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2023, 36 (02) : 716 - 754
  • [37] Screening-Assisted Dynamic Multiple Testing with False Discovery Rate Control
    Iram Mushtaq
    Qin Zhou
    Xuemin Zi
    Journal of Systems Science and Complexity, 2023, 36 : 716 - 754
  • [38] Large-scale modelling as a route to multiple surface comparisons of the CCP module family
    Soares, DC
    Gerloff, DL
    Syme, NR
    Coulson, AFW
    Parkinson, J
    Barlow, PN
    PROTEIN ENGINEERING DESIGN & SELECTION, 2005, 18 (08): : 379 - 388
  • [39] Distributed False Discovery Rate Control with Quantization
    Xiang, Yu
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 246 - 249
  • [40] Optimal weighting for false discovery rate control
    Roquain, Etienne
    van de Wiel, Mark A.
    ELECTRONIC JOURNAL OF STATISTICS, 2009, 3 : 678 - 711