Inflated expectations: Rare-variant association analysis using public controls

Cited by: 3
Authors
Kim, Jung [1]
Karyadi, Danielle M. [1]
Hartley, Stephen W. [1]
Zhu, Bin [1]
Wang, Mingyi [2,3]
Wu, Dongjing [2,3]
Song, Lei [1]
Armstrong, Gregory T. [4]
Bhatia, Smita [5]
Robison, Leslie L. [4]
Yasui, Yutaka [4]
Carter, Brian [6]
Sampson, Joshua N. [1]
Freedman, Neal D. [1]
Goldstein, Alisa M. [1]
Mirabello, Lisa [1]
Chanock, Stephen J. [1]
Morton, Lindsay M. [1]
Savage, Sharon A. [1]
Stewart, Douglas R. [1]
Affiliations
[1] NCI, Div Canc Epidemiol & Genet, Rockville, MD 20850 USA
[2] NCI, Div Canc Epidemiol & Genet, Canc Genom Res Lab, Rockville, MD USA
[3] Frederick Natl Lab Canc Res, Leidos Biomed Res Inc, Frederick, MD USA
[4] St Jude Childrens Res Hosp, Dept Epidemiol & Canc Control, Memphis, TN USA
[5] Univ Alabama Birmingham, Inst Canc Outcomes & Survivorship, Birmingham, AL USA
[6] Amer Canc Soc, Dept Populat Sci, Atlanta, GA USA
Source
PLOS ONE | 2023 / Vol. 18 / Issue 01
Keywords
DESIGN;
DOI
10.1371/journal.pone.0280951
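Chinese Library Classification (CLC) codes
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification codes
07; 0710; 09;
Abstract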
The use of publicly available sequencing datasets as controls (hereafter, "public controls") in studies of rare-variant disease associations has great promise but can increase the risk of false-positive discovery. The specific factors that could inflate the distribution of test statistics have not been systematically examined. Here, we used both public controls (gnomAD v2.1) and several datasets sequenced in our laboratory to systematically investigate factors that could contribute to false-positive discovery, as measured by λ_Δ95, a metric that quantifies the degree of inflation in statistical significance. Analyses of the datasets in this investigation found that 1) the significant inflation of test statistics decreased substantially when the same variant caller and filtering pipelines were employed, 2) differences in library prep kits and sequencers did not affect the false-positive discovery rate, and 3) joint vs. separate variant calling of cases and controls did not contribute to the inflation of test statistics. Currently available methods do not adequately adjust for the high false-positive discovery rate. These results, especially if replicated, emphasize the risks of using public controls for rare-variant association tests when individual-level data and the computational pipeline are not readily accessible, which prevents the use of the same variant-calling and filtering pipelines on both cases and controls. The emergence of cloud-based computing offers a plausible solution: bringing containerized analytical pipelines to the data (rather than the data to the pipeline) could avert or minimize these issues. We suggest that future reports account for this issue and state it as a limitation when reporting new findings from studies that cannot practically analyze all data on a single pipeline.
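The abstract reports inflation through λ_Δ95 but this record does not define it, so the sketch below does not attempt to reproduce that metric. As a stand-in, it shows the conventional median-based genomic inflation factor, which captures the same general idea of comparing observed test statistics with their null expectation. The function names `p_to_chi2` and `genomic_inflation` are hypothetical, and the assumption that each test yields a two-sided p-value convertible to a 1-df chi-square statistic is ours, not the authors'.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()

# Median of a chi-square distribution with 1 degree of freedom (about 0.455),
# obtained by squaring the lower-quartile z-score of the standard normal.
CHI2_1DF_MEDIAN = _STD_NORMAL.inv_cdf(0.25) ** 2


def p_to_chi2(p):
    """Convert a two-sided p-value to the equivalent 1-df chi-square statistic."""
    z = _STD_NORMAL.inv_cdf(p / 2.0)
    return z * z


def genomic_inflation(p_values):
    """Median-based genomic inflation factor (a stand-in, not lambda_Delta95).

    Values near 1 indicate test statistics that follow the null distribution;
    values well above 1 indicate inflation.
    """
    stats = [p_to_chi2(p) for p in p_values if 0.0 < p < 1.0]
    if not stats:
        raise ValueError("need at least one p-value strictly between 0 and 1")
    return median(stats) / CHI2_1DF_MEDIAN


if __name__ == "__main__":
    # Evenly spaced p-values mimic a well-calibrated null distribution.
    null_like = [(i + 0.5) / 1001 for i in range(1001)]
    # Squaring shifts every p-value toward zero, mimicking inflation.
    inflated = [p * p for p in null_like]
    print(genomic_inflation(null_like))
    print(genomic_inflation(inflated))
```

In a comparison like the one the abstract describes, such a factor would be computed once per analysis configuration (for example, matched versus mismatched variant-calling pipelines) and the values compared across configurations.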
Pages: 13