Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale

被引:0
|
作者
Xihao Li
Zilin Li
Hufeng Zhou
Sheila M. Gaynor
Yaowu Liu
Han Chen
Ryan Sun
Rounak Dey
Donna K. Arnett
Stella Aslibekyan
Christie M. Ballantyne
Lawrence F. Bielak
John Blangero
Eric Boerwinkle
Donald W. Bowden
Jai G. Broome
Matthew P. Conomos
Adolfo Correa
L. Adrienne Cupples
Joanne E. Curran
Barry I. Freedman
Xiuqing Guo
George Hindy
Marguerite R. Irvin
Sharon L. R. Kardia
Sekar Kathiresan
Alyna T. Khan
Charles L. Kooperberg
Cathy C. Laurie
X. Shirley Liu
Michael C. Mahaney
Ani W. Manichaikul
Lisa W. Martin
Rasika A. Mathias
Stephen T. McGarvey
Braxton D. Mitchell
May E. Montasser
Jill E. Moore
Alanna C. Morrison
Jeffrey R. O’Connell
Nicholette D. Palmer
Akhil Pampana
Juan M. Peralta
Patricia A. Peyser
Bruce M. Psaty
Susan Redline
Kenneth M. Rice
Stephen S. Rich
Jennifer A. Smith
Hemant K. Tiwari
机构
[1] Harvard T.H. Chan School of Public Health,Department of Biostatistics
[2] Southwestern University of Finance and Economics,School of Statistics
[3] The University of Texas Health Science Center at Houston,Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health
[4] The University of Texas Health Science Center at Houston,Center for Precision Health, School of Public Health and School of Biomedical Informatics
[5] University of Texas MD Anderson Cancer Center,Department of Biostatistics
[6] College of Public Health,Department of Epidemiology
[7] University of Kentucky,Department of Medicine
[8] University of Alabama at Birmingham,Department of Epidemiology, School of Public Health
[9] Baylor College of Medicine,Human Genome Sequencing Center
[10] University of Michigan,Department of Biochemistry
[11] Department of Human Genetics and South Texas Diabetes and Obesity Institute,Division of Medical Genetics
[12] School of Medicine,Department of Biostatistics
[13] The University of Texas Rio Grande Valley,Jackson Heart Study, Department of Medicine
[14] Baylor College of Medicine,Department of Biostatistics
[15] Wake Forest University School of Medicine,Framingham Heart Study
[16] University of Washington,Department of Internal Medicine, Nephrology
[17] University of Washington,Department of Population Medicine
[18] University of Mississippi Medical Center,Cardiology Division
[19] Boston University School of Public Health,Department of Medicine
[20] National Heart,Division of Public Health Sciences
[21] Lung,Department of Data Sciences
[22] and Blood Institute and Boston University,Department of Statistics
[23] Wake Forest School of Medicine,Center for Public Health Genomics
[24] The Institute for Translational Genomics and Population Sciences,Division of Cardiology
[25] Department of Pediatrics,GeneSTAR Research Program, Department of Medicine
[26] The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center,Department of Epidemiology
[27] Qatar University College of Medicine,Department of Medicine
[28] QU Health,Geriatrics Research and Education Clinical Center
[29] Verve Therapeutics,Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine
[30] Massachusetts General Hospital,Program in Bioinformatics and Integrative Biology
[31] Harvard Medical School,Program in Medical and Population Genetics
[32] Fred Hutchinson Cancer Research Center,Center for Genomic Medicine and Cardiovascular Research Center
[33] Dana–Farber Cancer Institute and Harvard T.H. Chan School of Public Health,Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services
[34] Harvard University,Division of Sleep and Circadian Disorders
[35] University of Virginia,Division of Sleep Medicine
[36] George Washington School of Medicine and Health Sciences,Division of Pulmonary, Critical Care, and Sleep Medicine
[37] Johns Hopkins University School of Medicine,Survey Research Center
[38] International Health Institute,Department of Biostatistics, School of Public Health
[39] Department of Anthropology,Department of Laboratory Medicine and Pathology
[40] Brown University,Department of Medicine
[41] University of Maryland School of Medicine,Department of Human Genetics and Biostatistics
[42] Baltimore VA Medical Center,Department of Physiology and Biophysics
[43] University of Maryland School of Medicine,Division of Cardiology
[44] University of Massachusetts Medical School,Stanley Center for Psychiatric Research
[45] Broad Institute of Harvard and MIT,Analytic and Translational Genetics Unit
[46] Massachusetts General Hospital,Division of Genetics
[47] University of Washington,Department of Biomedical Informatics
[48] Kaiser Permanente Washington Health Research Institute,Department of Biostatistics
[49] Brigham and Women’s Hospital,Department of Internal Medicine
[50] Harvard Medical School,Department of Computational Medicine and Bioinformatics
来源
Nature Genetics | 2020年 / 52卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Large-scale whole-genome sequencing studies have enabled the analysis of rare variants (RVs) associated with complex phenotypes. Commonly used RV association tests have limited scope to leverage variant functions. We propose STAAR (variant-set test for association using annotation information), a scalable and powerful RV association test method that effectively incorporates both variant categories and multiple complementary annotations using a dynamic weighting scheme. For the latter, we introduce ‘annotation principal components’, multidimensional summaries of in silico variant annotations. STAAR accounts for population structure and relatedness and is scalable for analyzing very large cohort and biobank whole-genome sequencing studies of continuous and dichotomous traits. We applied STAAR to identify RVs associated with four lipid traits in 12,316 discovery and 17,822 replication samples from the Trans-Omics for Precision Medicine Program. We discovered and replicated new RV associations, including disruptive missense RVs of NPC1L1 and an intergenic region near APOC1P1 associated with low-density lipoprotein cholesterol.
引用
收藏
页码:969 / 983
页数:14
相关论文
共 41 条
  • [21] FunSPU: A versatile and adaptive multiple functional annotation-based association test of whole-genome sequencing data
    Ma, Yiding
    Wei, Peng
    PLOS GENETICS, 2019, 15 (04):
  • [22] Rainbow: a tool for large-scale whole-genome sequencing data analysis using cloud computing
    Shanrong Zhao
    Kurt Prenger
    Lance Smith
    Thomas Messina
    Hongtao Fan
    Edward Jaeger
    Susan Stephens
    BMC Genomics, 14
  • [23] Analysis of large-scale biobanks and whole genome sequencing studies: Challenges and opportunities
    Lin, Xihong
    GENETIC EPIDEMIOLOGY, 2020, 44 (05) : 501 - 502
  • [24] Population-Wide Whole-Genome Sequencing in an Isolated Cohort Reveals Rare Variant Burdens Associated with Multiple Quantitative Traits
    Gilly, Arthur
    GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 654 - 654
  • [25] Large-scale whole-exome sequencing association studies identify rare functional variants influencing serum urate levels
    Tin, Adrienne
    Li, Yong
    Brody, Jennifer A.
    Nutile, Teresa
    Chu, Audrey Y.
    Huffman, Jennifer E.
    Yang, Qiong
    Chen, Ming-Huei
    Robinson-Cohen, Cassianne
    Mace, Aurelien
    Liu, Jun
    Demirkan, Ayse
    Sorice, Rossella
    Sedaghat, Sanaz
    Swen, Melody
    Yu, Bing
    Ghasemi, Sahar
    Teumer, Alexanda
    Vollenweider, Peter
    Ciullo, Marina
    Li, Meng
    Uitterlinden, Andre G.
    Kraaij, Robert
    Amin, Najaf
    van Rooij, Jeroen
    Kutalik, Zoltan
    Dehghan, Abbas
    McKnight, Barbara
    van Duijn, Cornelia M.
    Morrison, Alanna
    Psaty, Bruce M.
    Boerwinkle, Eric
    Fox, Caroline S.
    Woodward, Owen M.
    Koettgen, Anna
    NATURE COMMUNICATIONS, 2018, 9
  • [26] Large-scale whole-exome sequencing association studies identify rare functional variants influencing serum urate levels
    Adrienne Tin
    Yong Li
    Jennifer A. Brody
    Teresa Nutile
    Audrey Y. Chu
    Jennifer E. Huffman
    Qiong Yang
    Ming-Huei Chen
    Cassianne Robinson-Cohen
    Aurélien Macé
    Jun Liu
    Ayşe Demirkan
    Rossella Sorice
    Sanaz Sedaghat
    Melody Swen
    Bing Yu
    Sahar Ghasemi
    Alexanda Teumer
    Peter Vollenweider
    Marina Ciullo
    Meng Li
    André G. Uitterlinden
    Robert Kraaij
    Najaf Amin
    Jeroen van Rooij
    Zoltán Kutalik
    Abbas Dehghan
    Barbara McKnight
    Cornelia M. van Duijn
    Alanna Morrison
    Bruce M. Psaty
    Eric Boerwinkle
    Caroline S. Fox
    Owen M. Woodward
    Anna Köttgen
    Nature Communications, 9
  • [27] Whole-genome sequencing analysis of a rare Salmonella diarizonae clinical strain carrying multiple plasmids and novel gene cassettes
    Liu, Zhonghua
    Zhao, Ying
    Zhao, Weiwei
    Fei, Xiaoyuan
    Zhai, Pingping
    Bi, Wanglai
    Li, Rui
    JOURNAL OF GLOBAL ANTIMICROBIAL RESISTANCE, 2022, 29 : 339 - 342
  • [28] Enrichment Analysis Informs Rare Variant Association Tests of Type 2 Diabetes and Glycemic Traits in CHARGE Whole-Genome Sequence
    Lent, Samantha
    Manning, Alisa
    Wessel, Jennifer
    Dupuis, Josee
    Meigs, James B.
    DIABETES, 2017, 66 : A478 - A478
  • [29] Rare Variants Association Analysis in Large-Scale Sequencing Studies at the Single Locus Level
    Jeng, Xinge Jessie
    Daye, Zhongyin John
    Lu, Wenbin
    Tzeng, Jung-Ying
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (06)
  • [30] An efficient large-scale whole-genome sequencing analyses practice with an average daily analysis of 100Tbp: ZBOLT
    Li, Zhichao
    Xie, Yinlong
    Zeng, Wenjun
    Huang, Yushan
    Gu, Shengchang
    Gao, Ya
    Huang, Weihua
    Lu, Lihua
    Wang, Xiaohong
    Wu, Jiasheng
    Yin, Xiaoxu
    Zhu, Rongyi
    Huang, Guodong
    Lu, Lin
    Tang, Jingbo
    Zheng, Yunping
    Liu, Quan
    Zhou, Xianqiang
    Shan, Riqiang
    Wang, Bo
    Fang, Mingyan
    Jin, Xin
    CLINICAL AND TRANSLATIONAL DISCOVERY, 2023, 3 (06):