Association Analysis and Meta-Analysis of Multi-Allelic Variants for Large-Scale Sequence Data

被引:5
|
作者
Jiang, Yu [1 ]
Chen, Sai [2 ]
Wang, Xingyan [1 ]
Liu, Mengzhen [3 ]
Iacono, William G. [4 ]
Hewitt, John K. [5 ]
Hokanson, John E. [6 ]
Krauter, Kenneth [5 ]
Laakso, Markku [7 ,8 ]
Li, Kevin W. [9 ]
Lutz, Sharon M. [10 ]
McGue, Matthew [3 ]
Pandit, Anita [9 ]
Zajac, Gregory J. M. [9 ]
Boehnke, Michael [9 ]
Abecasis, Goncalo R. [9 ]
Vrieze, Scott, I [3 ]
Jiang, Bibo [1 ]
Zhan, Xiaowei [11 ]
Liu, Dajiang J. [1 ]
机构
[1] Penn State Coll Med, Dept Publ Hlth Sci, Hershey, PA 17033 USA
[2] Illumina Inc, 5200 Illuminay Way, San Diego, CA 92122 USA
[3] Univ Minnesota, Dept Psychol, Minneapolis, MN 55454 USA
[4] Univ Minnesota, Dept Psychiat, Minneapolis, MN 55454 USA
[5] Univ Colorado Boulder, Inst Behav Genet, Aurora, CO 80045 USA
[6] Univ Colorado Denver, Sch Publ Hlth, Dept Epidemiol, Aurora, CO 80045 USA
[7] Univ Eastern Finland, Dept Med, Kuopio 70211, Finland
[8] Kuopio Univ Hosp, Kuopio 70211, Finland
[9] Univ Michigan, Ctr Stat Genet, Dept Biostat, Ann Arbor, MI 48109 USA
[10] Univ Colorado, Dept Biostat & Informat, Anschutz Med Campus, Aurora, CO 80045 USA
[11] Univ Texas Southwestern Med Ctr Dallas, Quantitat Biomed Res Ctr, Dept Clin Sci, Dallas, TX 75390 USA
关键词
multi-allelic variants; GWAS; meta-analysis; smoking; RARE VARIANTS; GENOTYPE IMPUTATION; GENERAL FRAMEWORK; PROTEIN; RISK; TOOL;
D O I
10.3390/genes11050586
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, similar to 10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misleading association results. We discuss practical issues and methods to encode multi-allelic sites, conduct single-variant and gene-level association analyses, and perform meta-analysis for multi-allelic variants. We evaluated these methods through extensive simulations and the study of a large meta-analysis of similar to 18,000 samples on the cigarettes-per-day phenotype. We showed that our joint modeling approach provided an unbiased estimate of genetic effects, greatly improved the power of single-variant association tests among methods that can properly estimate allele effects, and enhanced gene-level tests over existing approaches. Software packages implementing these methods are available online.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] ANALYSIS OF MULTI-ALLELIC DATA
    WATTERSON, GA
    GENETICS, 1978, 88 (01) : 171 - 179
  • [2] FURTHER ANALYSIS OF MULTI-ALLELIC DATA
    WATTERSON, GA
    ANDERSON, R
    GENETICS, 1978, 90 (01) : 207 - 210
  • [3] FURTHER ANALYSIS OF MULTI-ALLELIC DATA - REPLY
    COYNE, JA
    FELTON, AA
    GENETICS, 1978, 90 (01) : 210 - 211
  • [5] LARGE-SCALE MULTI-ANCESTRY GENOME-WIDE ASSOCIATION META-ANALYSIS OF MAJOR DEPRESSION
    Meng, Xiangrui
    Giannakopoulou, Olga
    Navoly, Georgina
    Koller, Dora
    Levey, Daniel
    Koen, Nastassja
    Loos, Ruth J. F.
    Davis, Lea
    Martin, Nick
    Walters, Robin
    Polimanti, Renato
    Stein, Murray
    Gelernter, Joel
    Kuchenbaecker, Karoline
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2022, 63 : E49 - E49
  • [6] A large-scale meta-analysis to refine colorectal cancer risk estimates associated with MUTYH variants
    E Theodoratou
    H Campbell
    A Tenesa
    R Houlston
    E Webb
    S Lubbe
    P Broderick
    S Gallinger
    E M Croitoru
    M A Jenkins
    A K Win
    S P Cleary
    T Koessler
    P D Pharoah
    S Küry
    S Bézieau
    B Buecher
    N A Ellis
    P Peterlongo
    K Offit
    L A Aaltonen
    S Enholm
    A Lindblom
    X-L Zhou
    I P Tomlinson
    V Moreno
    I Blanco
    G Capellà
    R Barnetson
    M E Porteous
    M G Dunlop
    S M Farrington
    British Journal of Cancer, 2010, 103 : 1875 - 1884
  • [7] A large-scale meta-analysis to refine colorectal cancer risk estimates associated with MUTYH variants
    Theodoratou, E.
    Campbell, H.
    Tenesa, A.
    Houlston, R.
    Webb, E.
    Lubbe, S.
    Broderick, P.
    Gallinger, S.
    Croitoru, E. M.
    Jenkins, M. A.
    Win, A. K.
    Cleary, S. P.
    Koessler, T.
    Pharoah, P. D.
    Kuery, S.
    Bezieau, S.
    Buecher, B.
    Ellis, N. A.
    Peterlongo, P.
    Offit, K.
    Aaltonen, L. A.
    Enholm, S.
    Lindblom, A.
    Zhou, X-L
    Tomlinson, I. P.
    Moreno, V.
    Blanco, I.
    Capella, G.
    Barnetson, R.
    Porteous, M. E.
    Dunlop, M. G.
    Farrington, S. M.
    BRITISH JOURNAL OF CANCER, 2010, 103 (12) : 1875 - 1884
  • [8] Variant Association Tools for Quality Control and Analysis of Large-Scale Sequence and Genotyping Array Data
    Wang, Gao T.
    Peng, Bo
    Leal, Suzanne M.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 94 (05) : 770 - 783
  • [9] Genetic variants in promoter regions associated with type 2 diabetes mellitus: A large-scale meta-analysis and subgroup analysis
    Wu, Ling
    Wang, Chi Chiu
    JOURNAL OF CELLULAR BIOCHEMISTRY, 2019, 120 (08) : 13012 - 13025
  • [10] A large-scale genome-wide association study meta-analysis of cannabis use disorder
    Johnson, Emma C.
    Demontis, Ditte
    Thorgeirsson, Thorgeir E.
    Walters, Raymond K.
    Polimanti, Renato
    Hatoum, Alexander S.
    Sanchez-Roige, Sandra
    Paul, Sarah E.
    Wendt, Frank R.
    Clarke, Toni-Kim
    Lai, Dongbing
    Reginsson, Gunnar W.
    Zhou, Hang
    He, June
    Baranger, David A. A.
    Gudbjartsson, Daniel F.
    Wedow, Robbee
    Adkins, Daniel E.
    Adkins, Amy E.
    Alexander, Jeffry
    Bacanu, Silviu-Alin
    Bigdeli, Tim B.
    Boden, Joseph
    Brown, Sandra A.
    Bucholz, Kathleen K.
    Bybjerg-Grauholm, Jonas
    Corley, Robin P.
    Degenhardt, Louisa
    Dick, Danielle M.
    Domingue, Benjamin W.
    Fox, Louis
    Goate, Alison M.
    Gordon, Scott D.
    Hack, Laura M.
    Hancock, Dana B.
    Hartz, Sarah M.
    Hickie, Ian B.
    Hougaard, David M.
    Krauter, Kenneth
    Lind, Penelope A.
    McClintick, Jeanette N.
    McQueen, Matthew B.
    Meyers, Jacquelyn L.
    Montgomery, Grant W.
    Mors, Ole
    Mortensen, Preben B.
    Nordentoft, Merete
    Pearson, John F.
    Peterson, Roseann E.
    Reynolds, Maureen D.
    LANCET PSYCHIATRY, 2020, 7 (12): : 1032 - 1045