A Data-Driven Method for Identifying Rare Variants with Heterogeneous Trait Effects

被引:13
|
作者
Zhang, Qunyuan [1 ]
Irvin, Marguerite R. [2 ]
Arnett, Donna K. [2 ]
Province, Michael A. [1 ]
Borecki, Ingrid [1 ]
机构
[1] Washington Univ, Sch Med, Div Stat Genom, St Louis, MO 63108 USA
[2] Univ Alabama, Dept Epidemiol, Birmingham, AL USA
关键词
rare variant; collapsing; heterogeneous effects; sum test; quantitative trait; COMMON DISEASES; PLASMA-LEVELS; ASSOCIATION; CHOLESTEROL; CONTRIBUTE; GENES;
D O I
10.1002/gepi.20618
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Collapsing multiple variants into one variable and testing their collective effect is a useful strategy for rare variant association analysis. Direct collapsing, however, is not valid or may significantly lose power when a group of variants to be collapsed have heterogeneous effects on target traits (i.e. some positive and some negative). This could be especially true for quantitative traits (such as blood pressure and body mass index), regardless of whether subjects are sampled randomly from a population or selectively from two extreme tails of the trait distribution. To deal with this problem, we propose a novel, data-driven method, the P-value Weighted Sum Test (PWST), which allows each variant to be individually weighted according to the evidence of association from the data itself. Specifically, both significance and direction of individual variant effects are used to calculate a single weighted sum score based on rescaled left-tail P-values from single-variant analysis, after which a permutation test of association is performed between the score and the trait. Our simulation under different sampling strategies shows that PWST significantly increases statistical power when there are heterogeneous variant effects. The appeal of the PWST approach is illustrated in an application to sequence data by detecting the collective effect of variants in the peroxisome proliferator-activated receptor alpha (PPAR alpha) gene on triglycerides (TG) response to fenofibrate treatment from 300 subjects in the Genetics of Lipid Lowering and Diet Network study. Genet. Epidemiol. 35:679-685, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:679 / 685
页数:7
相关论文
共 50 条
  • [1] Adjusting Family Relatedness in Data-driven Burden Test of Rare Variants
    Zhang, Qunyuan
    Wang, Lihua
    Koboldt, Dan
    Boreki, Ingrid B.
    Province, Michael A.
    GENETIC EPIDEMIOLOGY, 2014, 38 (08) : 722 - 727
  • [2] Data-driven platform for identifying variants of interest in COVID-19 virus
    Ramarao-Milne, Priya
    Jain, Yatish
    Sng, Letitia M. F.
    Hosking, Brendan
    Lee, Carol
    Bayat, Arash
    Kuiper, Michael
    Wilson, Laurence O. W.
    Twine, Natalie A.
    Bauer, Denis C.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2942 - 2950
  • [3] Data-driven method for identifying the expression of the Lyapunov exponent from random data
    Chen, Xi
    Jin, Xiaoling
    Huang, Zhilong
    INTERNATIONAL JOURNAL OF NON-LINEAR MECHANICS, 2023, 148
  • [4] A data-driven method for identifying conservation-relevant benthic habitats
    Ross, Rebecca E.
    Gonzalez-Mirelis, Genoveva
    Bakke, Gunnstein
    Dolan, Margaret F. J.
    Buhl-Mortensen, Pal
    ECOLOGICAL INDICATORS, 2023, 147
  • [5] A Data-driven Method for Identifying Intricate Trend Component Hidden in Measured Signal
    Jiang, X. X.
    Li, S. M.
    FLUCTUATION AND NOISE LETTERS, 2016, 15 (02):
  • [6] A data-driven inspection method for identifying container bookings with concealed hazardous materials
    Shen, Xiuyu
    Chen, Jingxu
    Zhu, Siying
    Yu, Xinlian
    ENGINEERING OPTIMIZATION, 2024, 56 (09) : 1361 - 1381
  • [7] IDENTIFYING HOUSING SUBMARKETS IN JOHOR BAHRU AND KULAI, MALAYSIA: A DATA-DRIVEN METHOD
    Sairi, Nur Asyikin Mohd
    Burhan, Burhaida
    Safian, Edie Ezwan Mohd
    REAL ESTATE MANAGEMENT AND VALUATION, 2022, 30 (02) : 1 - 11
  • [8] A Unified Method for Detecting Secondary Trait Associations with Rare Variants: Application to Sequence Data
    Liu, Dajiang J.
    Leal, Suzanne M.
    PLOS GENETICS, 2012, 8 (11):
  • [9] Data-driven encoding for quantitative genetic trait prediction
    Dan He
    Zhanyong Wang
    Laxmi Parida
    BMC Bioinformatics, 16
  • [10] Data-driven encoding for quantitative genetic trait prediction
    He, Dan
    Wang, Zhanyong
    Parida, Laxmi
    BMC BIOINFORMATICS, 2015, 16