A Data-Driven Method for Identifying Rare Variants with Heterogeneous Trait Effects

Cited by: 13
Authors
Zhang, Qunyuan [1]
Irvin, Marguerite R. [2]
Arnett, Donna K. [2]
Province, Michael A. [1]
Borecki, Ingrid [1]
Affiliations
[1] Washington Univ, Sch Med, Div Stat Genom, St Louis, MO 63108 USA
[2] Univ Alabama, Dept Epidemiol, Birmingham, AL USA
Keywords
rare variant; collapsing; heterogeneous effects; sum test; quantitative trait; common diseases; plasma levels; association; cholesterol; contribute; genes
DOI
10.1002/gepi.20618
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Collapsing multiple variants into one variable and testing their collective effect is a useful strategy for rare variant association analysis. Direct collapsing, however, may be invalid or lose substantial power when the variants to be collapsed have heterogeneous effects on the target trait (i.e., some positive and some negative). This could be especially true for quantitative traits (such as blood pressure and body mass index), regardless of whether subjects are sampled randomly from a population or selectively from the two extreme tails of the trait distribution. To deal with this problem, we propose a novel, data-driven method, the P-value Weighted Sum Test (PWST), which allows each variant to be individually weighted according to the evidence of association from the data itself. Specifically, both the significance and the direction of individual variant effects are used to calculate a single weighted sum score based on rescaled left-tail P-values from single-variant analysis, after which a permutation test of association is performed between the score and the trait. Our simulations under different sampling strategies show that PWST significantly increases statistical power when there are heterogeneous variant effects. The appeal of the PWST approach is illustrated in an application to sequence data by detecting the collective effect of variants in the peroxisome proliferator-activated receptor alpha (PPARα) gene on triglyceride (TG) response to fenofibrate treatment in 300 subjects from the Genetics of Lipid Lowering and Diet Network study. Genet. Epidemiol. 35:679-685, 2011. © 2011 Wiley Periodicals, Inc.
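The procedure described in the abstract (single-variant left-tail P-values, rescaling into signed weights, a weighted sum score, then a permutation test) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation. The abstract does not give the rescaling formula, so the mapping w = 2(p - 0.5) used here is an assumption, as are the normal approximation for the single-variant test, the use of the absolute Pearson correlation as the test statistic, and the choice to recompute the weights inside every permutation. All function names and the toy data are hypothetical.

```python
import math
import random
from statistics import NormalDist


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def left_tail_p(genotype, trait):
    """Left-tail P-value for one variant (assumption: normal approximation
    to the regression t statistic, computed from the correlation)."""
    n = len(trait)
    r = max(-0.999999, min(0.999999, pearson(genotype, trait)))
    z = r * math.sqrt((n - 2) / (1.0 - r * r))
    return NormalDist().cdf(z)


def pwst_weights(genotypes, trait):
    """Assumed rescaling: p in [0, 1] -> weight in [-1, 1].
    Trait-increasing variants get positive weights, trait-decreasing negative."""
    return [2.0 * (left_tail_p(g, trait) - 0.5) for g in genotypes]


def pwst_statistic(genotypes, trait):
    """Absolute correlation between the weighted sum score and the trait."""
    w = pwst_weights(genotypes, trait)
    score = [sum(wj * g[i] for wj, g in zip(w, genotypes))
             for i in range(len(trait))]
    return abs(pearson(score, trait))


def pwst_permutation_test(genotypes, trait, n_perm=200, seed=0):
    """Permutation P-value. Weights are re-estimated on each permuted trait
    because they are themselves derived from the data."""
    rng = random.Random(seed)
    observed = pwst_statistic(genotypes, trait)
    permuted = list(trait)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(permuted)
        if pwst_statistic(genotypes, permuted) >= observed:
            exceed += 1
    return (exceed + 1) / (n_perm + 1)


# Toy data (hypothetical): one trait-increasing, one trait-decreasing,
# and one null rare variant, each carried by 10 of 100 subjects.
noise = random.Random(1)
n = 100
variant_up = [1 if i < 10 else 0 for i in range(n)]
variant_down = [1 if 10 <= i < 20 else 0 for i in range(n)]
variant_null = [1 if 20 <= i < 30 else 0 for i in range(n)]
genotypes = [variant_up, variant_down, variant_null]
trait = [noise.gauss(0, 1) + 2 * variant_up[i] - 2 * variant_down[i]
         for i in range(n)]

weights = pwst_weights(genotypes, trait)
p_value = pwst_permutation_test(genotypes, trait)
print(weights, p_value)
```

In this toy setup, an unweighted carrier count would add the two opposing variants together and largely cancel their effects, whereas the signed weights let both contribute to the score in the same direction.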
Pages: 679-685
Page count: 7