Efficient Association Study Design Via Power-Optimized Tag SNP Selection

被引:15
|
作者
Han, B. [3 ]
Kang, H. M. [3 ]
Seo, M. S. [4 ]
Zaitlen, N. [5 ]
Eskin, E. [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[3] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[4] Chosun Univ, Dept Compute Sci, Kwangju, South Korea
[5] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
association study; tag SNP selection; statistical power; single nucleotide polymorphism; linkage disequilibrium;
D O I
10.1111/j.1469-1809.2008.00469.x
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem for both custom follow-up studies as well as designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes the correlation between SNPs (r(2)). However, tag SNPs chosen based on an r(2) criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r(2)-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r(2)-based methods. Our method is publicly available via web server at external link type http://design.cs.ucla.edu.
引用
收藏
页码:834 / 847
页数:14
相关论文
共 50 条
  • [31] FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium
    Liu, Guimei
    Wang, Yue
    Wong, Limsoon
    BMC BIOINFORMATICS, 2010, 11
  • [32] Efficient Integrative Multi-SNP Association Analysis via Deterministic Approximation of Posteriors
    Wen, Xiaoquan
    Lee, Yeji
    Luca, Francesca
    Pique-Regi, Roger
    AMERICAN JOURNAL OF HUMAN GENETICS, 2016, 98 (06) : 1114 - 1129
  • [33] A Design Methodology for Domain-Optimized Power-Efficient Supercomputing
    Mohiyuddin, Marghoob
    Murphy, Mark
    Oliker, Leonid
    Shalf, John
    Wawrzynek, John
    Williams, Samuel
    PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS, 2009,
  • [34] Via structure optimized design and study of copper Interconnect
    Du, Ming
    Zhang, Rui
    MATERIALS PROCESSING TECHNOLOGY, PTS 1-4, 2011, 291-294 : 3445 - +
  • [35] Optimized Selection of Wireless Network Topologies and Components via Efficient Pruning of Feasible Paths
    Kirov, Dmitrii
    Nuzzo, Pierluigi
    Passerone, Roberto
    Sangiovanni-Vincentelli, Alberto
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [36] Nonlinear WEC Optimized Geometric Buoy Design for Efficient Reactive Power Requirements
    Wilson, David G.
    Robinett, Rush D. I. I. I. I. I. I.
    Bacelli, Giorgio
    Abdelkhalik, Ossama
    Weaver, Wayne W.
    Coe, Ryan
    OCEANS 2019 MTS/IEEE SEATTLE, 2019,
  • [37] Reliable, Efficient, and Power Optimized Control-Channel Selection Scheme for Cognitive Radio Networks
    Aslam, Saleem
    Shahid, Adnan
    Lee, Kyung Geun
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [38] Power estimation of multiple SNP association test of case-control study and application
    Hao, K
    Xu, X
    Laird, N
    Wang, XB
    Xu, XP
    GENETIC EPIDEMIOLOGY, 2004, 26 (01) : 22 - 30
  • [39] SNP Selection in Genome-Wide Association Studies via Penalized Support Vector Machine with MAX Test
    Kim, Jinseog
    Sohn, Insuk
    Kim, Dennis
    Jung, Sin-Ho
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [40] Bitwidth-Optimized Energy-Efficient FFT Design via Scaling Information Propagation
    Liu, Xinzhe
    Chen, Fupeng
    Muhamad, Raees Kizhakkumkara
    Blinder, David
    Nikolova, Dessislava
    Schelkens, Peter
    Catthoor, Francky
    Ha, Yajun
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 613 - 618