Efficient Genome-Wide TagSNP Selection Across Populations via the Linkage Disequilibrium Criterion

被引:7
|
作者
Liu, Lan [1 ,2 ]
Wu, Yonghui [1 ,2 ]
Lonardi, Stefano [1 ]
Jiang, Tao [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92507 USA
[2] Google Inc, Mountain View, CA USA
关键词
genome-wide tagSNP selection; greedy algorithm; HapMap; Lagrangian relaxation; linkage disequilibrium; multiple populations; SINGLE-NUCLEOTIDE POLYMORPHISMS; HAPLOTYPE-TAGGING SNPS; SET; BLOCKS; ASSOCIATION; ALGORITHM; PATTERNS; MAP;
D O I
10.1089/cmb.2007.0228
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this article, we studied the tag single-nucleotide polymorphism (tagSNP) selection problem on multiple populations using the pairwise r(2) linkage disequilibrium criterion. We proposed a novel combinatorial optimization model for the tagSNP selection problem, called the minimum common tagSNP selection (MCTS) problem, and presented efficient solutions for MCTS. Our approach consists of the following three main steps: (i) partitioning the SNP markers into small disjoint components, (ii) applying some data reduction rules to simplify the problem, and (iii) applying either a fast greedy algorithm or a Lagrangian relaxation algorithm to solve the remaining (general) MCTS. These algorithms also provide lower bounds on tagging (i. e., the minimum number of tagSNPs needed). The lower bounds allow us to evaluate how far our solution is from the optimum. To the best of our knowledge, it is the first time the tagging lower bounds are discussed in the literature. We assessed the performance of our algorithms on real HapMap data for genome-wide tagging. The experiments demonstrated that our algorithms run 3-4 orders of magnitude faster than the existing single-population tagging programs such as FESTA, LD-Select, and the multiple-population tagging method MultiPop-TagSelect. Our method also greatly reduced the required tagSNPs compared with LD-Select on a single population and MultiPop-TagSelect on multiple populations. Moreover, the numbers of tagSNPs selected by our algorithms are almost optimal because they are very close to the corresponding lower bounds obtained by our method.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 50 条
  • [31] Extent and genome-wide distribution of linkage disequilibrium in commercial maize germplasm
    Van Inghelandt, Delphine
    Reif, Jochen C.
    Dhillon, Baldev S.
    Flament, Pascal
    Melchinger, Albrecht E.
    THEORETICAL AND APPLIED GENETICS, 2011, 123 (01) : 11 - 20
  • [32] Genome-wide linkage disequilibrium in the Blonde d'Aquitaine cattle breed
    Beghain, J.
    Boitard, S.
    Weiss, B.
    Boussaha, M.
    Gut, I.
    Rocha, D.
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2013, 130 (04) : 294 - 302
  • [33] Extent and genome-wide distribution of linkage disequilibrium in commercial maize germplasm
    Delphine Van Inghelandt
    Jochen C. Reif
    Baldev S. Dhillon
    Pascal Flament
    Albrecht E. Melchinger
    Theoretical and Applied Genetics, 2011, 123 : 11 - 20
  • [34] Genome-wide linkage disequilibrium in two Japanese beef cattle breeds
    Odani, M
    Narita, A
    Watanabe, T
    Yokouchi, K
    Sugimoto, Y
    Fujita, T
    Oguni, T
    Matsumoto, M
    Sasaki, Y
    ANIMAL GENETICS, 2006, 37 (02) : 139 - 144
  • [35] Genome-wide linkage disequilibrium in a Thai multibreed dairy cattle population
    Laodim, Thawee
    Koonawootrittriron, Skorn
    Elzo, Mauricio A.
    Suwanasopee, Thanathip
    LIVESTOCK SCIENCE, 2015, 180 : 27 - 33
  • [36] Genome-Wide Linkage-Disequilibrium Profiles from Single Individuals
    Lynch, Michael
    Xu, Sen
    Maruki, Takahiro
    Jiang, Xiaoqian
    Pfaffelhuber, Peter
    Haubold, Bernhard
    GENETICS, 2014, 198 (01) : 269 - +
  • [37] Power of genome-wide linkage disequilibrium testing by using microsatellite markers
    Ohashi, J
    Tokunaga, K
    JOURNAL OF HUMAN GENETICS, 2003, 48 (09) : 487 - 491
  • [38] Power of genome-wide linkage disequilibrium testing by using microsatellite markers
    Jun Ohashi
    Katsushi Tokunaga
    Journal of Human Genetics, 2003, 48 : 487 - 491
  • [39] SELECTION IN POPULATIONS WITH INITIAL LINKAGE DISEQUILIBRIUM
    JONES, LP
    COMSTOCK, RE
    GENETICS, 1968, 60 (1P2) : 190 - &
  • [40] Genome wide linkage disequilibrium mapping in human populations applied to complex diseases
    Cohen, D
    BIOLOGICAL PSYCHIATRY, 1999, 45 (08) : 96S - 96S