Efficient Genome-Wide TagSNP Selection Across Populations via the Linkage Disequilibrium Criterion

被引:7
|
作者
Liu, Lan [1 ,2 ]
Wu, Yonghui [1 ,2 ]
Lonardi, Stefano [1 ]
Jiang, Tao [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92507 USA
[2] Google Inc, Mountain View, CA USA
关键词
genome-wide tagSNP selection; greedy algorithm; HapMap; Lagrangian relaxation; linkage disequilibrium; multiple populations; SINGLE-NUCLEOTIDE POLYMORPHISMS; HAPLOTYPE-TAGGING SNPS; SET; BLOCKS; ASSOCIATION; ALGORITHM; PATTERNS; MAP;
D O I
10.1089/cmb.2007.0228
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this article, we studied the tag single-nucleotide polymorphism (tagSNP) selection problem on multiple populations using the pairwise r(2) linkage disequilibrium criterion. We proposed a novel combinatorial optimization model for the tagSNP selection problem, called the minimum common tagSNP selection (MCTS) problem, and presented efficient solutions for MCTS. Our approach consists of the following three main steps: (i) partitioning the SNP markers into small disjoint components, (ii) applying some data reduction rules to simplify the problem, and (iii) applying either a fast greedy algorithm or a Lagrangian relaxation algorithm to solve the remaining (general) MCTS. These algorithms also provide lower bounds on tagging (i. e., the minimum number of tagSNPs needed). The lower bounds allow us to evaluate how far our solution is from the optimum. To the best of our knowledge, it is the first time the tagging lower bounds are discussed in the literature. We assessed the performance of our algorithms on real HapMap data for genome-wide tagging. The experiments demonstrated that our algorithms run 3-4 orders of magnitude faster than the existing single-population tagging programs such as FESTA, LD-Select, and the multiple-population tagging method MultiPop-TagSelect. Our method also greatly reduced the required tagSNPs compared with LD-Select on a single population and MultiPop-TagSelect on multiple populations. Moreover, the numbers of tagSNPs selected by our algorithms are almost optimal because they are very close to the corresponding lower bounds obtained by our method.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 50 条
  • [41] Genetic Diversity, Linkage Disequilibrium and Selection Signatures in Chinese and Western Pigs Revealed by Genome-Wide SNP Markers
    Ai, Huashui
    Huang, Lusheng
    Ren, Jun
    PLOS ONE, 2013, 8 (02):
  • [42] Genome-wide linkage disequilibrium mapping of late onset Alzheimer's disease
    Hiltunen, M
    Pirskanen, M
    Helisalmi, S
    Koivisto, AM
    Lehtovirta, M
    Soininen, H
    Mannermaa, A
    Thompson, D
    Easton, D
    Ryynänen, M
    NEUROBIOLOGY OF AGING, 2002, 23 (01) : S423 - S423
  • [43] Relatedness Mapping and Tracts of Relatedness for Genome-Wide Data in the Presence of Linkage Disequilibrium
    Albrechtsen, Anders
    Korneliussen, Thorfinn Sand
    Moltke, Ida
    Hansen, Thomas van Overseem
    Nielsen, Finn Cilius
    Nielsen, Rasmus
    GENETIC EPIDEMIOLOGY, 2009, 33 (03) : 266 - 274
  • [44] Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method
    Liu, Jin
    Wang, Kai
    Ma, Shuangge
    Huang, Jian
    STATISTICS AND ITS INTERFACE, 2013, 6 (01) : 99 - 115
  • [45] Two genome-wide linkage disequilibrium screens in Scandinavian multiple sclerosis patients
    Harbo, HF
    Datta, P
    Oturai, A
    Ryder, LP
    Sawcer, S
    Setakis, E
    Åkesson, E
    Celius, EG
    Modin, H
    Sandberg-Wollheim, M
    Myhr, KM
    Andersen, O
    Hillert, J
    Sorensen, PS
    Svejgaard, A
    Compston, A
    Vartdal, F
    Spurkland, A
    JOURNAL OF NEUROIMMUNOLOGY, 2003, 143 (1-2) : 101 - 106
  • [46] Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle
    Jiang, Qi
    Wang, Zhiquan
    Moore, Stephen S.
    Yang, Rong-Cai
    BMC GENETICS, 2012, 13
  • [47] Noninvasive detection of twin zygosity using genome-wide linkage disequilibrium information
    Kong, Lingrong
    Yang, Yingjun
    Yuan, Chao
    Wei, Xing
    Zhou, Xinyao
    Zhou, Jia
    Xing, Ya
    Zou, Gang
    Sun, Qianqian
    Cai, Luyao
    Liang, Qiufeng
    Zhang, Yao
    Wang, Hongkun
    Liu, Zesi
    Wu, Di
    Sun, Luming
    CLINICAL AND TRANSLATIONAL MEDICINE, 2024, 14 (12):
  • [48] The genome-wide distribution and extent of background linkage disequilibrium in a population isolate.
    Service, SK
    Ophoff, R
    Freimer, NB
    AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (04) : 226 - 226
  • [49] Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle
    Qi Jiang
    Zhiquan Wang
    Stephen S Moore
    Rong-Cai Yang
    BMC Genetics, 13
  • [50] Linkage Disequilibrium and Evaluation of Genome-Wide Association Mapping Models in Tetraploid Potato
    Sharma, Sanjeev Kumar
    MacKenzie, Katrin
    McLean, Karen
    Dale, Finlay
    Daniels, Steve
    Bryan, Glenn J.
    G3-GENES GENOMES GENETICS, 2018, 8 (10): : 3185 - 3202