Genome-wide tagging SNPs with entropy-based Monte Carlo method

被引:8
|
作者
Liu, Zhenqiu
Lin, Shili
Tan, Ming
机构
[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[2] Univ Maryland, Greenbaum Canc Ctr, Div Biostat, Baltimore, MD 21201 USA
关键词
haplotype diversity; importance sampling; Kullback-Leibler distance; multilocus linkage disequilibrium; tagging SNPs;
D O I
10.1089/cmb.2006.13.1606
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The number of common single nucleotide polymorphisms (SNPs) in the human genome is estimated to be around 3-6 million. It is highly anticipated that the study of SNPs will help provide a means for elucidating the genetic component of complex diseases and variable drug responses. High-throughput technologies such as oligonucleotide arrays have produced enormous amount of SNP data, which creates great challenges in genome-wide disease linkage and association studies. In this paper, we present an adaptation of the cross entropy (CE) method and propose an iterative CE Monte Carlo (CEMC) algorithm for tagging SNP selection. This differs from most of SNP selection algorithms in the literature in that our method is independent of the notion of haplotype block. Thus, the method is applicable to whole genome SNP selection without prior knowledge of block boundaries. We applied this block-free algorithm to three large datasets (two simulated and one real) that are in the order of thousands of SNPs. The successful applications to these large scale datasets demonstrate that CEMC is computationally feasible for whole genome SNP selection. Furthermore, the results show that CEMC is significantly better than random selection, and it also outperformed another block-free selection algorithm for the dataset considered.
引用
收藏
页码:1606 / 1614
页数:9
相关论文
共 50 条
  • [1] An entropy-based genome-wide transmission/disequilibrium test
    Zhao, Jinying
    Boerwinkle, Eric
    Xiong, Momiao
    HUMAN GENETICS, 2007, 121 (3-4) : 357 - 367
  • [2] An entropy-based genome-wide transmission/disequilibrium test
    Jinying Zhao
    Eric Boerwinkle
    Momiao Xiong
    Human Genetics, 2007, 121 : 357 - 367
  • [3] Entropy-Based Model Selection Using Monte Carlo Method
    Satoh, Masaki
    Miura, Takao
    2019 6TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC 2019), 2019,
  • [4] Entropy-based adaptive Hamiltonian Monte Carlo
    Hirt, Marcel
    Titsias, Michalis K.
    Dellaportas, Petros
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Entropy-based joint analysis for two-stage genome-wide association studies
    Guolian Kang
    Yijun Zuo
    Journal of Human Genetics, 2007, 52 : 747 - 756
  • [6] Entropy-based joint analysis for two-stage genome-wide association studies
    Kang, Guolian
    Zuo, Yijun
    JOURNAL OF HUMAN GENETICS, 2007, 52 (09) : 747 - 756
  • [7] Genome-wide tagging for everyone
    Need, Anna C.
    Goldstein, David B.
    NATURE GENETICS, 2006, 38 (11) : 1227 - 1228
  • [8] Genome-wide tagging for everyone
    Anna C Need
    David B Goldstein
    Nature Genetics, 2006, 38 : 1227 - 1228
  • [10] Optimal haplotype block-free selection of tagging SNPs for genome-wide association studies
    Halldórsson, BV
    Bafna, V
    Lippert, R
    Schwartz, R
    De La Vega, FM
    Clark, AG
    Istrail, S
    GENOME RESEARCH, 2004, 14 (08) : 1633 - 1640