A Conditional Autoregressive Model for Detecting Natural Selection in Protein-Coding DNA Sequences

被引:0
|
作者
Fan, Yu [1 ]
Wu, Rui [2 ]
Chen, Ming-Hui [2 ]
Kuo, Lynn [2 ]
Lewis, Paul O. [3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, 1400 Pressler Dr,FCT4-6000, Houston, TX 77030 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[3] Univ Connecticut, Dept Ecol Evolut Biol, Storrs, CT 06269 USA
来源
关键词
EVOLUTIONARY INFERENCE; MOLECULAR ADAPTATION; TERTIARY STRUCTURE; SITES;
D O I
10.1007/978-1-4614-7846-1_17
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Phylogenetics, the study of evolutionary relationships among groups of organisms, has played an important role in modern biological research, such as genomic comparison, detecting orthology and paralogy, estimating divergence times, reconstructing ancient proteins, identifying mutations likely to be associated with disease, determining the identity of new pathogens, and finding the residues that are important to natural selection. Given an alignment of protein-coding DNA sequences, most methods for detecting natural selection rely on estimating the codon-specific nonsynonymous/synonymous rate ratios (dN/dS). Here, we describe an approach to modeling variation in the dN/dS by using a conditional autoregressive (CAR) model. The CAR model relaxes the assumption in most contemporary phylogenetic models, i.e., sites in molecular sequences evolve independently. By incorporating the information stored in the Protein Data Bank (PDB) file, the CAR model estimates the dN/dS based on the protein three-dimensional structure. We implement the model in a fully Bayesian approach with all parameters of the model considered as random variables and make use of the NVIDIA's parallel computing architecture (CUDA) to accelerate the calculation. Our result of analyzing an empirical abalone sperm lysine data is in accordance with the previous findings.
引用
收藏
页码:203 / 212
页数:10
相关论文
共 50 条
  • [41] Individual variation in protein-coding sequences of human genome
    Sunyaev, S
    Hanke, J
    Brett, D
    Aydin, A
    Zastrow, I
    Lathe, W
    Bork, P
    Reich, J
    [J]. ADVANCES IN PROTEIN CHEMISTRY, VOL 54: ANALYSIS OF AMINO ACID SEQUENCES, 2000, 54 : 409 - 437
  • [42] COATi: Statistical Pairwise Alignment of Protein-Coding Sequences
    Mesa, Juan Jose Garcia
    Zhu, Ziqi
    Cartwright, Reed A.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2024, 41 (07)
  • [43] Computational identification of protein-coding sequences by comparative analysis
    Fontaine, Arnaud
    Touzet, Helene
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2009, 3 (02) : 160 - 176
  • [44] Selfish DNA in protein-coding genes of Rickettsia
    Ogata, H
    Audic, S
    Barbe, V
    Artiguenave, F
    Fournier, PE
    Raoult, D
    Claverie, JM
    [J]. SCIENCE, 2000, 290 (5490) : 347 - 350
  • [45] Effects of natural selection on interpopulation divergence at polymorphic sites in human protein-coding loci
    Hughes, AL
    Packer, B
    Welch, R
    Bergen, AW
    Chanock, SJ
    Yeager, M
    [J]. GENETICS, 2005, 170 (03) : 1181 - 1187
  • [46] Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes
    Lin, Michael F.
    Kheradpour, Pouya
    Washietl, Stefan
    Parker, Brian J.
    Pedersen, Jakob S.
    Kellis, Manolis
    [J]. GENOME RESEARCH, 2011, 21 (11) : 1916 - 1928
  • [47] IDENTIFICATION OF PROTEIN-CODING REGIONS IN GENOMIC DNA
    SNYDER, EE
    STORMO, GD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1995, 248 (01) : 1 - 18
  • [48] Translation conditional models for protein coding sequences
    Rodolphe, F
    Mathé, C
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (1-2) : 249 - 260
  • [49] Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model
    Rodrigue, Nicolas
    Lartillot, Nicolas
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2017, 34 (01) : 204 - 214
  • [50] A combinatorial algorithm for highly specific recognition of protein-coding regions in higher eukaryotic DNA sequences
    Roitberg, MA
    Astakhova, TV
    Gelfand, MS
    [J]. MOLECULAR BIOLOGY, 1997, 31 (01) : 18 - 23