A Conditional Autoregressive Model for Detecting Natural Selection in Protein-Coding DNA Sequences

Cited by: 0
Authors
Fan, Yu [1]
Wu, Rui [2]
Chen, Ming-Hui [2]
Kuo, Lynn [2]
Lewis, Paul O. [3]
Affiliations
[1] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, 1400 Pressler Dr,FCT4-6000, Houston, TX 77030 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[3] Univ Connecticut, Dept Ecol Evolut Biol, Storrs, CT 06269 USA
Source
Keywords
EVOLUTIONARY INFERENCE; MOLECULAR ADAPTATION; TERTIARY STRUCTURE; SITES
DOI
10.1007/978-1-4614-7846-1_17
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Phylogenetics, the study of evolutionary relationships among groups of organisms, plays an important role in modern biological research, with applications such as genomic comparison, detecting orthology and paralogy, estimating divergence times, reconstructing ancient proteins, identifying mutations likely to be associated with disease, determining the identity of new pathogens, and finding the residues that are important to natural selection. Given an alignment of protein-coding DNA sequences, most methods for detecting natural selection rely on estimating codon-specific nonsynonymous/synonymous rate ratios (dN/dS). Here, we describe an approach to modeling variation in dN/dS using a conditional autoregressive (CAR) model. The CAR model relaxes the assumption, made by most contemporary phylogenetic models, that sites in molecular sequences evolve independently. By incorporating the information stored in a Protein Data Bank (PDB) file, the CAR model estimates dN/dS based on the three-dimensional structure of the protein. We implement the model in a fully Bayesian framework, treating all model parameters as random variables, and use NVIDIA's parallel computing architecture (CUDA) to accelerate the calculation. Our results from analyzing an empirical abalone sperm lysin data set are in accordance with previous findings.
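To illustrate the idea of letting structurally close sites share information, the following is a minimal sketch of a textbook intrinsic CAR conditional distribution. It is not the authors' implementation: the function names `neighbor_matrix` and `car_conditional`, the distance cutoff, the variance parameter `tau2`, and the choice to model log dN/dS are all assumptions made for illustration, and the coordinates are toy values rather than data read from a PDB file.

```python
import numpy as np

def neighbor_matrix(coords, cutoff=10.0):
    """Binary adjacency matrix: sites i and j are neighbors when their
    3D coordinates lie within `cutoff` (a hypothetical threshold)."""
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    W = (dist <= cutoff).astype(float)
    np.fill_diagonal(W, 0.0)  # a site is not its own neighbor
    return W

def car_conditional(x, W, i, tau2=1.0):
    """Conditional mean and variance of x[i] given all other sites under
    an intrinsic CAR prior: Normal(mean of neighbors, tau2 / n_neighbors)."""
    n_i = W[i].sum()
    if n_i == 0:
        raise ValueError("site has no neighbors; conditional is undefined")
    return W[i] @ x / n_i, tau2 / n_i

# Toy example: four sites on a line; only adjacent sites 0-1 and 1-2
# fall within the cutoff, and site 3 is isolated.
coords = np.array([[0.0, 0.0, 0.0],
                   [5.0, 0.0, 0.0],
                   [10.0, 0.0, 0.0],
                   [30.0, 0.0, 0.0]])
W = neighbor_matrix(coords, cutoff=6.0)
log_omega = np.array([0.0, 1.0, 2.0, 5.0])  # assumed log dN/dS values
mean, var = car_conditional(log_omega, W, i=1)
print(mean, var)  # 1.0 0.5
```

Under this form, the value at each site is pulled toward the average of its spatial neighbors, and sites with more neighbors have tighter conditional distributions.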
Pages: 203-212
Number of pages: 10