Accuracy and power of Bayes prediction of amino acid sites under positive selection

被引:303
|
作者
Anisimova, M
Bielawski, JP
Yang, ZH
机构
[1] UCL, Dept Biol, Galton Lab, London WC1E 6BT, England
[2] UCL, Ctr Math & Phys Life Sci & Expt Biol, London WC1E 6BT, England
关键词
Bayes inference; likelihood; nonsynonymous-synonymous rate ratio; positive selection; posterior probability;
D O I
10.1093/oxfordjournals.molbev.a004152
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Bayes prediction quantifies uncertainty by assigning posterior probabilities. It Was used to identify amino acids in a protein under recurrent diversifying selection indicated by higher nonsynonymous, (d(N)) than synonymous (d(S)) substitution rates or by omega = d(N)/d(S) > 1. Parameters were estimated by maximum likelihood under a codon substitution model that assumed several classes of sites with different w ratios. The Bayes theorem was used to calculate the posterior probabilities of each site falling into these site classes. Here. we evaluate the performance of Bayes prediction of amino acids under positive selection by computer simulation. We measured the accuracy by the proportion of predicted sites that were truly under selection and the power by the proportion of true positively selected sites that were predicted by the method. The accuracy was slightly better for longer sequences, whereas the power was largely unaffected by the increase in sequence length. Both accuracy and power were higher for medium or highly diverged sequences than for similar sequences. We found that accuracy and power were unacceptably low when data contained only a few highly similar sequences. However, sampling a large number of lineage improved the performance substantially. Even for very similar sequences. accuracy and Power can he high if over 100 taxa are used in the analysis. We make the following recommendations: (1) prediction of positive selection sites is not feasible for a few closely related sequences: (2) using it large number of lineages is the best way to improve the accuracy and power of the prediction: and (3) multiple models of heterogeneous selective pressures among sites should he applied in real data analysis.
引用
收藏
页码:950 / 958
页数:9
相关论文
共 50 条
  • [1] Bayes empirical Bayes inference of amino acid sites under positive selection
    Yang, ZH
    Wong, WSW
    Nielsen, R
    MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (04) : 1107 - 1118
  • [2] Detecting amino acid sites under positive selection and purifying selection
    Massingham, T
    Goldman, N
    GENETICS, 2005, 169 (03) : 1753 - 1762
  • [3] A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection
    Lemey, Philippe
    Minin, Vladimir N.
    Bielejec, Filip
    Pond, Sergei L. Kosakovsky
    Suchard, Marc A.
    BIOINFORMATICS, 2012, 28 (24) : 3248 - 3256
  • [4] Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites
    Anisimova, M
    Nielsen, R
    Yang, ZH
    GENETICS, 2003, 164 (03) : 1229 - 1236
  • [5] Anopheles Immune Genes and Amino Acid Sites Evolving Under the Effect of Positive Selection
    Parmakelis, Aristeidis
    Moustaka, Marina
    Poulakakis, Nikolaos
    Louis, Christos
    Slotman, Michel A.
    Marshall, Jonathon C.
    Awono-Ambene, Parfait H.
    Antonio-Nkondjio, Christophe
    Simard, Frederic
    Caccone, Adalgisa
    Powell, Jeffrey R.
    PLOS ONE, 2010, 5 (01):
  • [6] A method for detecting positive selection at single amino acid sites
    Suzuki, Y
    Gojobori, T
    MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (10) : 1315 - 1328
  • [7] New methods for detecting positive selection at single amino acid sites
    Suzuki, Y
    JOURNAL OF MOLECULAR EVOLUTION, 2004, 59 (01) : 11 - 19
  • [8] New Methods for Detecting Positive Selection at Single Amino Acid Sites
    Yoshiyuki Suzuki
    Journal of Molecular Evolution, 2004, 59 : 11 - 19
  • [9] Positive Darwinian Selection at Single Amino Acid Sites Conferring Plant Virus Resistance
    Cavatorta, J. R.
    Savage, A. E.
    Yeam, I.
    Gray, S. M.
    Jahn, M. M.
    JOURNAL OF MOLECULAR EVOLUTION, 2008, 67 (05) : 551 - 559
  • [10] Positive selection at sites of multiple amino acid replacements since rat–mouse divergence
    Georgii A. Bazykin
    Fyodor A. Kondrashov
    Aleksey Y. Ogurtsov
    Shamil Sunyaev
    Alexey S. Kondrashov
    Nature, 2004, 429 : 558 - 562