andi: Fast and accurate estimation of evolutionary distances between closely related genomes

Cited by: 59
Authors
Haubold, Bernhard [1]
Kloetzl, Fabian [1, 2]
Pfaffelhuber, Peter [3]
Affiliations
[1] Max Planck Inst Evolut Biol, Dept Evolutionary Genet, D-24306 Plon, Germany
[2] Med Univ Lubeck, Inst Neuro & Bioinformat, D-23562 Lubeck, Germany
[3] Univ Freiburg, Math Inst, Math Stochast, Freiburg, Germany
Keywords
COMMON SUBSTRING APPROACH; MULTIPLE ALIGNMENT; SEQUENCE; RECONSTRUCTION; RECOMBINATION; ORGANISMS;
DOI
10.1093/bioinformatics/btu815
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: A standard approach to classifying sets of genomes is to calculate their pairwise distances. This is difficult for large samples. We have therefore developed an algorithm for rapidly computing the evolutionary distances between closely related genomes. Results: Our distance measure is based on ungapped local alignments that we anchor through pairs of maximal unique matches of a minimum length. These exact matches can be looked up efficiently using enhanced suffix arrays and our implementation requires approximately only 1 s and 45 MB RAM/Mbase analysed. The pairing of matches distinguishes non-homologous from homologous regions leading to accurate distance estimation. We show this by analysing simulated data and genome samples ranging from 29 Escherichia coli/Shigella genomes to 3085 genomes of Streptococcus pneumoniae.
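The anchoring idea in the abstract can be illustrated with a minimal Python sketch. It is not the andi implementation. It replaces enhanced suffix arrays with a naive k-mer dictionary, treats exact matches whose first min_len bases are unique in the subject as anchors, and assumes that two consecutive anchors separated by the same distance in both sequences flank a homologous, ungapped region. The Jukes-Cantor correction, all function names, and the min_len default are assumptions made for illustration.

```python
import math
from collections import defaultdict


def unique_kmer_positions(seq, k):
    """Map every k-mer that occurs exactly once in seq to its start position."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos[0] for kmer, pos in positions.items() if len(pos) == 1}


def find_anchors(query, subject, min_len):
    """Scan the query left to right and return (q_start, s_start, length)
    exact matches whose first min_len bases are unique in the subject."""
    index = unique_kmer_positions(subject, min_len)
    anchors = []
    q = 0
    while q + min_len <= len(query):
        s = index.get(query[q:q + min_len])
        if s is None:
            q += 1
            continue
        length = min_len
        # Extend the exact match to the right as far as possible.
        while (q + length < len(query) and s + length < len(subject)
               and query[q + length] == subject[s + length]):
            length += 1
        anchors.append((q, s, length))
        q += length + 1  # step over the position that ended the match
    return anchors


def count_mismatches(query, subject, min_len):
    """Return (mismatches, aligned positions) over equidistant anchor pairs."""
    anchors = find_anchors(query, subject, min_len)
    mismatches = gaps = 0
    paired = set()
    for i in range(len(anchors) - 1):
        q1, s1, l1 = anchors[i]
        q2, s2, _ = anchors[i + 1]
        gap_q = q2 - (q1 + l1)
        gap_s = s2 - (s1 + l1)
        if gap_q != gap_s:
            continue  # not equidistant: treated as non-homologous here
        between_q = query[q1 + l1:q2]
        between_s = subject[s1 + l1:s2]
        mismatches += sum(a != b for a, b in zip(between_q, between_s))
        gaps += gap_q
        paired.update((i, i + 1))
    aligned = gaps + sum(anchors[i][2] for i in paired)
    return mismatches, aligned


def estimate_distance(query, subject, min_len=4):
    """Jukes-Cantor distance from the mismatch rate, or None without pairs."""
    mismatches, aligned = count_mismatches(query, subject, min_len)
    if aligned == 0:
        return None
    p = mismatches / aligned
    if p >= 0.75:
        return math.inf
    return -0.75 * math.log(1 - 4 * p / 3)


subject = "ACGTACGGATCTTTCAG"
query = "ACGTAAGGATCGTTCAG"  # two substitutions relative to subject
print(find_anchors(query, subject, 4))
print(estimate_distance(query, subject, 4))
```

In this toy example the two substitutions split the query into three anchors, and both flanked gaps are equidistant, so every position contributes to the estimate. A deletion in the query would shift the second anchor and the pair would be skipped rather than counted as divergence.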
Pages: 1169-1175
Page count: 7