Proper Distance Metrics for Phylogenetic Analysis Using Complete Genomes without Sequence Alignment

被引:20
|
作者
Yu, Zu-Guo [1 ,2 ]
Zhan, Xiao-Wen [1 ]
Han, Guo-Sheng [1 ]
Wang, Roger W. [3 ]
Anh, Vo [2 ]
Chu, Ka Hou [4 ]
机构
[1] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan 411105, Hunan, Peoples R China
[2] Queensland Univ Technol, Sch Math Sci, Brisbane, Qld 4001, Australia
[3] Chinese Univ Hong Kong, Dept Math, Shatin, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Dept Biol, Shatin, Hong Kong, Peoples R China
基金
澳大利亚研究理事会;
关键词
phylogenetic analysis; complete genome; composition vector; correlation-related distance metric; PROTEIN SEQUENCES; WHOLE; INFORMATION;
D O I
10.3390/ijms11031141
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A shortcoming of most correlation distance methods based on the composition vectors without alignment developed for phylogenetic analysis using complete genomes is that the "distances" are not proper distance metrics in the strict mathematical sense. In this paper we propose two new correlation-related distance metrics to replace the old one in our dynamical language approach. Four genome datasets are employed to evaluate the effects of this replacement from a biological point of view. We find that the two proper distance metrics yield trees with the same or similar topologies as/to those using the old "distance" and agree with the tree of life based on 16S rRNA in a majority of the basic branches. Hence the two proper correlation-related distance metrics proposed here improve our dynamical language approach for phylogenetic analysis.
引用
收藏
页码:1141 / 1154
页数:14
相关论文
共 50 条
  • [1] Phylogenetic tree based on complete genomes using fractal and correlation analyses without sequence alignment
    Yu, Zu-Guo
    Anh, Vo
    [J]. WMSCI 2005: 9TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 6, 2005, : 204 - 210
  • [2] Distance-based analyses to reveal vertebrate phylogeny without sequence alignment using complete mitochondrial genomes
    Yu, Z. G.
    Zhou, L. Q.
    Anh, V. V.
    Chu, K. H.
    Li, C. P.
    Chien, Y. J.
    [J]. WMSCI 2007: 11TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL I, PROCEEDINGS, 2007, : 206 - +
  • [3] Prokaryotic phylogeny based on complete genomes without sequence alignment
    Hao, BL
    Qi, J
    Wang, B
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON FRONTIERS OF SCIENCE, 2003, : 441 - 444
  • [4] Prokaryotic phylogeny based on complete genomes without sequence alignment
    Hao, BL
    Qi, J
    Wang, B
    [J]. MODERN PHYSICS LETTERS B, 2003, 17 (03): : 91 - 94
  • [5] Analysis of Shape Alignment using Euclidean and Manhattan Distance Metrics
    Kumar, Rohit
    [J]. 2017 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN SIGNAL PROCESSING AND EMBEDDED SYSTEMS (RISE), 2017, : 326 - 331
  • [6] Multiple sequence alignment in phylogenetic analysis
    Phillips, A
    Janies, D
    Wheeler, W
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2000, 16 (03) : 317 - 330
  • [7] Phylogenetic analysis of Polyomaviruses based on their complete genomes
    Yu, Z. G.
    Zhou, L. Q.
    Chu, K. H.
    Li, C. P.
    Anh, V. V.
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 5, PROCEEDINGS, 2008, : 80 - +
  • [8] A mutual information based sequence distance for vertebrate phylogeny using complete mitochondrial genomes
    Yu, Z. G.
    Mao, Z.
    Zhou, L. Q.
    Anh, V. V.
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2007, : 253 - +
  • [9] Intraspecific phylogenetic analysis of Siberian woolly mammoths using complete mitochondrial genomes
    Gilbert, M. Thomas P.
    Drautz, Daniela I.
    Lesk, Arthur M.
    Ho, Simon Y. W.
    Qi, Ji
    Ratan, Aakrosh
    Hsu, Chih-Hao
    Sher, Andrei
    Dalen, Love
    Gotherstrom, Anders
    Tomsho, Lynn P.
    Rendulic, Snjezana
    Packard, Michael
    Campos, Paula F.
    Kuznetsova, Tatyana V.
    Shidlovskiy, Fyodor
    Tikhonov, Alexei
    Willerslev, Eske
    Iacumin, Paola
    Buigues, Bernard
    Ericson, Per G. P.
    Germonpre, Mietje
    Kosintsev, Pavel
    Nikolaev, Vladimir
    Nowak-Kemp, Malgosia
    Knight, James R.
    Irzyk, Gerard P.
    Perbost, Clotilde S.
    Fredrikson, Karin M.
    Harkins, Timothy T.
    Sheridan, Sharon
    Miller, Webb
    Schuster, Stephan C.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (24) : 8327 - 8332
  • [10] kSNP3.0: SNP detection and phylogenetic analysis of genomes without genome alignment or reference genome
    Gardner, Shea N.
    Slezak, Tom
    Hall, Barry G.
    [J]. BIOINFORMATICS, 2015, 31 (17) : 2877 - 2878