Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses

Cited by: 5
Authors
Kirichenko, Anastasiya D. [1]
Poroshina, Anastasiya A. [2]
Sherbakov, Dmitry Yu. [2, 3, 4]
Sadovsky, Michael G. [5, 6, 7]
Krutovsky, Konstantin V. [1, 8, 9, 10, 11, 12]
Affiliations
[1] Siberian Fed Univ, Inst Fundamental Biol & Biotechnol, Dept Genom & Bioinformat, Krasnoyarsk, Russia
[2] Russian Acad Sci, Limnol Inst, Lab Mol Systemat, Siberian Branch, Irkutsk, Russia
[3] Irkutsk State Univ, Fac Biol & Soil Studies, Irkutsk, Russia
[4] Novosibirsk State Univ, Fac Nat Sci, Novosibirsk, Russia
[5] Russian Acad Sci, Inst Computat Modelling, Siberian Branch, Krasnoyarsk, Russia
[6] VF Voino Yasenetsky Krasnoyarsk State Med Univ, Krasnoyarsk, Russia
[7] Fed Med Biol Agcy, Fed Res & Clin Ctr, Krasnoyarsk, Russia
[8] Georg August Univ Gottingen, Dept Forest Genet & Forest Tree Breeding, Gottingen, Germany
[9] Georg August Univ Gottingen, Ctr Integrated Breeding Res, Gottingen, Germany
[10] Siberian Fed Univ, Inst Fundamental Biol & Biotechnol, Genome Res & Educ Ctr, Lab Forest Genom, Krasnoyarsk, Russia
[11] Russian Acad Sci, NI Vavilov Inst Gen Genet, Lab Populat Genet, Moscow, Russia
[12] GF Morozov Voronezh State Univ Forestry & Technol, Sci & Methodol Ctr, Voronezh, Russia
Source
PLOS ONE | 2022 / Volume 17 / Issue 03
Keywords
RECOMBINATION; EVOLUTION;
DOI
10.1371/journal.pone.0264640
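Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code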
07 ; 0710 ; 09 ;
Abstract
SARS-CoV-2 is the third coronavirus, after SARS-CoV and MERS-CoV, known to cause severe respiratory syndrome in humans. All three likely crossed the species barrier from animals to humans and are therefore of zoonotic origin. The origin, evolution, and phylogenetic relationships of viruses are of great importance for studying their pathogenicity and for developing antiviral drugs and vaccines. The main objective of this study was to compare two methods for identifying relationships between coronavirus genomes, using 69 coronavirus genomes selected from two public databases: a phylogenetic method based on whole genome alignment followed by molecular phylogenetic tree inference, and alignment-free clustering of triplet frequencies. Both approaches produced well-resolved, robust classifications. In general, the clusters identified by the first approach agreed well with the classes identified by the second using K-means and the elastic map method, although not in every case; the discrepancies remain to be explained. Both approaches also showed significant divergence of genomes at the taxonomic level, but there was less correspondence between the genome groupings and the types of disease the viruses cause, which may be due to individual characteristics of the host. This research showed that alignment-free methods are efficient when combined with alignment-based methods: they have a significant advantage in computational complexity and provide valuable complementary information on the relationships among genomes.
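The alignment-free side of the comparison can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not say how triplets were counted or how K-means was initialised, so the overlapping step-1 window, the farthest-first initialisation, the function names (triplet_frequencies, kmeans) and the toy sequences below are all assumptions made for illustration. The elastic map method is not shown.

```python
from collections import Counter
from itertools import product

# All 64 nucleotide triplets in a fixed order, so every genome maps to
# a vector with the same coordinates.
TRIPLETS = ["".join(p) for p in product("ACGT", repeat=3)]


def triplet_frequencies(seq, step=1):
    """Map a sequence to its 64-dimensional triplet frequency vector.

    Assumption: a sliding window with the given step; windows containing
    symbols other than A, C, G, T are ignored.
    """
    seq = seq.upper()
    counts = Counter(seq[i:i + 3] for i in range(0, len(seq) - 2, step))
    total = sum(counts[t] for t in TRIPLETS)
    if total == 0:
        return [0.0] * len(TRIPLETS)
    return [counts[t] / total for t in TRIPLETS]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=100):
    """Plain Lloyd's K-means with deterministic farthest-first seeding."""
    # Seeding (an assumption): start from the first point, then repeatedly
    # add the point farthest from the centers chosen so far.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append([sum(col) / len(members) for col in zip(*members)])
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


# Toy sequences (hypothetical, not coronavirus genomes).
toy_genomes = {
    "at_rich_1": "ATATATATATAT",
    "at_rich_2": "ATTATATAATAT",
    "gc_rich_1": "GCGCGCGCGCGC",
    "gc_rich_2": "GCCGCGCGGCGC",
}
vectors = [triplet_frequencies(s) for s in toy_genomes.values()]
labels, _ = kmeans(vectors, k=2)
for name, label in zip(toy_genomes, labels):
    print(name, label)
```

Each genome is reduced to a fixed-length vector in a single pass, with no pairwise alignment. This is consistent with the advantage in computational complexity that the abstract reports for alignment-free methods.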
Pages: 26