Augmenting bacterial similarity measures using a graph-based genome representation

被引:0
|
作者
Ramanan, Vivek [1 ,2 ]
Sarkar, Indra Neil [1 ,2 ,3 ]
机构
[1] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[2] Brown Univ, Ctr Biomed Informat, Providence, RI 02912 USA
[3] Rhode Isl Qual Inst, Providence, RI 02908 USA
关键词
synteny; genome analysis; microbiome; 16S RIBOSOMAL-RNA; IDENTIFICATION; PHYLOGENY; CORE;
D O I
10.1128/msystems.00497-24
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Relationships between bacterial taxa are traditionally defined using 16S rRNA nucleotide similarity or average nucleotide identity. Improvements in sequencing technology provide additional pairwise information on genome sequences, which may provide valuable information on genomic relationships. Mapping orthologous gene locations between genome pairs, known as synteny, is typically implemented in the discovery of new species and has not been systematically applied to bacterial genomes. Using a data set of 378 bacterial genomes, we developed and tested a new measure of synteny similarity between a pair of genomes, which was scaled onto 16S rRNA distance using covariance matrices. Based on the input gene functions used (i.e., core, antibiotic resistance, and virulence), we observed varying topological arrangements of bacterial relationship networks by applying (i) complete linkage hierarchical clustering and (ii) K-nearest neighbor graph structures to synteny-scaled 16S data. Our metric improved clustering quality comparatively to state-of-the-art average nucleotide identity metrics while preserving clustering assignments for the highest similarity relationships. Our findings indicate that syntenic relationships provide more granular and interpretable relationships for within-genera taxa compared to pairwise similarity measures, particularly in functional contexts.IMPORTANCEGiven the prevalence and necessity of the 16S rRNA measure in bacterial identification and analysis, this additional analysis adds a functional and synteny-based layer to the identification of relatives and clustering of bacteria genomes. It is also of computational interest to model the bacterial genome as a graph structure, which presents new avenues of genomic analysis for bacteria and their closely related strains and species. Given the prevalence and necessity of the 16S rRNA measure in bacterial identification and analysis, this additional analysis adds a functional and synteny-based layer to the identification of relatives and clustering of bacteria genomes. It is also of computational interest to model the bacterial genome as a graph structure, which presents new avenues of genomic analysis for bacteria and their closely related strains and species.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Graph-based sparse representation for image denoising
    Ge, Qi
    Cheng, Xiaogang
    Shao, Wenze
    Dong, Yue
    Zhuang, Wenqin
    Li, Haibo
    6TH INTERNATIONAL CONFERENCE ON APPLIED HUMAN FACTORS AND ERGONOMICS (AHFE 2015) AND THE AFFILIATED CONFERENCES, AHFE 2015, 2015, 3 : 2049 - 2056
  • [32] Augmenting LOD-Based Recommender Systems Using Graph Centrality Measures
    van Rossum, Bart
    Frasincar, Flavius
    WEB ENGINEERING (ICWE 2019), 2019, 11496 : 19 - 31
  • [33] Graph-Based Representation for Multiview Image Geometry
    Maugey, Thomas
    Ortega, Antonio
    Frossard, Pascal
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (05) : 1573 - 1586
  • [34] A Graph-Based Representation Method for Fashion Color
    Chen, Yuyilan
    Dai, Yuqian
    Li, Li
    Ma, Chenqu
    Liu, Xiaogang
    APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [35] Graph-Based Representation of Symbolic Musical Data
    Mokbel, Bassam
    Hasenfuss, Alexander
    Hammer, Barbara
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2009, 5534 : 42 - 51
  • [36] Graph-based knowledge representation for GIS data
    Palacio, MP
    Sol, D
    González, J
    PROCEEDINGS OF THE FOURTH MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2003), 2003, : 117 - 124
  • [37] Graph-based Text Representation and Knowledge Discovery
    Jin, Wei
    Srihari, Rohini K.
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 807 - 811
  • [38] GRAPH-BASED REPRESENTATION AND CODING OF MULTIVIEW GEOMETRY
    Maugey, Thomas
    Ortega, Antonio
    Frossard, Pascal
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 1325 - 1329
  • [39] Graph-based Document Representation for Relation Extraction
    Cabaleiro, Bernardo
    Penas, Anselmo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 57 - 64
  • [40] A Graph-Based Developmental Swarm Representation and Algorithm
    von Mammen, Sebastian
    Phillips, David
    Davison, Timothy
    Jacob, Christian
    SWARM INTELLIGENCE, 2010, 6234 : 1 - 12