Augmenting bacterial similarity measures using a graph-based genome representation

被引:0
|
作者
Ramanan, Vivek [1 ,2 ]
Sarkar, Indra Neil [1 ,2 ,3 ]
机构
[1] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
[2] Brown Univ, Ctr Biomed Informat, Providence, RI 02912 USA
[3] Rhode Isl Qual Inst, Providence, RI 02908 USA
关键词
synteny; genome analysis; microbiome; 16S RIBOSOMAL-RNA; IDENTIFICATION; PHYLOGENY; CORE;
D O I
10.1128/msystems.00497-24
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Relationships between bacterial taxa are traditionally defined using 16S rRNA nucleotide similarity or average nucleotide identity. Improvements in sequencing technology provide additional pairwise information on genome sequences, which may provide valuable information on genomic relationships. Mapping orthologous gene locations between genome pairs, known as synteny, is typically implemented in the discovery of new species and has not been systematically applied to bacterial genomes. Using a data set of 378 bacterial genomes, we developed and tested a new measure of synteny similarity between a pair of genomes, which was scaled onto 16S rRNA distance using covariance matrices. Based on the input gene functions used (i.e., core, antibiotic resistance, and virulence), we observed varying topological arrangements of bacterial relationship networks by applying (i) complete linkage hierarchical clustering and (ii) K-nearest neighbor graph structures to synteny-scaled 16S data. Our metric improved clustering quality comparatively to state-of-the-art average nucleotide identity metrics while preserving clustering assignments for the highest similarity relationships. Our findings indicate that syntenic relationships provide more granular and interpretable relationships for within-genera taxa compared to pairwise similarity measures, particularly in functional contexts.IMPORTANCEGiven the prevalence and necessity of the 16S rRNA measure in bacterial identification and analysis, this additional analysis adds a functional and synteny-based layer to the identification of relatives and clustering of bacteria genomes. It is also of computational interest to model the bacterial genome as a graph structure, which presents new avenues of genomic analysis for bacteria and their closely related strains and species. Given the prevalence and necessity of the 16S rRNA measure in bacterial identification and analysis, this additional analysis adds a functional and synteny-based layer to the identification of relatives and clustering of bacteria genomes. It is also of computational interest to model the bacterial genome as a graph structure, which presents new avenues of genomic analysis for bacteria and their closely related strains and species.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] High Quality Graph-Based Similarity Search
    Yu, Weiren
    McCann, Julie A.
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 83 - 92
  • [22] Graph-based Strategy for Establishing Morphology Similarity
    Juneja, Namit
    Zola, Jaroslaw
    Chandola, Varun
    Wodo, Olga
    33RD INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2021), 2020, : 169 - 180
  • [23] Collaborative filtering with a graph-based similarity measure
    Do Thi Lien
    Nguyen Duy Phuong
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, MANAGEMENT AND TELECOMMUNICATIONS (COMMANTEL), 2014, : 251 - 256
  • [24] Imitation from Observation using RL and Graph-based Representation of Demonstrations
    El Manyari, Yassine
    Le Callet, Patrick
    Dolle, Laurent
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1258 - 1265
  • [25] A New Graph-Based Molecular Descriptor Using the Canonical Representation of the Molecule
    Hentabli, Hamza
    Saeed, Faisal
    Abdo, Ammar
    Salim, Naomie
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [26] GRAPH-BASED LIGHT FIELDS REPRESENTATION AND CODING USING GEOMETRY INFORMATION
    Su, Xin
    Rizkallah, Mira
    Maugey, Thomas
    Guillemot, Christine
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 4023 - 4027
  • [27] Determining Degree of Relevance of Reviews Using a Graph-Based Text Representation
    Ramachandran, Lakshmi
    Gehringer, Edward F.
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 442 - 445
  • [28] Integrating assembly and machining planning using graph-based representation models
    Tseng, YJ
    Liou, LC
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2000, 38 (12) : 2619 - 2641
  • [29] Towards Performance Evaluation of Graph-Based Representation
    Jouili, Salim
    Tabbone, Salvatore
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, 2011, 6658 : 72 - 81
  • [30] Graph-based Isometry Invariant Representation Learning
    Khasanova, Renata
    Frossard, Pascal
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70