INSIDER: alignment-free detection of foreign DNA sequences

Cited by: 4
Authors
Tay, Aidan P. [1, 3]
Hosking, Brendan [1]
Hosking, Cameron [1]
Bauer, Denis C. [1, 2, 3]
Wilson, Laurence O. W. [1, 3]
Affiliations
[1] CSIRO, Australian E Hlth Res Ctr, Sydney, NSW, Australia
[2] Macquarie Univ, Dept Biomed Sci, Sydney, NSW, Australia
[3] Macquarie Univ, Fac Sci & Engn, Appl BioSci, Sydney, NSW, Australia
Keywords
Integrated DNA; k-mers; Alignment free; Gene drive; Genomic signature; Anti-microbial resistance detection; Viral integration; HORIZONTAL GENE-TRANSFER; PLASMID; DRIVE;
DOI
10.1016/j.csbj.2021.06.045
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
External DNA sequences can be inserted into an organism's genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or with limited information about the integrated sequence. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats. © 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
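The abstract states only that foreign segments are identified by a significant shift in k-mer signatures; it does not give the algorithm. The following is a minimal illustrative sketch of that general idea, not INSIDER's actual implementation. It assumes a single assembled sequence rather than raw sequencing reads, and every function name, the window size, the value of k, the Euclidean distance, and the z-score cutoff are hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter
from itertools import product


def kmer_profile(seq, k):
    """Normalised k-mer frequencies of seq, as a dict {kmer: frequency}."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def profile_distance(p, q, kmers):
    """Euclidean distance between two k-mer profiles (an assumed metric)."""
    return math.sqrt(sum((p.get(m, 0.0) - q.get(m, 0.0)) ** 2 for m in kmers))


def flag_windows(genome, k=3, window=500, step=250, z_cutoff=3.0):
    """Return (start, end) windows whose k-mer profile deviates strongly
    from the whole-sequence profile. All defaults are illustrative."""
    kmers = ["".join(t) for t in product("ACGT", repeat=k)]
    background = kmer_profile(genome, k)
    starts = list(range(0, len(genome) - window + 1, step))
    if not starts:
        return []
    dists = [
        profile_distance(kmer_profile(genome[s:s + window], k), background, kmers)
        for s in starts
    ]
    mean = sum(dists) / len(dists)
    sd = math.sqrt(sum((d - mean) ** 2 for d in dists) / len(dists))
    if sd == 0:
        return []
    return [
        (s, s + window)
        for s, d in zip(starts, dists)
        if (d - mean) / sd > z_cutoff
    ]


def simulate_genome(seed=0, host_len=20000, insert_len=1000):
    """Hypothetical toy data: an AT-rich 'host' with a GC-rich 'insert'
    placed in the middle. Returns the sequence and the insert coordinates."""
    rng = random.Random(seed)
    host = "".join(rng.choices("ACGT", weights=[35, 15, 15, 35], k=host_len))
    insert = "".join(rng.choices("ACGT", weights=[10, 40, 40, 10], k=insert_len))
    pos = host_len // 2
    return host[:pos] + insert + host[pos:], (pos, pos + insert_len)


if __name__ == "__main__":
    genome, (ins_start, ins_end) = simulate_genome()
    print("insert at", ins_start, ins_end)
    print("flagged windows:", flag_windows(genome))
```

In this toy setup the insert differs from the host in base composition, so short k-mers already separate the two. A realistic insert may be far subtler, and a mean-and-standard-deviation cutoff is sensitive to the outliers it is trying to find; a robust statistic (for example, median-based) would be a reasonable alternative.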
Pages: 3810-3816
Page count: 7