INSIDER: alignment-free detection of foreign DNA sequences

Cited by: 4
Authors
Tay, Aidan P. [1, 3]
Hosking, Brendan [1]
Hosking, Cameron [1]
Bauer, Denis C. [1, 2, 3]
Wilson, Laurence O. W. [1, 3]
Affiliations
[1] CSIRO, Australian E Hlth Res Ctr, Sydney, NSW, Australia
[2] Macquarie Univ, Dept Biomed Sci, Sydney, NSW, Australia
[3] Macquarie Univ, Fac Sci & Engn, Appl BioSci, Sydney, NSW, Australia
Keywords
Integrated DNA; k-mers; Alignment free; Gene drive; Genomic signature; Anti-microbial resistance detection; Viral integration; HORIZONTAL GENE-TRANSFER; PLASMID; DRIVE
DOI
10.1016/j.csbj.2021.06.045
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular biology]
Discipline classification codes
071010; 081704
Abstract
External DNA sequences can be inserted into an organism's genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or when information about the integrated sequence is limited. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats. © 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
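The abstract states only that INSIDER flags segments whose k-mer signature shifts significantly; it does not give the statistic, the value of k, or the window size. As a rough illustration of that general idea, and not the authors' implementation, the Python sketch below compares the k-mer frequency profile of each sliding window with the genome-wide profile and flags windows whose distance is an outlier. The function names, the Euclidean distance, the z-score cutoff, and all default parameters are assumptions chosen for the example.

```python
import math
import random
from collections import Counter
from itertools import product


def kmer_profile(seq, k):
    """Normalised frequencies of all 4**k DNA k-mers in seq (non-ACGT k-mers ignored)."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    total = sum(counts[m] for m in kmers)
    return {m: (counts[m] / total if total else 0.0) for m in kmers}


def euclidean(p, q):
    """Euclidean distance between two profiles that share the same keys."""
    return math.sqrt(sum((p[m] - q[m]) ** 2 for m in p))


def window_scores(genome, k=3, window=500, step=250):
    """Distance of each window's k-mer profile from the whole-genome profile.

    k, window and step are illustrative defaults, not values from the paper.
    """
    background = kmer_profile(genome, k)
    scores = []
    for start in range(0, len(genome) - window + 1, step):
        profile = kmer_profile(genome[start:start + window], k)
        scores.append((start, euclidean(profile, background)))
    return scores


def flag_outliers(scores, z_cutoff=3.0):
    """Start positions of windows whose score lies z_cutoff SDs above the mean.

    A plain z-score is an assumption here; the abstract does not say how
    significance is assessed.
    """
    if not scores:
        return []
    values = [s for _, s in scores]
    mean = sum(values) / len(values)
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    if sd == 0:
        return []
    return [start for start, s in scores if (s - mean) / sd > z_cutoff]


def make_demo_genome(seed=0):
    """Hypothetical toy data: an AT-rich 20 kb host with a GC-rich 1 kb insert at 10,000."""
    rng = random.Random(seed)
    host = "".join(rng.choices("ACGT", weights=[35, 15, 15, 35], k=20000))
    insert = "".join(rng.choices("ACGT", weights=[10, 40, 40, 10], k=1000))
    return host[:10000] + insert + host[10000:]


if __name__ == "__main__":
    print(flag_outliers(window_scores(make_demo_genome())))
```

On the toy genome the flagged window starts should fall within the inserted region. Real foreign DNA usually differs from its host far less sharply than this contrived composition bias, so a practical detector would need a longer k, a more robust outlier test, and read-level rather than assembled input.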
Pages: 3810-3816
Page count: 7