BacWGSpipe: A Snakemake Workflow for a Complete Analysis of Bacterial Whole-Genome Sequencing Data

Authors
Wang, Weixin [1]
Li, Xiangcheng [1]
Lu, Yewei [1]
Affiliations
[1] Key Lab Precis Med Diag & Monitoring Res Zhejiang, Hangzhou, Peoples R China
Keywords
bacteria; bioinformatics; genomics; pipeline; CLASSIFICATION; VIRULENCE; TOOL;
DOI
10.1109/ICBCB57893.2023.10246579
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Whole-genome sequencing (WGS) provides a comprehensive view of the bacterial genome, enabling the identification of genetic determinants associated with antibiotic resistance, virulence, and other key clinical traits. Translating WGS data into clinical practice requires a diverse collection of bioinformatics tools. Using these analysis tools effectively in a scalable and reproducible way can be challenging, especially for non-experts. We have developed BacWGSpipe, an automated, scalable, reproducible, and open-source framework for bacterial genomics using WGS data from the Illumina, PacBio, and Nanopore platforms. BacWGSpipe combines state-of-the-art tools to carry genomic analysis from raw sequencing data through quality control, de novo genome assembly, genotyping, gene annotation and functional analysis, and profiling of antimicrobial resistance (AMR), virulence, and mobile genetic elements, in addition to pangenome analysis, phylogenetic reconstruction, and single-nucleotide polymorphism (SNP) variant calling. Once the analysis is finished, BacWGSpipe generates an interactive, web-like HTML report. Because it is built on Snakemake and Conda, BacWGSpipe can be installed easily in any computing environment.
Pages: 26-31
Page count: 6