SimpleMetaPipeline: Breaking the bioinformatics bottleneck in metabarcoding

被引:1
|
作者
Williams, Jake [1 ,2 ]
Pettorelli, Nathalie [2 ]
Dowell, Rosalie [1 ,2 ]
Macdonald, Kenneth [3 ]
Meyer, Christopher [3 ]
Steyaert, Margaux [1 ,2 ]
Tweedt, Sarah [3 ]
Ransome, Emma [1 ]
机构
[1] Imperial Coll London, Dept Life Sci, Ascot, England
[2] Zool Soc London, Inst Zool, London, England
[3] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2024年 / 15卷 / 11期
基金
英国自然环境研究理事会;
关键词
amplicon sequence variants; bioinformatics pipeline; eDNA; metabarcoding; next-generation sequencing; R;
D O I
10.1111/2041-210X.14434
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
<p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px"> The democratisation of next-generation sequencing has vastly increased the availability of sequencing data from metabarcoding. However, to effectively prepare these metabarcoding data for subsequent analysis, researchers must consistently apply several different bioinformatic tools-including those which denoise reads, cluster sequences and assign taxonomic identities. This often creates a bioinformatics bottleneck in workflows for non-specialists due to obstacles around: (a) integrating different tools, (b) the inability to easily modify and rerun bioinformatic pipelines involving non-scripted ('point-and-click') elements and (c) the multiple outputs that may be required of a single dataset (e.g. amplicon sequence variants [ASVs] and operational taxonomic units [OTUs]), which often results in users running pipelines multiple times. Here, we introduce SimpleMetaPipeline, an open-source bioinformatics pipeline implemented in R, which addresses these obstacles. SimpleMetaPipeline integrates the most robust and commonly used existing bioinformatic tools in a single reproducible pipeline, with a streamlined choice of parameters, to generate a sequence data table containing alternative clustering and assignment options. SimpleMetaPipeline accepts demultiplexed paired-end and single reads from multiple sequencing runs. We describe the pipeline and demonstrate how alternative annotations enable the easy implementation of multi-algorithm agreement tests to strengthen inferences. SimpleMetaPipeline represents a valuable addition to the existing library of pipelines, providing easy and reproducible bioinformatics, including a range of commonly desired clustering and assignment options, such as OTUs and ASVs. <p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px">
引用
收藏
页码:1949 / 1957
页数:9
相关论文
共 50 条
  • [31] BREAKING THE BOTTLENECK TO REAL-TIME VIDEO
    不详
    ELECTRONICS, 1993, 66 (20): : 10 - 10
  • [32] Nanomaterials: Breaking through the bottleneck of tumor immunotherapy
    Kang, Yue
    Li, Shenglong
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 230
  • [33] Breaking the Expression Bottleneck of Graph Neural Networks
    Yang, Mingqi
    Wang, Renjian
    Shen, Yanming
    Qi, Heng
    Yin, Baocai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5652 - 5664
  • [34] Breaking the data bottleneck with speedy optical interconnects
    Zhu, SX
    PHOTONICS SPECTRA, 1996, 30 (07) : 95 - &
  • [35] Breaking the bottleneck in plastic-bottle recycling
    不详
    PLASTICS ENGINEERING, 2000, 56 (12) : 24 - 24
  • [36] THE GUI TEST BUILDER BREAKING THE TESTING BOTTLENECK
    BRADLEY, NS
    COMPUTER DESIGN, 1993, 32 (09): : 71 - &
  • [37] Silicon to package integration - breaking the packaging bottleneck
    DiOrio, M
    SOLID STATE TECHNOLOGY, 2002, 45 (12) : S5 - S5
  • [38] Breaking the new bottleneck: Our way into robotics
    Harre, M
    Neh, H
    Schulz, C
    Tilstam, U
    Wessa, T
    Weinmann, H
    ORGANIC PROCESS RESEARCH & DEVELOPMENT, 2001, 5 (03) : 335 - 339
  • [39] A pile of pipelines: An overview of the bioinformatics software for metabarcoding data analyses
    Hakimzadeh, Ali
    Abdala Asbun, Alejandro
    Albanese, Davide
    Bernard, Maria
    Buchner, Dominik
    Callahan, Benjamin
    Caporaso, J. Gregory
    Curd, Emily
    Djemiel, Christophe
    Brandstrom Durling, Mikael
    Elbrecht, Vasco
    Gold, Zachary
    Gweon, Hyun S.
    Hajibabaei, Mehrdad
    Hildebrand, Falk
    Mikryukov, Vladimir
    Normandeau, Eric
    Ozkurt, Ezgi
    M. Palmer, Jonathan
    Pascal, Geraldine
    Porter, Teresita M.
    Straub, Daniel
    Vasar, Martti
    Vetrovsky, Tomas
    Zafeiropoulos, Haris
    Anslan, Sten
    MOLECULAR ECOLOGY RESOURCES, 2024, 24 (05)
  • [40] Want to track pandemic variants faster? Fix the bioinformatics bottleneck
    Emma B. Hodcroft
    Nicola De Maio
    Rob Lanfear
    Duncan R. MacCannell
    Bui Quang Minh
    Heiko A. Schmidt
    Alexandros Stamatakis
    Nick Goldman
    Christophe Dessimoz
    Nature, 2021, 591 : 30 - 33