SimpleMetaPipeline: Breaking the bioinformatics bottleneck in metabarcoding

被引:1
|
作者
Williams, Jake [1 ,2 ]
Pettorelli, Nathalie [2 ]
Dowell, Rosalie [1 ,2 ]
Macdonald, Kenneth [3 ]
Meyer, Christopher [3 ]
Steyaert, Margaux [1 ,2 ]
Tweedt, Sarah [3 ]
Ransome, Emma [1 ]
机构
[1] Imperial Coll London, Dept Life Sci, Ascot, England
[2] Zool Soc London, Inst Zool, London, England
[3] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2024年 / 15卷 / 11期
基金
英国自然环境研究理事会;
关键词
amplicon sequence variants; bioinformatics pipeline; eDNA; metabarcoding; next-generation sequencing; R;
D O I
10.1111/2041-210X.14434
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
<p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px"> The democratisation of next-generation sequencing has vastly increased the availability of sequencing data from metabarcoding. However, to effectively prepare these metabarcoding data for subsequent analysis, researchers must consistently apply several different bioinformatic tools-including those which denoise reads, cluster sequences and assign taxonomic identities. This often creates a bioinformatics bottleneck in workflows for non-specialists due to obstacles around: (a) integrating different tools, (b) the inability to easily modify and rerun bioinformatic pipelines involving non-scripted ('point-and-click') elements and (c) the multiple outputs that may be required of a single dataset (e.g. amplicon sequence variants [ASVs] and operational taxonomic units [OTUs]), which often results in users running pipelines multiple times. Here, we introduce SimpleMetaPipeline, an open-source bioinformatics pipeline implemented in R, which addresses these obstacles. SimpleMetaPipeline integrates the most robust and commonly used existing bioinformatic tools in a single reproducible pipeline, with a streamlined choice of parameters, to generate a sequence data table containing alternative clustering and assignment options. SimpleMetaPipeline accepts demultiplexed paired-end and single reads from multiple sequencing runs. We describe the pipeline and demonstrate how alternative annotations enable the easy implementation of multi-algorithm agreement tests to strengthen inferences. SimpleMetaPipeline represents a valuable addition to the existing library of pipelines, providing easy and reproducible bioinformatics, including a range of commonly desired clustering and assignment options, such as OTUs and ASVs. <p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px">
引用
收藏
页码:1949 / 1957
页数:9
相关论文
共 50 条
  • [21] SOFTWARE-DESIGN - BREAKING THE BOTTLENECK
    GRAHAM, AK
    IEEE SPECTRUM, 1982, 19 (03) : 43 - 50
  • [22] BREAKING THE POWER-FACTOR BOTTLENECK
    COLE, BC
    ELECTRONICS, 1989, 62 (07): : 83 - 84
  • [23] BREAKING THE SYSTEMS-DEVELOPMENT BOTTLENECK
    GREMILLION, LL
    PYBURN, P
    HARVARD BUSINESS REVIEW, 1983, 61 (02) : 130 - 137
  • [24] Mixtape: Breaking the Softmax Bottleneck Efficiently
    Yang, Zhilin
    Thang Luong
    Salakhutdinov, Ruslan
    Quoc Le
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Breaking the Bottleneck in the Protein Biomarker Pipeline
    Witkowska, H. Ewa
    Hall, Steven C.
    Fisher, Susan J.
    CLINICAL CHEMISTRY, 2012, 58 (02) : 321 - 323
  • [26] Breaking the Sample-Prep Bottleneck
    Micic, Miodrag
    Whyte, Jeffrey D.
    GENETIC ENGINEERING & BIOTECHNOLOGY NEWS, 2012, 32 (02): : 31 - +
  • [27] Solving the Bottleneck Problem in Bioinformatics Computing: An Architectural Perspective
    Feng Lin
    Heiko Schröder
    Bertil Schmidt
    The Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, 2007, 48 : 185 - 188
  • [28] Breaking the memory bottleneck with an optical data path
    Fritts, JE
    Chamberlain, RD
    35TH ANNUAL SIMULATION SYMPOSIUM, PROCEEDINGS, 2002, : 352 - 362
  • [29] Breaking the Synchronization Bottleneck with Reconfigurable Transactional Execution
    Li, Zhaoshi
    Liu, Leibo
    Deng, Yangdong
    Yin, Shouyi
    Wei, Shaojun
    IEEE COMPUTER ARCHITECTURE LETTERS, 2018, 17 (02) : 147 - 150
  • [30] Solving the bottleneck problem in bioinformatics computing:: An architectural perspective
    Lin, Feng
    Schroeder, Heiko
    Schmidt, Bertil
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2007, 48 (03): : 185 - 188