SimpleMetaPipeline: Breaking the bioinformatics bottleneck in metabarcoding

被引:1
|
作者
Williams, Jake [1 ,2 ]
Pettorelli, Nathalie [2 ]
Dowell, Rosalie [1 ,2 ]
Macdonald, Kenneth [3 ]
Meyer, Christopher [3 ]
Steyaert, Margaux [1 ,2 ]
Tweedt, Sarah [3 ]
Ransome, Emma [1 ]
机构
[1] Imperial Coll London, Dept Life Sci, Ascot, England
[2] Zool Soc London, Inst Zool, London, England
[3] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2024年 / 15卷 / 11期
基金
英国自然环境研究理事会;
关键词
amplicon sequence variants; bioinformatics pipeline; eDNA; metabarcoding; next-generation sequencing; R;
D O I
10.1111/2041-210X.14434
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
<p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px"> The democratisation of next-generation sequencing has vastly increased the availability of sequencing data from metabarcoding. However, to effectively prepare these metabarcoding data for subsequent analysis, researchers must consistently apply several different bioinformatic tools-including those which denoise reads, cluster sequences and assign taxonomic identities. This often creates a bioinformatics bottleneck in workflows for non-specialists due to obstacles around: (a) integrating different tools, (b) the inability to easily modify and rerun bioinformatic pipelines involving non-scripted ('point-and-click') elements and (c) the multiple outputs that may be required of a single dataset (e.g. amplicon sequence variants [ASVs] and operational taxonomic units [OTUs]), which often results in users running pipelines multiple times. Here, we introduce SimpleMetaPipeline, an open-source bioinformatics pipeline implemented in R, which addresses these obstacles. SimpleMetaPipeline integrates the most robust and commonly used existing bioinformatic tools in a single reproducible pipeline, with a streamlined choice of parameters, to generate a sequence data table containing alternative clustering and assignment options. SimpleMetaPipeline accepts demultiplexed paired-end and single reads from multiple sequencing runs. We describe the pipeline and demonstrate how alternative annotations enable the easy implementation of multi-algorithm agreement tests to strengthen inferences. SimpleMetaPipeline represents a valuable addition to the existing library of pipelines, providing easy and reproducible bioinformatics, including a range of commonly desired clustering and assignment options, such as OTUs and ASVs. <p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px">
引用
收藏
页码:1949 / 1957
页数:9
相关论文
共 50 条