共 50 条
SimpleMetaPipeline: Breaking the bioinformatics bottleneck in metabarcoding
被引:1
|作者:
Williams, Jake
[1
,2
]
Pettorelli, Nathalie
[2
]
Dowell, Rosalie
[1
,2
]
Macdonald, Kenneth
[3
]
Meyer, Christopher
[3
]
Steyaert, Margaux
[1
,2
]
Tweedt, Sarah
[3
]
Ransome, Emma
[1
]
机构:
[1] Imperial Coll London, Dept Life Sci, Ascot, England
[2] Zool Soc London, Inst Zool, London, England
[3] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC USA
来源:
基金:
英国自然环境研究理事会;
关键词:
amplicon sequence variants;
bioinformatics pipeline;
eDNA;
metabarcoding;
next-generation sequencing;
R;
D O I:
10.1111/2041-210X.14434
中图分类号:
Q14 [生态学(生物生态学)];
学科分类号:
071012 ;
0713 ;
摘要:
<p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px"> The democratisation of next-generation sequencing has vastly increased the availability of sequencing data from metabarcoding. However, to effectively prepare these metabarcoding data for subsequent analysis, researchers must consistently apply several different bioinformatic tools-including those which denoise reads, cluster sequences and assign taxonomic identities. This often creates a bioinformatics bottleneck in workflows for non-specialists due to obstacles around: (a) integrating different tools, (b) the inability to easily modify and rerun bioinformatic pipelines involving non-scripted ('point-and-click') elements and (c) the multiple outputs that may be required of a single dataset (e.g. amplicon sequence variants [ASVs] and operational taxonomic units [OTUs]), which often results in users running pipelines multiple times. Here, we introduce SimpleMetaPipeline, an open-source bioinformatics pipeline implemented in R, which addresses these obstacles. SimpleMetaPipeline integrates the most robust and commonly used existing bioinformatic tools in a single reproducible pipeline, with a streamlined choice of parameters, to generate a sequence data table containing alternative clustering and assignment options. SimpleMetaPipeline accepts demultiplexed paired-end and single reads from multiple sequencing runs. We describe the pipeline and demonstrate how alternative annotations enable the easy implementation of multi-algorithm agreement tests to strengthen inferences. SimpleMetaPipeline represents a valuable addition to the existing library of pipelines, providing easy and reproducible bioinformatics, including a range of commonly desired clustering and assignment options, such as OTUs and ASVs. <p style="border:0px; display:block; height:0px; left:-9999px; margin-bottom:0px; margin-left:0px; margin-right:0px; margin-top:0px; opacity:0; overflow:hidden; padding:0px; position:absolute; top:0px; width:0px">
引用
收藏
页码:1949 / 1957
页数:9
相关论文