SimpleMetaPipeline: Breaking the bioinformatics bottleneck in metabarcoding

Cited by: 1

Authors:

Williams, Jake ^{[1,2]}

Pettorelli, Nathalie ^{[2]}

Dowell, Rosalie ^{[1,2]}

Macdonald, Kenneth ^{[3]}

Meyer, Christopher ^{[3]}

Steyaert, Margaux ^{[1,2]}

Tweedt, Sarah ^{[3]}

Ransome, Emma ^{[1]}

Affiliations:

[1] Imperial Coll London, Dept Life Sci, Ascot, England

[2] Zool Soc London, Inst Zool, London, England

[3] Smithsonian Inst, Natl Museum Nat Hist, Washington, DC USA

Source:

METHODS IN ECOLOGY AND EVOLUTION | 2024 / Volume 15 / Issue 11

Funding:

UK Natural Environment Research Council (NERC);

Keywords:

amplicon sequence variants; bioinformatics pipeline; eDNA; metabarcoding; next-generation sequencing; R;

DOI:

10.1111/2041-210X.14434

Chinese Library Classification number:

Q14 [Ecology (bioecology)];

Discipline classification code:

071012 ; 0713 ;

Abstract:

The democratisation of next-generation sequencing has vastly increased the availability of sequencing data from metabarcoding. However, to effectively prepare these metabarcoding data for subsequent analysis, researchers must consistently apply several different bioinformatic tools, including those which denoise reads, cluster sequences and assign taxonomic identities. This often creates a bioinformatics bottleneck in workflows for non-specialists due to obstacles around: (a) integrating different tools, (b) the inability to easily modify and rerun bioinformatic pipelines involving non-scripted ('point-and-click') elements and (c) the multiple outputs that may be required of a single dataset (e.g. amplicon sequence variants [ASVs] and operational taxonomic units [OTUs]), which often results in users running pipelines multiple times. Here, we introduce SimpleMetaPipeline, an open-source bioinformatics pipeline implemented in R, which addresses these obstacles. SimpleMetaPipeline integrates the most robust and commonly used existing bioinformatic tools in a single reproducible pipeline, with a streamlined choice of parameters, to generate a sequence data table containing alternative clustering and assignment options. SimpleMetaPipeline accepts demultiplexed paired-end and single reads from multiple sequencing runs. We describe the pipeline and demonstrate how alternative annotations enable the easy implementation of multi-algorithm agreement tests to strengthen inferences. SimpleMetaPipeline represents a valuable addition to the existing library of pipelines, providing easy and reproducible bioinformatics, including a range of commonly desired clustering and assignment options, such as OTUs and ASVs.

Citation

Pages: 1949-1957

Page count: 9
