A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics

Cited by: 8
Authors
Eisenhofer, Raphael [1]
Nesme, Joseph [2]
Santos-Bay, Luisa [1]
Koziol, Adam [1]
Sorensen, Soren Johannes [2]
Alberdi, Antton [1]
Aizpurua, Ostaizka [1]
Affiliations
[1] Univ Copenhagen, Globe Inst, Ctr Evolutionary Hologen, Copenhagen, Denmark
[2] Univ Copenhagen, Dept Biol, Sect Microbiol, Copenhagen, Denmark
Funding
National Research Foundation, Singapore
Keywords
microbiology; metagenomics; long read; mice; gut microbiome; microbiome; microbial genomes; bacteria
DOI
10.1128/spectrum.03590-23
Chinese Library Classification (CLC) number
Q93 [Microbiology]
Discipline classification code
071005; 100705
Abstract
Shotgun metagenomics enables the reconstruction of complex microbial communities at a high level of detail. Such analyses can use short-read sequencing data, long-read sequencing data, or a combination of the two. To assess the pros and cons of these approaches, we used 22 fecal DNA extracts, collected weekly for 11 weeks from two lab mice, to study seven performance metrics over four combinations of sequencing depth and technology: (i) 20 Gbp of Illumina short-read data, (ii) 40 Gbp of short-read data, (iii) 20 Gbp of PacBio HiFi long-read data, and (iv) 40 Gbp of hybrid data (20 Gbp of short-read + 20 Gbp of long-read). No strategy was best for all metrics; instead, each one excelled on different metrics. The long-read approach yielded the best assembly statistics, with the highest N50 and the lowest number of contigs. The 40 Gbp short-read approach yielded the highest number of refined bins. Finally, the hybrid approach yielded the longest assemblies and the highest mapping rate to the bacterial genomes. Our results suggest that while long-read sequencing significantly improves the quality of reconstructed bacterial genomes, it is more expensive and requires deeper sequencing than short-read approaches to recover a comparable number of reconstructed genomes. The optimal strategy is study-specific and depends on how researchers assess the trade-off between the quantity and quality of recovered genomes.

IMPORTANCE
Mice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mouse gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments.
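The abstract ranks assemblies by N50 and contig count. As a reminder of what N50 measures, here is a minimal sketch in Python: N50 is the length of the contig at which the running sum of contig lengths, taken from longest to shortest, first reaches half of the total assembly length. The function name `n50` and the contig lengths below are illustrative assumptions, not data or code from the study.

```python
def n50(contig_lengths):
    """Return the N50 (in bp) for a list of contig lengths.

    Hypothetical helper for illustration; not taken from the paper.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")

    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum to half of the
        # assembly length defines N50.
        if running >= half_total:
            return length


# Toy assemblies of equal total length (1,500 bp); values are made up.
contiguous = [100, 200, 300, 400, 500]   # few, long contigs
fragmented = [50] * 30                   # many, short contigs

print(n50(contiguous))  # 400
print(n50(fragmented))  # 50
```

Both toy assemblies have the same total length, but the one with fewer, longer contigs has the higher N50, which is why N50 and contig count are usually reported together.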
Pages: 14