A comparison of short-read, HiFi long-read, and hybrid strategies for genome-resolved metagenomics

Cited by: 8
Authors
Eisenhofer, Raphael [1]
Nesme, Joseph [2]
Santos-Bay, Luisa [1]
Koziol, Adam [1]
Sorensen, Soren Johannes [2]
Alberdi, Antton [1]
Aizpurua, Ostaizka [1]
Affiliations
[1] Univ Copenhagen, Globe Inst, Ctr Evolutionary Hologen, Copenhagen, Denmark
[2] Univ Copenhagen, Dept Biol, Sect Microbiol, Copenhagen, Denmark
Funding
National Research Foundation, Singapore;
Keywords
microbiology; metagenomics; long read; mice; gut microbiome; microbiome; MICROBIAL GENOMES; BACTERIA;
DOI
10.1128/spectrum.03590-23
Chinese Library Classification number
Q93 [Microbiology];
Subject classification codes
071005; 100705;
Abstract
Shotgun metagenomics enables the reconstruction of complex microbial communities at a high level of detail. This approach can use short-read sequencing data, long-read sequencing data, or a combination of the two. To assess the pros and cons of these approaches, we used 22 fecal DNA extracts collected weekly for 11 weeks from two lab mice to study seven performance metrics over four combinations of sequencing depth and technology: (i) 20 Gbp of Illumina short-read data, (ii) 40 Gbp of short-read data, (iii) 20 Gbp of PacBio HiFi long-read data, and (iv) 40 Gbp of hybrid (20 Gbp of short-read + 20 Gbp of long-read) data. No strategy was best for all metrics; instead, each excelled on different metrics. The long-read approach yielded the best assembly statistics, with the highest N50 and lowest number of contigs. The 40 Gbp short-read approach yielded the highest number of refined bins. Finally, the hybrid approach yielded the longest assemblies and the highest mapping rate to the bacterial genomes. Our results suggest that while long-read sequencing significantly improves the quality of reconstructed bacterial genomes, it is more expensive and requires deeper sequencing than short-read approaches to recover a comparable number of reconstructed genomes. The optimal strategy is study-specific and depends on how researchers assess the trade-off between the quantity and quality of recovered genomes.
IMPORTANCE
Mice are an important model organism for understanding the gut microbiome. When studying these gut microbiomes using DNA techniques, researchers can choose from technologies that use short or long DNA reads. In this study, we perform an extensive benchmark between short- and long-read DNA sequencing for studying mouse gut microbiomes. We find that no one approach was best for all metrics and provide information that can help guide researchers in planning their experiments.
Pages: 14