Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

被引:4
|
作者
Brennan, Caitriona [1 ]
Salido, Rodolfo A. [2 ]
Belda-Ferre, Pedro [1 ]
Bryant, MacKenzie [1 ]
Cowart, Charles [1 ]
Tiu, Maria D. [3 ]
Gonzalez, Antonio [1 ]
McDonald, Daniel [1 ]
Tribelhorn, Caitlin [1 ]
Zarrinpar, Amir [3 ,4 ,5 ]
Knight, Rob [1 ,2 ,4 ,6 ]
机构
[1] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Div Gastroenterol, La Jolla, CA USA
[4] Univ Calif San Diego, Ctr Microbiome Innovat, La Jolla, CA 92093 USA
[5] VA San Diego Hlth Sci, La Jolla, CA USA
[6] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
metagenomics; large-scale studies; NGS normalization; automation; multiplexing; quantification; high-throughput sequencing;
D O I
10.1128/msystems.00006-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology. Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCEHigh-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution (vol 8, e00006-23, 2023)
    Brennan, Caitriona
    Salido, Rodolfo A.
    Belda-Ferre, Pedro
    Bryant, Mackenzie
    Cowart, Charles
    Tiu, Maria D.
    Gonzalez, Antonio
    Mcdonald, Daniel
    Tribelhorn, Caitlin
    Zarrinpar, Amir
    Knight, Rob
    MSYSTEMS, 2025, 10 (01):
  • [2] High-Throughput Next-Generation Sequencing of Polioviruses
    Montmayeur, Anna M.
    Ng, Terry Fei Fan
    Schmidt, Alexander
    Zhao, Kun
    Magana, Laura
    Iber, Jane
    Castro, Christina J.
    Chen, Qi
    Henderson, Elizabeth
    Ramos, Edward
    Shaw, Jing
    Tatusov, Roman L.
    Dybdahl-Sissoko, Naomi
    Endegue-Zanga, Marie Claire
    Adeniji, Johnson A.
    Oberste, M. Steven
    Burns, Cara C.
    JOURNAL OF CLINICAL MICROBIOLOGY, 2017, 55 (02) : 606 - 615
  • [3] High-Throughput Microdissection for Next-Generation Sequencing
    Rosenberg, Avi Z.
    Armani, Michael D.
    Fetsch, Patricia A.
    Xi, Liqiang
    Tina Thu Pham
    Raffeld, Mark
    Chen, Yun
    O'Flaherty, Neil
    Stussman, Rebecca
    Blackler, Adele R.
    Du, Qiang
    Hanson, Jeffrey C.
    Roth, Mark J.
    Filie, Armando C.
    Roh, Michael H.
    Emmert-Buck, Michael R.
    Hipp, Jason D.
    Tangrea, Michael A.
    PLOS ONE, 2016, 11 (03):
  • [4] Rapid, high-throughput library preparation for next-generation sequencing
    Grunenwald, Haiying
    Baas, Brad
    Caruccio, Nicholas
    Syed, Fraz
    NATURE METHODS, 2010, 7 (08) : III - IV
  • [5] Rapid, high-throughput library preparation for next-generation sequencing
    Haiying Grunenwald
    Brad Baas
    Nicholas Caruccio
    Fraz Syed
    Nature Methods, 2010, 7 : iii - iv
  • [6] High-Throughput Blood Donor Genotyping Using Next-Generation Sequencing
    Nygren, Anders
    Ochsenfeld, John J.
    Tressler, Rob
    Harris, Crane
    Bixenman, Helen
    Spinosa, John
    TRANSFUSION, 2019, 59 : 116A - 117A
  • [7] High-Throughput B Cell Epitope Determination by Next-Generation Sequencing
    Walker, Lauren M.
    Shiakolas, Andrea R.
    Venkat, Rohit
    Liu, Zhaojing Ariel
    Wall, Steven
    Raju, Nagarajan
    Pilewski, Kelsey A.
    Setliff, Ian
    Murji, Amyn A.
    Gillespie, Rebecca
    Makoah, Nigel A.
    Kanekiyo, Masaru
    Connors, Mark
    Morris, Lynn
    Georgiev, Ivelin S.
    FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [8] High-resolution, high-throughput HLA genotyping by next-generation sequencing
    Bentley, G.
    Higuchi, R.
    Hoglund, B.
    Goodridge, D.
    Sayer, D.
    Trachtenberg, E. A.
    Erlich, H. A.
    TISSUE ANTIGENS, 2009, 74 (05): : 393 - 403
  • [9] DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data
    Nagasaki, Hideki
    Mochizuki, Takako
    Kodama, Yuichi
    Saruhashi, Satoshi
    Morizaki, Shota
    Sugawara, Hideaki
    Ohyanagi, Hajime
    Kurata, Nori
    Okubo, Kousaku
    Takagi, Toshihisa
    Kaminuma, Eli
    Nakamura, Yasukazu
    DNA RESEARCH, 2013, 20 (04) : 383 - 390
  • [10] HIGH-THROUGHPUT SEQUENCING USING NEXT-GENERATION SEQUENCING WITHIN 72 H.
    Sheh, Alexander
    Lebedeva, Tatiana V.
    Yu, Neng
    HUMAN IMMUNOLOGY, 2015, 76 : 10 - 10