Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

被引:4
|
作者
Brennan, Caitriona [1 ]
Salido, Rodolfo A. [2 ]
Belda-Ferre, Pedro [1 ]
Bryant, MacKenzie [1 ]
Cowart, Charles [1 ]
Tiu, Maria D. [3 ]
Gonzalez, Antonio [1 ]
McDonald, Daniel [1 ]
Tribelhorn, Caitlin [1 ]
Zarrinpar, Amir [3 ,4 ,5 ]
Knight, Rob [1 ,2 ,4 ,6 ]
机构
[1] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Bioengn, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Div Gastroenterol, La Jolla, CA USA
[4] Univ Calif San Diego, Ctr Microbiome Innovat, La Jolla, CA 92093 USA
[5] VA San Diego Hlth Sci, La Jolla, CA USA
[6] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
metagenomics; large-scale studies; NGS normalization; automation; multiplexing; quantification; high-throughput sequencing;
D O I
10.1128/msystems.00006-23
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology. Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCEHigh-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] High-throughput and Cost-effective Chicken Genotyping Using Next-Generation Sequencing
    Fábio Pértille
    Carlos Guerrero-Bosagna
    Vinicius Henrique da Silva
    Clarissa Boschiero
    José de Ribamar da Silva Nunes
    Mônica Corrêa Ledur
    Per Jensen
    Luiz Lehmann Coutinho
    Scientific Reports, 6
  • [22] HybSelect: high-throughput access to genomic regions of interest for targeted next-generation sequencing
    Daniel Summerer
    Nature Methods, 2009, 6 (9) : v - vi
  • [23] Evaluation of High-Throughput Next-Generation Sequencing Applied in the Pathogenic Diagnosis of Bloodstream Infections
    Fang, Yuan
    Wang, Tao
    Jin, Li
    Li, Zhi-Tao
    Zhang, Jian-Qing
    Yang, Yang
    Zeng, Zhong
    Huang, Han Fei
    JUNDISHAPUR JOURNAL OF MICROBIOLOGY, 2020, 13 (10) : 1 - 5
  • [24] High-throughput and Cost-effective Chicken Genotyping Using Next-Generation Sequencing
    Pertille, Fabio
    Guerrero-Bosagna, Carlos
    da Silva, Vinicius Henrique
    Boschiero, Clarissa
    da Silva Nunes, Jose de Ribamar
    Ledur, Monica Correa
    Jensen, Per
    Coutinho, Luiz Lehmann
    SCIENTIFIC REPORTS, 2016, 6
  • [25] A high-throughput MAC strategy for next-generation WLANs
    Kim, S
    Kim, Y
    Choi, S
    Jang, K
    Chang, JB
    SIXTH IEEE INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS MOBILE AND MULTIMEDIA NETWORKS, PROCEEDINGS, 2005, : 278 - 285
  • [26] High-throughput SNP discovery in the rabbit (Oryctolagus cuniculus) genome by next-generation semiconductor-based sequencing
    Bertolini, F.
    Schiavo, G.
    Scotti, E.
    Ribani, A.
    Martelli, P. L.
    Casadio, R.
    Fontanesi, L.
    ANIMAL GENETICS, 2014, 45 (02) : 304 - 307
  • [27] Plastic bronchitis linked to human bocavirus 1 identified through high-throughput next-generation sequencing: A case report
    Zhang, Xiumin
    Zhao, Jing
    MEDICINE, 2024, 103 (36)
  • [28] A high-throughput belowground plant diversity assay using next-generation sequencing of the trnL intron
    E. G. Lamb
    T. Winsley
    C. L. Piper
    S. A. Freidrich
    S. D. Siciliano
    Plant and Soil, 2016, 404 : 361 - 372
  • [29] Next-Generation High-Throughput Sequencing to Evaluate Bacterial Communities in Freshwater Ecosystem in Hydroelectric Reservoirs
    Rojas, Martha Virginia R.
    Alonso, Diego Peres
    Dropa, Milena
    Razzolini, Maria Tereza P.
    de Carvalho, Dario Pires
    Nabas Ribeiro, Kaio Augusto
    Ribolla, Paulo Eduardo M.
    Sallum, Maria Anice M.
    MICROORGANISMS, 2022, 10 (07)
  • [30] High-throughput targeted genotyping using next-generation sequencing applied in Coffea canephora breeding
    Alkimim, Emilly Ruas
    Caixeta, Eveline Teixeira
    Sousa, Tiago Vieira
    da Silva, Felipe Lopes
    Sakiyama, Ney Sussumu
    Zambolim, Laercio
    EUPHYTICA, 2018, 214 (03)