Utility of long-read sequencing for All of Us

Cited by: 16
Authors
Mahmoud, M. [1, 2]
Huang, Y. [3]
Garimella, K. [3]
Audano, P. A. [4]
Wan, W. [3]
Prasad, N. [5]
Handsaker, R. E. [6, 7]
Hall, S. [5]
Pionzio, A. [5]
Schatz, M. C. [8]
Talkowski, M. E. [7, 9]
Eichler, E. E. [10, 11]
Levy, S. E. [12]
Sedlazeck, F. J. [1, 2, 13]
Affiliations
[1] Baylor Coll Med, Human Genome Sequencing Ctr, Houston, TX 77030 USA
[2] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[3] Broad Inst MIT & Harvard, Data Sci Platform, Cambridge, MA 02141 USA
[4] Jackson Lab Genom Med, Farmington, CT 06032 USA
[5] Discovery Life Sci, Huntsville, AL 35806 USA
[6] Harvard Med Sch, Dept Genet, Boston, MA USA
[7] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02141 USA
[8] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD USA
[9] Massachusetts Gen Hosp, Ctr Genom Med, Boston, MA USA
[10] Univ Washington, Sch Med, Dept Genome Sci, Seattle, WA USA
[11] Univ Washington, Howard Hughes Med Inst, Seattle, WA USA
[12] HudsonAlpha Inst Biotechnol, Huntsville, AL 35806 USA
[13] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
Funding
National Institutes of Health (USA)
Keywords
MISSING HERITABILITY; DISEASES; GENOME;
DOI
10.1038/s41467-024-44804-3
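Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract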
The All of Us (AoU) initiative aims to sequence the genomes of over one million Americans from diverse ethnic backgrounds to improve personalized medical care. In a recent technical pilot, we compare the performance of traditional short-read sequencing with long-read sequencing in a small cohort of samples from the HapMap project and two AoU control samples representing eight datasets. Our analysis reveals substantial differences in the ability of these technologies to accurately sequence complex medically relevant genes, particularly in terms of gene coverage and pathogenic variant identification. We also consider the advantages and challenges of using low-coverage sequencing to increase sample numbers in large cohort analysis. Our results show that HiFi reads produce the most accurate results for both small and large variants. Further, we present a cloud-based pipeline to optimize SNV, indel and SV calling at scale for long-read analysis. These results lead to widespread improvements across AoU.
Editor's summary: Using All of Us pilot data, the authors compared short- and long-read performance across medically relevant genes and showcased the utility of long reads to improve variant detection and phasing in easy- and hard-to-resolve medically relevant genes.
Pages: 13