Utility of long-read sequencing for All of Us

被引:0
|
作者
M. Mahmoud
Y. Huang
K. Garimella
P. A. Audano
W. Wan
N. Prasad
R. E. Handsaker
S. Hall
A. Pionzio
M. C. Schatz
M. E. Talkowski
E. E. Eichler
S. E. Levy
F. J. Sedlazeck
机构
[1] Human Genome Sequencing Center,Department of Molecular and Human Genetics
[2] Baylor College of Medicine,Data Sciences Platform
[3] Baylor College of Medicine,Department of Genetics
[4] Broad Institute of MIT and Harvard,Program in Medical and Population Genetics
[5] The Jackson Laboratory for Genomic Medicine,Department of Computer Science
[6] Discovery Life Sciences,Center for Genomic Medicine
[7] Harvard Medical School,Department of Genome Sciences
[8] Broad Institute of MIT and Harvard,Howard Hughes Medical Institute
[9] Johns Hopkins University,Department of Computer Science
[10] Massachusetts General Hospital,undefined
[11] University of Washington School of Medicine,undefined
[12] University of Washington,undefined
[13] HudsonAlpha Institute for Biotechnology,undefined
[14] Rice University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The All of Us (AoU) initiative aims to sequence the genomes of over one million Americans from diverse ethnic backgrounds to improve personalized medical care. In a recent technical pilot, we compare the performance of traditional short-read sequencing with long-read sequencing in a small cohort of samples from the HapMap project and two AoU control samples representing eight datasets. Our analysis reveals substantial differences in the ability of these technologies to accurately sequence complex medically relevant genes, particularly in terms of gene coverage and pathogenic variant identification. We also consider the advantages and challenges of using low coverage sequencing to increase sample numbers in large cohort analysis. Our results show that HiFi reads produce the most accurate results for both small and large variants. Further, we present a cloud-based pipeline to optimize SNV, indel and SV calling at scale for long-reads analysis. These results lead to widespread improvements across AoU.
引用
收藏
相关论文
共 50 条
  • [21] Long-read sequencing in the era of epigenomics and epitranscriptomics
    Lucas, Morghan C.
    Novoa, Eva Maria
    NATURE METHODS, 2023, 20 (01) : 25 - 29
  • [22] Long-Read Sequencing Emerging in Medical Genetics
    Mantere, Tuomo
    Kersten, Simone
    Hoischen, Alexander
    FRONTIERS IN GENETICS, 2019, 10
  • [23] Applications of long-read sequencing in clinical Neurology
    Mitsuhashi, Satomi
    Tachikawa, Keiji
    Imai, Takeshi
    Isahaya, Kenji
    Shimizu, Takahiro
    Yamano, Yoshihisa
    Frith, Martin C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 614 - 614
  • [24] Applications of long-read sequencing to Mendelian genetics
    Mastrorosa, Francesco Kumara
    Miller, Danny E.
    Eichler, Evan E.
    GENOME MEDICINE, 2023, 15 (01)
  • [25] Long-read sequencing in the era of epigenomics and epitranscriptomics
    Morghan C. Lucas
    Eva Maria Novoa
    Nature Methods, 2023, 20 : 25 - 29
  • [26] Long-read sequencing of new Drosophila genomes
    Koch L.
    Nature Reviews Genetics, 2021, 22 (10) : 625 - 625
  • [27] Tandem repeats in the long-read sequencing era
    不详
    NATURE REVIEWS GENETICS, 2024, 25 (07) : 449 - 449
  • [28] LONG-READ SEQUENCING FOR THE METAGENOMIC ANALYSIS OF MICROBIOMES
    Free, Tristan
    BIOTECHNIQUES, 2023, 74 (04) : 153 - 155
  • [29] CRISPR and Long-Read Sequencing: A Perfect Match
    Ameur, Adam
    CRISPR JOURNAL, 2020, 3 (06): : 425 - 427
  • [30] Long-read sequencing data analysis for yeasts
    Jia-Xing Yue
    Gianni Liti
    Nature Protocols, 2018, 13 : 1213 - 1231