Haplotype-aware diplotyping from noisy long reads

被引:35
|
作者
Ebler, Jana [1 ,2 ,3 ]
Haukness, Marina [4 ]
Pesout, Trevor [4 ]
Marschall, Tobias [1 ,2 ]
Paten, Benedict [4 ]
机构
[1] Saarland Univ, Ctr Bioinformat, Saarland Informat Campus E2-1, D-66123 Saarbrucken, Germany
[2] Max Planck Inst Informat, Saarland Informat Campus E1-4, Saarbrucken, Germany
[3] Saarland Univ, Grad Sch Comp Sci, Saarland Informat Campus E1-3, Saarbrucken, Germany
[4] Univ Calif Santa Cruz, UC Santa Cruz Genom Inst, Santa Cruz, CA 95064 USA
基金
美国国家卫生研究院;
关键词
Computational genomics; Long reads; Genotyping; Phasing; Haplotypes; Diplotypes; HUMAN GENOME; ACCURATE; METHYLATION; COMPLEXITY; EFFICIENT;
D O I
10.1186/s13059-019-1709-0
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Current genotyping approaches for single-nucleotide variations rely on short, accurate reads from second-generation sequencing devices. Presently, third-generation sequencing platforms are rapidly becoming more widespread, yet approaches for leveraging their long but error-prone reads for genotyping are lacking. Here, we introduce a novel statistical framework for the joint inference of haplotypes and genotypes from noisy long reads, which we term diplotyping. Our technique takes full advantage of linkage information provided by long reads. We validate hundreds of thousands of candidate variants that have not yet been included in the high-confidence reference set of the Genome-in-a-Bottle effort.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Detection of structural variation and haplotype-aware genome assembly through Strand-Seq
    Sanders, Ashley D.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 16 - 16
  • [32] A haplotype-aware de novo assembly of related individuals using pedigree sequence graph
    Garg, Shilpa
    Aach, John
    Li, Heng
    Sebenius, Isaac
    Durbin, Richard
    Church, George
    BIOINFORMATICS, 2020, 36 (08) : 2385 - 2392
  • [33] NanoCaller for accurate detection of SNPs and indels in difficult-to-map regions from long-read sequencing by haplotype-aware deep neural networks
    Ahsan, Mian Umair
    Liu, Qian
    Fang, Li
    Wang, Kai
    GENOME BIOLOGY, 2021, 22 (01)
  • [34] Haplotype-aware modeling of cis-regulatory effects highlights the gaps remaining in eQTL data
    Nava Ehsan
    Bence M. Kotis
    Stephane E. Castel
    Eric J. Song
    Nicholas Mancuso
    Pejman Mohammadi
    Nature Communications, 15
  • [35] HapCol: accurate and memory-efficient haplotype assembly from long reads
    Pirola, Yuri
    Zaccaria, Simone
    Dondi, Riccardo
    Klau, Gunnar W.
    Pisanti, Nadia
    Bonizzoni, Paola
    BIOINFORMATICS, 2016, 32 (11) : 1610 - 1617
  • [36] Repeat and haplotype aware error correction in nanopore sequencing reads with DeChat
    Liu, Yuansheng
    Li, Yichen
    Chen, Enlian
    Xu, Jialu
    Zhang, Wenhai
    Zeng, Xiangxiang
    Luo, Xiao
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)
  • [37] Haplotype-aware modeling of cis-regulatory effects highlights the gaps remaining in eQTL data
    Ehsan, Nava
    Kotis, Bence M.
    Castel, Stephane E.
    Song, Eric J.
    Mancuso, Nicholas
    Mohammadi, Pejman
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [38] NanoSNP: a progressive and haplotype-aware SNP caller on low-coverage nanopore sequencing data
    Huang, Neng
    Xu, Minghua
    Nie, Fan
    Ni, Peng
    Xiao, Chuan-Le
    Luo, Feng
    Wang, Jianxin
    BIOINFORMATICS, 2023, 39 (01)
  • [39] HaploDMF: viral haplotype reconstruction from long reads via deep matrix factorization
    Cai, Dehan
    Shang, Jiayu
    Sun, Yanni
    BIOINFORMATICS, 2022, 38 (24) : 5360 - 5367
  • [40] Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
    Stapleton, James A.
    Kim, Jeongwoon
    Hamilton, John P.
    Wu, Ming
    Irber, Luiz C.
    Maddamsetti, Rohan
    Briney, Bryan
    Newton, Linsey
    Burton, Dennis R.
    Brown, C. Titus
    Chan, Christina
    Buell, C. Robin
    Whitehead, Timothy A.
    PLOS ONE, 2016, 11 (01):