Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank

被引:44
|
作者
Hofmeister, Robin J. [1 ]
Ribeiro, Diogo M. [1 ]
Rubinacci, Simone [1 ]
Delaneau, Olivier [1 ]
机构
[1] Univ Lausanne, Dept Computat Biol, Lausanne, Switzerland
基金
瑞士国家科学基金会;
关键词
LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; WIDE ASSOCIATION;
D O I
10.1038/s41588-023-01415-w
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
SHAPEIT5, a phasing method that accurately processes large sequencing datasets, was applied on the UK Biobank whole-genome and whole-exome sequencing data to generate reference panels of haplotypes that boost imputation accuracy and enable the detection of compound heterozygous loss-of-function events for 549 genes. Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with low switch error rates of below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, which is even more pronounced when phased with SHAPEIT5 compared with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome.
引用
收藏
页码:1243 / +
页数:23
相关论文
共 50 条
  • [1] Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank
    Robin J. Hofmeister
    Diogo M. Ribeiro
    Simone Rubinacci
    Olivier Delaneau
    Nature Genetics, 2023, 55 : 1243 - 1249
  • [2] Quantifying tumor heterogeneity in whole-genome and whole-exome sequencing data
    Oesper, Layla
    Satas, Gryte
    Raphael, Benjamin J.
    BIOINFORMATICS, 2014, 30 (24) : 3532 - 3540
  • [3] Whole-genome and whole-exome sequencing in neurological diseases
    Foo, Jia-Nee
    Liu, Jian-Jun
    Tan, Eng-King
    NATURE REVIEWS NEUROLOGY, 2012, 8 (09) : 508 - 517
  • [4] Whole-genome and whole-exome sequencing in neurological diseases
    Jia-Nee Foo
    Jian-Jun Liu
    Eng-King Tan
    Nature Reviews Neurology, 2012, 8 : 508 - 517
  • [5] Whole-genome sequencing of the UK Biobank
    Halldorsson, Bjarni, V
    Stefansson, Kari
    NATURE, 2022,
  • [6] Rare variant associations and novel loci for eosinophil count - a UK Biobank whole-exome sequencing study
    Hoglund, Julia
    Hadizadeh, Fatemeh
    Ek, Weronica
    Karlsson, Torgny
    Johansson, Asa
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 695 - 696
  • [7] Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression
    Ruoyu Tian
    Tian Ge
    Hyeokmoon Kweon
    Daniel B. Rocha
    Max Lam
    Jimmy Z. Liu
    Kritika Singh
    Daniel F. Levey
    Joel Gelernter
    Murray B. Stein
    Ellen A. Tsai
    Hailiang Huang
    Christopher F. Chabris
    Todd Lencz
    Heiko Runz
    Chia-Yen Chen
    Nature Communications, 15
  • [8] Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression
    Tian, Ruoyu
    Ge, Tian
    Kweon, Hyeokmoon
    Rocha, Daniel B.
    Lam, Max
    Liu, Jimmy Z.
    Singh, Kritika
    Levey, Daniel F.
    Gelernter, Joel
    Stein, Murray B.
    Tsai, Ellen A.
    Huang, Hailiang
    Chabris, Christopher F.
    Lencz, Todd
    Runz, Heiko
    Chen, Chia-Yen
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [9] Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants
    Belkadi, Aziz
    Bolze, Alexandre
    Itan, Yuval
    Cobat, Aurelie
    Vincent, Quentin B.
    Antipenko, Alexander
    Shang, Lei
    Boisson, Bertrand
    Casanova, Jean-Laurent
    Abel, Laurent
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (17) : 5473 - 5478
  • [10] Advantages and Perils of Clinical Whole-Exome and Whole-Genome Sequencing in Cardiomyopathy
    Francesco Mazzarotto
    Iacopo Olivotto
    Roddy Walsh
    Cardiovascular Drugs and Therapy, 2020, 34 : 241 - 253