Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank

Cited by: 44
Authors
Hofmeister, Robin J. [1]
Ribeiro, Diogo M. [1]
Rubinacci, Simone [1]
Delaneau, Olivier [1]
Affiliations
[1] Univ Lausanne, Dept Computat Biol, Lausanne, Switzerland
Funding
Swiss National Science Foundation
Keywords
LINKAGE DISEQUILIBRIUM; GENOTYPE IMPUTATION; WIDE ASSOCIATION
DOI
10.1038/s41588-023-01415-w
Chinese Library Classification number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
SHAPEIT5, a phasing method that accurately processes large sequencing datasets, was applied to the UK Biobank whole-genome and whole-exome sequencing data to generate reference panels of haplotypes that boost imputation accuracy and enable the detection of compound heterozygous loss-of-function events for 549 genes.

Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets, and apply it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with switch error rates below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, an improvement that is even more pronounced when the panel is phased with SHAPEIT5 rather than with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome.
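The switch error rate quoted in the abstract can be illustrated with a minimal Python sketch. This is not SHAPEIT5's own evaluation code; the function name `switch_error_rate` and the tuple-based genotype encoding are assumptions made for illustration. The sketch uses the common definition: among consecutive heterozygous sites of one sample, count how often the inferred phase flips relative to a trusted reference phasing (for example, one derived from parental genomes).

```python
from typing import List, Sequence, Tuple

# Hypothetical encoding: each site is (allele on haplotype 0, allele on haplotype 1).
Genotype = Tuple[int, int]


def switch_error_rate(truth: Sequence[Genotype], inferred: Sequence[Genotype]) -> float:
    """Fraction of consecutive heterozygous site pairs whose relative phase is wrong."""
    if len(truth) != len(inferred):
        raise ValueError("truth and inferred must cover the same sites")

    # For each heterozygous site, record whether the inferred allele order
    # matches the truth (True) or is flipped (False).
    orientations: List[bool] = []
    for t, p in zip(truth, inferred):
        t, p = tuple(t), tuple(p)
        if t[0] == t[1]:
            continue  # homozygous sites carry no phase information
        if sorted(t) != sorted(p):
            raise ValueError("unphased genotypes differ between truth and inferred")
        orientations.append(t == p)

    if len(orientations) < 2:
        return 0.0  # fewer than two heterozygous sites: no switch is possible

    # A switch error occurs wherever the orientation changes between neighbours.
    switches = sum(a != b for a, b in zip(orientations, orientations[1:]))
    return switches / (len(orientations) - 1)


truth = [(0, 1), (1, 0), (0, 0), (0, 1), (1, 0)]
inferred = [(0, 1), (1, 0), (0, 0), (1, 0), (0, 1)]
rate = switch_error_rate(truth, inferred)  # one switch among three consecutive pairs
```

Because the measure compares neighbouring sites, a single flip that swaps the remainder of the chromosome counts once rather than once per downstream site.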
Pages: 1243 / +
Page count: 23