GCAT|Panel, a comprehensive structural variant haplotype map of the Iberian population from high-coverage whole-genome sequencing

Cited by: 5
Authors
Valls-Margarit, Jordi [1]
Galvan-Femenia, Ivan [2, 15]
Matias-Sanchez, Daniel [1]
Blay, Natalia [2]
Puiggros, Montserrat [1]
Carreras, Anna [2]
Salvoro, Cecilia [1]
Cortes, Beatriz [2]
Amela, Ramon [1]
Farre, Xavier [2]
Lerga-Jaso, Jon [3]
Puig, Marta [3]
Sanchez-Herrero, Jose Francisco [4]
Moreno, Victor [5, 6, 7, 8]
Perucho, Manuel [9, 10]
Sumoy, Lauro [4]
Armengol, Lluis [11]
Delaneau, Olivier [12, 13]
Caceres, Mario [3, 14]
de Cid, Rafael [2]
Torrents, David [1, 14]
Affiliations
[1] Barcelona Supercomp Ctr BSC, Life Sci Dept, Barcelona 08034, Spain
[2] Inst Hlth Sci Res Germans Trias & Pujol IGTP, Genomes Life GCAT Lab Grp, Badalona 08916, Spain
[3] Univ Autonoma Barcelona, Inst Biotecnol & Biomed, Barcelona 08193, Spain
[4] Inst Hlth Sci Res Germans Trias & Pujol IGTP, High Content Genom & Bioinformat Unit, Badalona 08916, Spain
[5] Catalan Inst Oncol, Lhospitalet De Llobregat 08908, Spain
[6] Bellvitge Biomed Res Inst IDIBELL, Lhospitalet De Llobregat 08908, Spain
[7] CIBER Epidemiol & Salud Publ CIBERESP, Madrid 28029, Spain
[8] Univ Barcelona UB, Barcelona 08007, Spain
[9] Sanford Burnham Prebys Med Discovery Inst SBP, La Jolla, CA 92037 USA
[10] Hlth Sci Res Inst Germans Trias & Pujol IGTP, Program Predict & Personalized Med Canc PMPPC, Canc Genet & Epigenet, Badalona 08916, Spain
[11] Quantitat Genom Med Labs qGen, Esplugues Del Llobregat 08950, Spain
[12] Univ Lausanne, Dept Computat Biol, CH-1015 Lausanne, Switzerland
[13] Univ Lausanne, Swiss Inst Bioinformat SIB, Quartier Sorge Batiment Amphipole, CH-1015 Lausanne, Switzerland
[14] ICREA, Barcelona 08010, Spain
[15] Barcelona Inst Sci & Technol, Inst Res Biomed IRB Barcelona, Barcelona 08028, Spain
Funding
European Union Horizon 2020
Keywords
DROSOPHILA-MELANOGASTER; GENOTYPE; CANCER; DISCOVERY; PROGRAM;
DOI
10.1093/nar/gkac076
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The combined analysis of haplotype panels with phenotyped clinical cohorts is a common approach to exploring the genetic architecture of human diseases. However, genetic studies are mainly based on single nucleotide variants (SNVs) and small insertions and deletions (indels). Here, we help fill this gap by generating a dense haplotype map focused on the identification, characterization, and phasing of structural variants (SVs). By integrating multiple variant identification methods and Logistic Regression Models (LRMs), we present a catalogue of 35 431 441 variants, including 89 178 SVs (≥50 bp), 30 325 064 SNVs and 5 017 199 indels, across 785 Illumina high-coverage (30x) whole genomes from the Iberian GCAT Cohort, with a median of 3.52M SNVs, 606 336 indels and 6393 SVs per individual. The haplotype panel can impute up to 14 360 728 SNVs/indels and 23 179 SVs, a 2.7-fold increase for SVs compared with available genetic variation panels. The value of this panel for SV analysis is shown through an imputed rare Alu element located in a new locus associated with mononeuritis of lower limb, a rare neuromuscular disease. This study represents the first deep characterization of genetic variation within the Iberian population and the first operational haplotype panel to systematically include SVs in genome-wide genetic studies.
Pages: 2464-2479
Page count: 16