Whole-genome sequencing of 1029 Indian individuals reveals unique and rare structural variants

Cited by: 3
Authors
Divakar, Mohit Kumar [1, 2]
Jain, Abhinav [1, 2]
Bhoyar, Rahul C. [1]
Senthivel, Vigneshwar [1, 2]
Jolly, Bani [1, 2]
Imran, Mohamed [1, 2]
Sharma, Disha [1, 2]
Bajaj, Anjali [1, 2]
Gupta, Vishu [1, 2]
Scaria, Vinod [1, 2]
Sivasubbu, Sridhar [1, 2]
Affiliations
[1] Inst Genom & Integrat Biol CSIR IGIB, CSIR, Mathura Rd, New Delhi 110025, India
[2] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, India
Keywords
DISEASE; IDENTIFICATION;
DOI
10.1038/s10038-023-01131-7
Chinese Library Classification number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Structural variants contribute to genetic variability in human genomes and can show population-specific patterns. We aimed to understand the landscape of structural variants in the genomes of healthy Indian individuals and to explore their potential implications in genetic disease. To identify structural variants, we analysed a whole-genome sequencing dataset of 1029 self-declared healthy Indian individuals from the IndiGen project. These variants were then evaluated for potential pathogenicity and for associations with genetic diseases. We also compared the identified variants with existing global datasets. We generated a compendium of 38,560 high-confidence structural variants, comprising 28,393 deletions, 5030 duplications, 5038 insertions, and 99 inversions. Notably, around 55% of these variants were unique to the studied population. Further analysis revealed 134 deletions with predicted pathogenic or likely pathogenic effects; the affected genes were mainly enriched for neurological disease conditions, such as intellectual disability and neurodegenerative diseases. The IndiGenomes dataset helped us to understand the unique spectrum of structural variants in the Indian population. More than half of the identified variants were not present in the publicly available global dataset on structural variants. Clinically important deletions identified in IndiGenomes might aid in improving the diagnosis of unsolved genetic diseases, particularly neurological conditions. Along with basal allele frequency data and clinically important deletions, IndiGenomes data might serve as a baseline resource for future studies on genomic structural variant analysis in the Indian population.
Pages: 409-417
Number of pages: 9
Related papers
50 in total
  • [11] Comprehensive rare variant analysis of individuals with neurodevelopmental disorders by whole-genome sequencing
    Sanchis-Juan, A.
    Armirola, C.
    Megy, K.
    Low, K.
    French, C. E.
    Grozeva, D.
    Dewhurst, E.
    Stephens, J.
    Stirrups, K.
    Erwood, M.
    Penkett, C.
    Shamardina, O.
    Ambegaonkar, G.
    Chitre, M.
    Josifova, D.
    Kurian, M.
    Parker, A.
    Rankin, J.
    Reid, E.
    Wakeling, E.
    Wassmer, E.
    Woods, G.
    Ouwehand, W. H.
    Raymond, F.
    Carss, K. J.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 1471 - 1471
  • [12] Whole-genome sequencing data of Kazakh individuals
    Ulykbek Kairov
    Askhat Molkenov
    Saule Rakhimova
    Ulan Kozhamkulov
    Aigul Sharip
    Daniyar Karabayev
    Asset Daniyarov
    Joseph H. Lee
    Joseph D. Terwilliger
    Ainur Akilzhanova
    Zhaxybay Zhumadilov
    BMC Research Notes, 14
  • [13] A portable and scalable workflow for detecting structural variants in whole-genome sequencing data
    Kuzniar, Arnold
    Maassen, Jason
    Verhoeven, Stefan
    Santuari, Luca
    Shneider, Carl
    Kloosterman, Wigard
    de Ridder, Jeroen
    2018 IEEE 14TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE 2018), 2018, : 303 - 304
  • [14] Contribution of rare whole-genome sequencing variants to plasma protein levels and the missing heritability
    Kierczak, Marcin
    Rafati, Nima
    Hoglund, Julia
    Gourle, Hadrien
    Lo Faro, Valeria
    Schmitz, Daniel
    Ek, Weronica E.
    Gyllensten, Ulf
    Enroth, Stefan
    Ekman, Diana
    Nystedt, Bjorn
    Karlsson, Torgny
    Johansson, Asa
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [15] Contribution of rare whole-genome sequencing variants to plasma protein levels and the missing heritability
    Marcin Kierczak
    Nima Rafati
    Julia Höglund
    Hadrien Gourlé
    Valeria Lo Faro
    Daniel Schmitz
    Weronica E. Ek
    Ulf Gyllensten
    Stefan Enroth
    Diana Ekman
    Björn Nystedt
    Torgny Karlsson
    Åsa Johansson
    Nature Communications, 13
  • [16] Whole-genome sequencing data reveals higher number of structural variants in Chernobyl catastrophe cleanup workers from Lithuania
    Domarkiene, Ingrida
    Zukauskaite, Gabriele
    Urnikyte, Alina
    Pranckeniene, Laura
    Dauengauer-Kirliene, Svetlana
    Arasimavicius, Justas
    Molyte, Alma
    Matuleviciene, Ausra
    Pilypiene, Ingrida
    Kucinskas, Vaidutis
    Ambrozaityte, Laima
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 570 - 570
  • [17] Longitudinal whole-genome sequencing reveals the evolution of MPAL
    Zhang, Yu
    Kang, Zhijie
    Lv, Dekang
    Zhang, Xuehong
    Liao, Yuwei
    Li, Yulong
    Liu, Ruimei
    Li, Peiying
    Tong, Mengying
    Tian, Jichao
    Shao, Yanyan
    Huang, Chao
    Ge, Dongcen
    Zhang, Jingkai
    Bai, Wanting
    Wang, Yichen
    Liu, Quentin
    Li, Zhiguang
    Yan, Jinsong
    CANCER GENETICS, 2020, 240 : 59 - 65
  • [18] Whole genome sequencing of families diagnosed with cardiac channelopathies reveals structural variants missed by whole exome sequencing
    Senthivel, Vigneshwar
    Jolly, Bani
    Arvinden, V. R.
    Bajaj, Anjali
    Bhoyar, Rahul
    Imran, Mohamed
    Vignesh, Harie
    Divakar, Mohit Kumar
    Sharma, Gautam
    Rai, Nitin
    Kumar, Kapil
    Jayakrishnan, M. P.
    Krishna, Maniram
    Shenthar, Jeyaprakash
    Ali, Muzaffar
    Abqari, Shaad
    Nadri, Gulnaz
    Scaria, Vinod
    Naik, Nitish
    Sivasubbu, Sridhar
    JOURNAL OF HUMAN GENETICS, 2024, 69 (09) : 455 - 465
  • [19] Estimation of allele frequency of pathological variants based on whole-genome sequencing of 1070 Japanese individuals
    Yamaguchi-Kabata, Yumi
    Kawai, Yosuke
    Kojima, Kaname
    Nariai, Naoki
    Mimori, Takahiro
    Sato, Yukuto
    Katsuoka, Fumiki
    Yasuda, Jun
    Yamamoto, Masayuki
    Nagasaki, Masao
    GENES & GENETIC SYSTEMS, 2015, 90 (06) : 379 - 379
  • [20] Whole-genome sequencing of East Asian lung cancers reveals new germline pathogenic variants
    Mukherjee, Semanti
    Carrot-Zhang, Jian
    CANCER CELL, 2022, 40 (10) : 1081 - 1083