Whole genome variant analysis in three ethnically diverse Indians

被引:2
|
作者
Malhotra, Seema [1 ]
Singh, Sayar [1 ]
Sarkar, Soma [1 ]
机构
[1] Govt India, DIPAS, Def Res & Dev Org, Minist Def, Lucknow Rd, Delhi 110054, India
关键词
Indian genome; Ethnic; Genetic diversity; Whole genome sequencing; HIGH-ALTITUDE; POPULATION; MTDNA; SEQUENCE; MUTATION; ROLES; DNA;
D O I
10.1007/s13258-018-0650-z
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
India represents an amazing confluence of geographically, linguistically and socially disparate ethnic populations (Indian Genome Variation Consortium, J Genet 87:3-20, 2008). Understanding the genetic diversity of Indian population remains a daunting task. In this paper we present detailed analysis of genomic variations (high-depth coverage ( 30x) using Illumina Hiseq 2000 platform) from three healthy Indian male individuals each belonging to three geographically delineated regions and linguistic phylum viz. high altitude region of Ladakh (Tibeto-Burman linguistic phylum), sub mountainous region of Kumaun (Indo-European linguistic phylum) and sea level region of Telangana (Dravidian linguistic phylum) for probing the extent of genetic diversity in our population. The sequencing analysis provided high quality data ( 95% of the total reads aligned to the human reference genome for each sample) and very good alignment quality (> 80% of the filtered mapped reads had a quality score of 60). A total of 4.3, 3.7 and 4.3 million single nucleotide variations were identified in the genome of high altitude, sub mountainous and sea level respectively by comparing with human reference genome. Approximately 17.3, 18.2, 17.4% of the variants were unique in the three genomes. The study identified many novel variations in the three diverse genomes (132,970 in Ladakh, 112,317 in Kumaun and 128,881 in Telangana individual) and is an important resource for creating a baseline and a comprehensive catalogue of human genomic variation across the Indian as well as the Asian continent.
引用
收藏
页码:497 / 510
页数:14
相关论文
共 50 条
  • [21] Meta-analysis of genome-wide association studies of asthma in ethnically diverse North American populations
    Torgerson, Dara G.
    Ampleford, Elizabeth J.
    Chiu, Grace Y.
    Gauderman, W. James
    Gignoux, Christopher R.
    Graves, Penelope E.
    Himes, Blanca E.
    Levin, Albert M.
    Mathias, Rasika A.
    Hancock, Dana B.
    Baurley, James W.
    Eng, Celeste
    Stern, Debra A.
    Celedon, Juan C.
    Rafaels, Nicholas
    Capurso, Daniel
    Conti, David V.
    Roth, Lindsey A.
    Soto-Quiros, Manuel
    Togias, Alkis
    Li, Xingnan
    Myers, Rachel A.
    Romieu, Isabelle
    Van Den Berg, David J.
    Hu, Donglei
    Hansel, Nadia N.
    Hernandez, Ryan D.
    Israel, Elliott
    Salam, Muhammad T.
    Galanter, Joshua
    Avila, Pedro C.
    Avila, Lydiana
    Rodriquez-Santana, Jose R.
    Chapela, Rocio
    Rodriguez-Cintron, William
    Diette, Gregory B.
    Adkinson, N. Franklin
    Abel, Rebekah A.
    Ross, Kevin D.
    Shi, Min
    Faruque, Mezbah U.
    Dunston, Georgia M.
    Watson, Harold R.
    Mantese, Vito J.
    Ezurum, Serpil C.
    Liang, Liming
    Ruczinski, Ingo
    Ford, Jean G.
    Huntsman, Scott
    Chung, Kian Fan
    NATURE GENETICS, 2011, 43 (09) : 887 - U103
  • [22] Whole-Genome Sequencing and Genetic Variant Analysis of a Quarter Horse Mare
    Doan, Ryan
    Cohen, Noah D.
    Sawyer, Jason
    Ghaffari, Noushin
    Johnson, Charlie D.
    Dindot, Scott V.
    BMC GENOMICS, 2012, 13
  • [23] Joint structural variant analysis of colorectal cancer whole genome sequencing data
    Pitkanen, Esa
    Cajuso, Tatiana
    Katainen, Riku
    Lundgren, Sofie
    Tuupanen, Sari
    Kilpivaara, Outi
    Aaltonen, Lauri A.
    CANCER RESEARCH, 2015, 75
  • [24] Whole-Genome Sequencing and Variant Analysis of Human Papillomavirus 16 Infections
    van der Weele, Pascal
    Meijer, Chris J. L. M.
    King, Audrey J.
    JOURNAL OF VIROLOGY, 2017, 91 (19)
  • [25] Whole-Genome sequencing and genetic variant analysis of a quarter Horse mare
    Ryan Doan
    Noah D Cohen
    Jason Sawyer
    Noushin Ghaffari
    Charles D Johnson
    Scott V Dindot
    BMC Genomics, 13
  • [26] A global analysis of CNVs in diverse yak populations using whole-genome resequencing
    Hui Wang
    Zhixin Chai
    Dan Hu
    Qiumei Ji
    Jinwei Xin
    Chengfu Zhang
    Jincheng Zhong
    BMC Genomics, 20
  • [27] A global analysis of CNVs in diverse yak populations using whole-genome resequencing
    Wang, Hui
    Chai, Zhixin
    Hu, Dan
    Ji, Qiumei
    Xin, Jinwei
    Zhang, Chengfu
    Zhong, Jincheng
    BMC GENOMICS, 2019, 20 (1)
  • [28] Comparative whole genome analysis of three consecutive Salmonella diarizonae isolates
    Gerlach, Roman G.
    Walter, Steffi
    McClelland, Michael
    Schmidt, Christiane
    Steglich, Matthias
    Prager, Rita
    Bender, Jennifer K.
    Fuchs, Stephan
    Schoerner, Christoph
    Rabsch, Wolfgang
    Lang, Werner
    Jantsch, Jonathan
    INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2017, 307 (08) : 542 - 551
  • [29] Comparative whole genome transcriptome analysis of three Plasmodium falciparum strains
    Llinás, M
    Bozdech, Z
    Wong, ED
    Adai, AT
    DeRisi, JL
    NUCLEIC ACIDS RESEARCH, 2006, 34 (04) : 1166 - 1173
  • [30] Combined analysis of three whole genome linkage scans for Ankylosing Spondylitis
    Carter, K. W.
    Pluzhnikov, A.
    Timms, A. E.
    Miceli-Richard, C.
    Bourgain, C.
    Wordsworth, B. P.
    Jean-Pierre, H.
    Cox, N. J.
    Palmer, L. J.
    Breban, M.
    Reveille, J. D.
    Brown, M. A.
    RHEUMATOLOGY, 2007, 46 (05) : 763 - 771