Characterization of genome-wide STR variation in 6487 human genomes

被引:28
|
作者
Shi, Yirong [1 ,2 ]
Niu, Yiwei [1 ,3 ]
Zhang, Peng [1 ]
Luo, Huaxia [1 ]
Liu, Shuai [1 ,3 ]
Zhang, Sijia [1 ,3 ]
Wang, Jiajia [1 ]
Li, Yanyan [1 ]
Liu, Xinyue [1 ,2 ]
Song, Tingrui [1 ]
Xu, Tao [4 ,5 ]
He, Shunmin [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Biophys, Ctr Big Data Res Hlth, Key Lab RNA Biol, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Biophys, CAS Ctr Excellence Biomacromolecules, Natl Lab Biomacromolecules, Beijing 100101, Peoples R China
[5] Shandong First Med Univ & Shandong Acad Med Sci, Jinan 250117, Shandong, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金; 国家重点研发计划;
关键词
TANDEM REPEATS; GENE-EXPRESSION; FRAGILE-X; MICROSATELLITE REPEAT; STRUCTURAL VARIATION; POPULATION; MUTATIONS; DNA; DISCOVERY; EVOLUTION;
D O I
10.1038/s41467-023-37690-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (similar to 31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (similar to 33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3 ' UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Characterization of genome-wide STR variation in 6487 human genomes
    Yirong Shi
    Yiwei Niu
    Peng Zhang
    Huaxia Luo
    Shuai Liu
    Sijia Zhang
    Jiajia Wang
    Yanyan Li
    Xinyue Liu
    Tingrui Song
    Tao Xu
    Shunmin He
    Nature Communications, 14
  • [2] Genome-wide characterization of simple sequence repeats in Palmae genomes
    Manee, Manee M.
    Al-Shomrani, Badr M.
    Al-Fageeh, Mohamed B.
    GENES & GENOMICS, 2020, 42 (05) : 597 - 608
  • [3] Genome-wide characterization of simple sequence repeats in Palmae genomes
    Manee M. Manee
    Badr M. Al-Shomrani
    Mohamed B. Al-Fageeh
    Genes & Genomics, 2020, 42 : 597 - 608
  • [4] Genome-wide variation in the human and fruitfly: a comparison
    Aquadro, CF
    DuMont, VB
    Reed, FA
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 2001, 11 (06) : 627 - 634
  • [5] Genome-wide characterization of centromeric satellites from multiple mammalian genomes
    Alkan, Can
    Cardone, Maria Francesca
    Catacchio, Claudia Rita
    Antonacci, Francesca
    O'Brien, Stephen J.
    Ryder, Oliver A.
    Purgato, Stefania
    Zoli, Monica
    Della Valle, Giuliano
    Eichler, Evan E.
    Ventura, Mario
    GENOME RESEARCH, 2011, 21 (01) : 137 - 145
  • [6] A genome-wide perspective of genetic variation in human metabolism
    Thomas Illig
    Christian Gieger
    Guangju Zhai
    Werner Römisch-Margl
    Rui Wang-Sattler
    Cornelia Prehn
    Elisabeth Altmaier
    Gabi Kastenmüller
    Bernet S Kato
    Hans-Werner Mewes
    Thomas Meitinger
    Martin Hrabé de Angelis
    Florian Kronenberg
    Nicole Soranzo
    H-Erich Wichmann
    Tim D Spector
    Jerzy Adamski
    Karsten Suhre
    Nature Genetics, 2010, 42 : 137 - 141
  • [7] A genome-wide perspective of genetic variation in human metabolism
    Illig, Thomas
    Gieger, Christian
    Zhai, Guangju
    Roemisch-Margl, Werner
    Wang-Sattler, Rui
    Prehn, Cornelia
    Altmaier, Elisabeth
    Kastenmueller, Gabi
    Kato, Bernet S.
    Mewes, Hans-Werner
    Meitinger, Thomas
    de Angelis, Martin Hrabe
    Kronenberg, Florian
    Soranzo, Nicole
    Wichmann, H-Erich
    Spector, Tim D.
    Adamski, Jerzy
    Suhre, Karsten
    NATURE GENETICS, 2010, 42 (02) : 137 - U66
  • [8] A genome-wide atlas of recurrent repeat expansions in human cancer genomes
    Erwin, Graham S.
    Gursoy, Gamze
    Al-Abri, Rashid
    Hoerner, Christian
    Dolzhenko, Egor
    Eberle, Michael
    Fan, Alice
    Leppert, John
    Gerstein, Mark
    Snyder, Michael P.
    CANCER RESEARCH, 2022, 82 (12)
  • [9] Genome-Wide Variation in Betacoronaviruses
    LaTourrette, Katherine
    Holste, Natalie M.
    Rodriguez-Pena, Rosalba
    Leme, Raquel Arruda
    Garcia-Ruiz, Hernan
    JOURNAL OF VIROLOGY, 2021, 95 (15)
  • [10] Genome-Wide Variation in Potyviruses
    Nigam, Deepti
    LaTourrette, Katherine
    Souza, Pedro F. N.
    Garcia-Ruiz, Hernan
    FRONTIERS IN PLANT SCIENCE, 2019, 10