Large-Scale Genomics Reveals the Genetic Characteristics of Seven Species and Importance of Phylogenetic Distance for Estimating Pan-Genome Size

被引:55
|
作者
Park, Sang-Cheol [1 ]
Lee, Kihyun [2 ]
Kim, Yeong Ouk [3 ]
Won, Sungho [1 ,3 ,4 ]
Chun, Jongsik [3 ,5 ,6 ]
机构
[1] Seoul Natl Univ, Inst Hlth & Environm, Seoul, South Korea
[2] Chung Ang Univ, Dept Syst Biotechnol, Anseong, South Korea
[3] Seoul Natl Univ, Interdisciplinary Program Bioinformat, Seoul, South Korea
[4] Seoul Natl Univ, Dept Publ Hlth Sci, Seoul, South Korea
[5] Seoul Natl Univ, Dept Biol Sci, Seoul, South Korea
[6] Seoul Natl Univ, Inst Mol Biol & Genet, Seoul, South Korea
来源
FRONTIERS IN MICROBIOLOGY | 2019年 / 10卷
基金
新加坡国家研究基金会;
关键词
pan-genome; core-genome; Heaps' law; gene pool; large-scale genomics; seven species; estimation model; ANTIBIOTIC-RESISTANCE; SEQUENCE SIMILARITY; ESCHERICHIA-COLI; STRAINS;
D O I
10.3389/fmicb.2019.00834
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
For more than a decade, pan-genome analysis has been applied as an effective method for explaining the genetic contents variation of prokaryotic species. However, genomic characteristics and detailed structures of gene pools have not been fully clarified, because most studies have used a small number of genomes. Here, we constructed pan-genomes of seven species in order to elucidate variations in the genetic contents of >27,000 genomes belonging to Streptococcus pneumoniae, Staphylococcus aureus subsp. aureus, Salmonella enterica subsp. enterica, Escherichia coli and Shigella spp., Mycobacterium tuberculosis complex, Pseudomonas aeruginosa, and Acinetobacter baumannii. This work showed the pan-genomes of all seven species has open property. Additionally, systematic evaluation of the characteristics of their pan-genome revealed that phylogenetic distance provided valuable information for estimating the parameters for pan-genome size among several models including Heaps' law. Our results provide a better understanding of the species and a solution to minimize sampling biases associated with genome-sequencing preferences for pathogenic strains.
引用
收藏
页数:12
相关论文
共 29 条
  • [21] A large-scale genome-wide cross-trait analysis reveals shared genetic architecture between Alzheimer's disease and gastrointestinal tract disorders
    Adewuyi, Emmanuel O.
    O'Brien, Eleanor K.
    Nyholt, Dale R.
    Porter, Tenielle
    Laws, Simon M.
    COMMUNICATIONS BIOLOGY, 2022, 5 (01)
  • [22] Large-scale genome-wide CHANGE-seq profiling of CRISPR-Cas9 therapeutic targets reveals genetic and epigenetic determinants of activity
    Lazzarotto, Cicera R.
    Malinin, Nikolay
    Katta, Varun
    Li, Yichao
    Cheng, Yong
    Tsai, Shengdar Q.
    TRANSGENIC RESEARCH, 2020, 29 (04) : 473 - 473
  • [23] A large-scale genome-wide cross-trait analysis reveals shared genetic architecture between Alzheimer’s disease and gastrointestinal tract disorders
    Emmanuel O. Adewuyi
    Eleanor K. O’Brien
    Dale R. Nyholt
    Tenielle Porter
    Simon M. Laws
    Communications Biology, 5
  • [24] Large-Scale CRISPR-Cas Genome-Wide Activity Profiling in Human Primary T-Cells Reveals Genetic and Epigenetic Determinants of Genome-Wide Nuclease Activity
    Lazzarotto, Cicera
    Malinin, Nikolay
    Li, Yichao
    Hang, Ruochi Z.
    Yang, Yang
    Lee, GaHyun
    Cowley, Eleanor
    He, Yanghua
    Lan, Xin
    Jividen, Kasey
    Katta, Varun
    Kolmakova, Natalia
    Petersen, Chris
    Qi, Qian
    Strelcov, Evgheni
    Maragh, Samantha
    Krenciute, Giedre
    Ma, Jian
    Cheng, Yong
    Tsai, Shengdar
    MOLECULAR THERAPY, 2020, 28 (04) : 227 - 228
  • [25] Large-scale meta-genome-wide association study reveals common genetic factors linked to radiation-induced acute toxicities across cancer types
    Naderi, Elnaz
    Aguado-Barrera, Miguel E.
    Schack, Line M. H.
    Dorling, Leila
    Rattay, Tim
    Fachal, Laura
    Summersgill, Holly
    Martinez-Calvo, Laura
    Welsh, Ceilidh
    Dudding, Tom
    Odding, Yasmin
    Varela-Pazos, Ana
    Jena, Rajesh
    Thomson, David J.
    Steenbakkers, Roel J. H. M.
    Dennis, Joe
    Lobato-Busto, Ramon
    Alsner, Jan
    Ness, Andy
    Nutting, Chris
    Gomez-Caamano, Antonio
    Eriksen, Jesper G.
    Thomas, Steve J.
    Bates, Amy M.
    Webb, Adam J.
    Choudhury, Ananya
    Rosenstein, Barry S.
    Taboada-Valladares, Begona
    Herskind, Carsten
    Azria, David
    Dearnaley, David P.
    de Ruysscher, Dirk
    Sperk, Elena
    Hall, Emma
    Stobart, Hilary
    Chang-Claude, Jenny
    De Ruyck, Kim
    Veldeman, Liv
    Altabas, Manuel
    De Santis, Maria Carmen
    Farcy-Jacquet, Marie-Pierre
    Veldwijk, Marlon R.
    Sydes, Matthew R.
    Parliament, Matthew
    Usmani, Nawaid
    Burnet, Neil G.
    Seibold, Petra
    Symonds, R. Paul
    Elliott, Rebecca M.
    Bultijnck, Renee
    JNCI CANCER SPECTRUM, 2023, 7 (06)
  • [26] Large-scale genome-wide association study of Asian population reveals genetic factors in FRMD4A and other loci influencing smoking initiation and nicotine dependence
    Dankyu Yoon
    Young-Jin Kim
    Wen-Yan Cui
    Andrew Van der Vaart
    Yoon Shin Cho
    Jong-Young Lee
    Jennie Z. Ma
    Thomas J. Payne
    Ming D. Li
    Taesung Park
    Human Genetics, 2012, 131 : 1009 - 1021
  • [27] Large-scale genome-wide association study of Asian population reveals genetic factors in FRMD4A and other loci influencing smoking initiation and nicotine dependence
    Yoon, Dankyu
    Kim, Young-Jin
    Cui, Wen-Yan
    Van der Vaart, Andrew
    Cho, Yoon Shin
    Lee, Jong-Young
    Ma, Jennie Z.
    Payne, Thomas J.
    Li, Ming D.
    Park, Taesung
    HUMAN GENETICS, 2012, 131 (06) : 1009 - 1021
  • [28] The Influence of Age and Sex on Genetic Associations with Adult Body Size and Shape: A Large-Scale Genome-Wide Interaction Study (vol 11, e1005378, 2015)
    Winkler, Thomas W.
    Justice, Anne E.
    Graff, Mariaelisa
    Barata, Llilda
    Feitosa, Mary F.
    Chu, Su
    Czajkowski, Jacek
    Esko, Tonu
    Fall, Tove
    Kilpelainen, Tuomas O.
    Lu, Yingchang
    Magi, Reedik
    Mihailov, Evelin
    Pers, Tune H.
    Rueger, Sina
    Teumer, Alexander
    Ehret, Georg B.
    Ferreira, Teresa
    Heard-Costa, Nancy L.
    Karjalainen, Juha
    Lagou, Vasiliki
    Mahajan, Anubha
    Neinast, Michael D.
    Prokopenko, Inga
    Simino, Jeannette
    Teslovich, Tanya M.
    Jansen, Rick
    Westra, Harm-Jan
    White, Charles C.
    Absher, Devin
    Ahluwalia, Tarunveer S.
    Ahmad, Shafqat
    Albrecht, Eva
    Alves, Alexessander Couto
    Bragg-Gresham, Jennifer L.
    de Craen, Anton J. M.
    Bis, Joshua C.
    Bonnefond, Amelie
    Boucher, Gabrielle
    Cadby, Gemma
    Cheng, Yu-Ching
    Chiang, Charleston W. K.
    Delgado, Graciela
    Demirkan, Ayse
    Dueker, Nicole
    Eklund, Niina
    Eiriksdottir, Gudny
    Eriksson, Joel
    Feenstra, Bjarke
    Fischer, Krista
    PLOS GENETICS, 2016, 12 (06):
  • [29] Large-Scale CRISPR-Cas Genome-Wide Activity Profiling in Human Primary T-Cells Reveals Genetic and Epigenetic Determinants of Off-Target Effects
    Lazzarotto, Cicera R.
    Malinin, Nikolay L.
    Katta, Varun
    Qi, Qian
    Cheng, Yong
    Tsai, Shengdar Q.
    MOLECULAR THERAPY, 2019, 27 (04) : 5 - 6