De Novo Origin of Protein-Coding Genes in Murine Rodents

Cited by: 39
Authors
Murphy, Daniel N. [1 ]
McLysaght, Aoife [1 ]
Affiliations
[1] Univ Dublin Trinity Coll, Smurfit Inst Genet, Dublin 2, Ireland
Source
PLOS ONE | 2012 / Volume 7 / Issue 11
Funding
Science Foundation Ireland;
Keywords
MULTIPLE SEQUENCE ALIGNMENT; GENOME; EVOLUTION; RESOURCE; MICE;
DOI
10.1371/journal.pone.0048650
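Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code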
07 ; 0710 ; 09 ;
Abstract
Background: New genes in eukaryotes are created through a variety of mechanisms. De novo origin from non-coding DNA is a mechanism that has recently gained attention. So far, de novo genes have been described in a handful of organisms, with Drosophila being the most extensively studied. We searched for genes that have appeared de novo in the mouse and rat lineages.

Methodology: Using a rigorous and conservative approach, we identify 75 murine genes (69 mouse genes and 6 rat genes) for which there is good evidence of de novo origin since the divergence of mouse and rat. Each of these genes is found in only one of the two lineages, with neither candidate orthologs nor evidence of potentially unannotated orthologs in the other lineage. The veracity of each of these genes is supported by expression evidence. Additionally, their presence in one lineage and absence in the other cannot be explained by sequencing gaps. For 11 of the 75 candidate novel genes we could identify a mouse-specific mutation that created the open reading frame (ORF) specifically in mouse. None of the six rat-specific genes had an unequivocal rat-specific mutation creating the ORF, which may be at least partly due to lower data quality for that genome.

Conclusions: All 75 candidate genes presented in this study are relatively small and encode short peptides. A large number of them (51 out of 69 mouse genes and 3 out of 6 rat genes) also overlap with other genes, either within introns or on the opposite strand. These characteristics have previously been documented for de novo genes. The description of these genes opens up the opportunity to integrate this evolutionary analysis with the rich experimental data available for these two model organisms.
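The idea of an ORF that is intact in one lineage but disrupted in the orthologous region of the other can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the toy sequences, and the simplifying assumption of a gap-free alignment of equal length are all hypothetical. Deciding which lineage carries the derived (enabling) change would additionally require an outgroup, which this sketch omits.

```python
# Standard genetic code stop codons.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a nucleotide sequence into complete in-frame codons."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def has_intact_orf(seq):
    """True if seq starts with ATG, ends with a stop, and has no internal stop."""
    if len(seq) % 3 != 0:
        return False
    c = codons(seq)
    if len(c) < 2:
        return False
    internal_stop = any(codon in STOP_CODONS for codon in c[1:-1])
    return c[0] == "ATG" and c[-1] in STOP_CODONS and not internal_stop


def orf_disablers(region):
    """List features that stop an aligned region from encoding the same ORF.

    Assumes (hypothetically) that `region` is the gap-free orthologous
    sequence, in the same frame and of the same length as the ORF.
    Returns (kind, codon_index) tuples.
    """
    c = codons(region)
    problems = []
    if not c or c[0] != "ATG":
        problems.append(("no_start", 0))
    # Every codon except the last is checked for a premature stop.
    for idx, codon in enumerate(c[:-1]):
        if codon in STOP_CODONS:
            problems.append(("premature_stop", idx))
    return problems


# Toy sequences, invented for illustration only.
mouse_orf = "ATGGCTTGGAAATAA"   # ATG GCT TGG AAA TAA
rat_region = "ATGGCTTGAAAATAA"  # ATG GCT TGA AAA TAA

print(has_intact_orf(mouse_orf))   # True
print(has_intact_orf(rat_region))  # False
print(orf_disablers(rat_region))   # [('premature_stop', 2)]
```

In this toy case the two sequences differ at a single codon (TGG in mouse, TGA in rat), so one substitution is enough to separate an open reading frame from a disrupted one.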
Pages: 14