GENCODE 2025: reference gene annotation for human and mouse

被引:3
|
作者
Mudge, Jonathan M. [1 ]
Carbonell-Sala, Silvia [2 ]
Diekhans, Mark [3 ]
Martinez, Jose Gonzalez [1 ]
Hunt, Toby [1 ]
Jungreis, Irwin [4 ,5 ]
Loveland, Jane E. [1 ]
Arnan, Carme [2 ]
Barnes, If [1 ]
Bennett, Ruth [1 ]
Berry, Andrew [1 ]
Bignell, Alexandra [1 ]
Cerdan-Velez, Daniel [6 ]
Cochran, Kelly [7 ]
Cortes, Lucas T. [1 ]
Davidson, Claire [1 ]
Donaldson, Sarah [1 ]
Dursun, Cagatay [8 ,9 ]
Fatima, Reham [1 ]
Hardy, Matthew [1 ]
Hebbar, Prajna [3 ]
Hollis, Zoe [1 ]
James, Benjamin T. [4 ,5 ]
Jiang, Yunzhe [8 ,9 ]
Johnson, Rory [10 ,11 ]
Kaur, Gazaldeep [2 ]
Kay, Mike [1 ]
Mangan, Riley J. [4 ,5 ,12 ]
Maquedano, Miguel [6 ]
Martinez Gomez, Laura [6 ]
Mathlouthi, Nourhen [1 ]
Merritt, Ryan [1 ]
Ni, Pengyu [8 ,9 ]
Palumbo, Emilio [2 ]
Perteghella, Tamara [2 ,13 ]
Pozo, Fernando [6 ]
Raj, Shriya [1 ]
Sisu, Cristina [9 ,14 ]
Steed, Emily [1 ]
Sumathipala, Dulika [1 ]
Suner, Marie-Marthe [1 ]
Uszczynska-Ratajczak, Barbara [15 ]
Wass, Elizabeth [1 ]
Yang, Yucheng T. [9 ,16 ]
Zhang, Dingyao [8 ,9 ]
Finn, Robert D. [1 ]
Gerstein, Mark [8 ,9 ]
Guigo, Roderic [2 ,13 ]
Hubbard, Tim J. P. [17 ,18 ]
Kellis, Manolis [4 ,5 ]
机构
[1] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Cambridge CB10 1SD, England
[2] Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Catalonia, Spain
[3] Univ Calif Santa Cruz, Genom Inst, 2300 Delaware Ave, Santa Cruz, CA 95060 USA
[4] MIT, Comp Sci & Artificial Intelligence Lab, 32 Vassar St, Cambridge, MA 02139 USA
[5] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
[6] Spanish Natl Canc Res Ctr CNIO, Bioinformat Unit, Calle Melchor Fernandez Almagro 3, Madrid 28029, Spain
[7] Stanford Univ, Dept Comp Sci, 353 Jane Stanford Way, Stanford, CA 94305 USA
[8] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[9] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[10] Bern Univ Hosp, Dept Med Oncol, Murtenstr 35, CH-3008 Bern, Switzerland
[11] Univ Coll Dublin, Sch Biol & Environm Sci, Dublin D04 V1W8 4, Ireland
[12] Harvard Med Sch, Genet Training Program, Boston, MA 02115 USA
[13] Univ Pompeu Fabra, Dept Ciencies Expt & Salut, Carrer Merce 12, Barcelona 08002, Spain
[14] Brunel Univ London, Dept Life Sci, Kingston Lane, London UB8 3PH, England
[15] Polish Acad Sci, Inst Bioorgan Chem, Dept Computat Biol Noncoding RNA, Noskowskiego 12-14, PL-61704 Poznan, Poland
[16] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, 220 Handan Rd, Shanghai 200433, Peoples R China
[17] Kings Coll London, Guys Hosp, Dept Med & Mol Genet, London SE1 9RT, England
[18] ELIXIR Hub, Wellcome Genome Campus, Cambridge CB10 1SD, England
[19] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
基金
美国国家卫生研究院; 英国惠康基金;
关键词
SEQUENCE;
D O I
10.1093/nar/gkae1078
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies and methodologies allow us to catalog the genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables us to identify large numbers of missing transcripts and to substantially improve existing models, and our long non-coding RNA catalogs have undergone a dramatic expansion and reconfiguration as a result. Meanwhile, we are incorporating data from state-of-the-art proteomics and Ribo-seq experiments to fine-tune our annotation of translated sequences, while further insights into function can be gained from multi-genome alignments that grow richer as more species' genomes are sequenced. Such methodologies are combined into a fully integrated annotation workflow. However, the increasing complexity of our resources can present usability challenges, and we are resolving these with the creation of filtered genesets such as MANE Select and GENCODE Primary. The next challenge is to propagate annotations throughout multiple human and mouse genomes, as we enter the pangenome era. Our resources are freely available at our web portal www.gencodegenes.org, and via the Ensembl and UCSC genome browsers. [GRAPHICS] .
引用
收藏
页码:D966 / D975
页数:10
相关论文
共 50 条
  • [31] The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression
    Derrien, Thomas
    Johnson, Rory
    Bussotti, Giovanni
    Tanzer, Andrea
    Djebali, Sarah
    Tilgner, Hagen
    Guernec, Gregory
    Martin, David
    Merkel, Angelika
    Knowles, David G.
    Lagarde, Julien
    Veeravalli, Lavanya
    Ruan, Xiaoan
    Ruan, Yijun
    Lassmann, Timo
    Carninci, Piero
    Brown, James B.
    Lipovich, Leonard
    Gonzalez, Jose M.
    Thomas, Mark
    Davis, Carrie A.
    Shiekhattar, Ramin
    Gingeras, Thomas R.
    Hubbard, Tim J.
    Notredame, Cedric
    Harrow, Jennifer
    Guigo, Roderic
    GENOME RESEARCH, 2012, 22 (09) : 1775 - 1789
  • [32] Universal human, mouse & rat reference RNA as standards for microarray gene expression analysis
    Novoradovskaya, N
    Perou, C
    Whitfield, ML
    Basehore, S
    Pesich, R
    Aprelikova, O
    Fero, M
    Brown, PO
    Botstein, D
    Braman, J
    FASEB JOURNAL, 2003, 17 (04): : A78 - A78
  • [33] The Gene Wiki in 2011: community intelligence applied to human gene annotation
    Good, Benjamin M.
    Clarke, Erik L.
    de Alfaro, Luca
    Su, Andrew I.
    NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D1255 - D1261
  • [34] Functional annotation of human cytomegalovirus gene products: an update
    Van Damme, Ellen
    Van Loock, Marnix
    FRONTIERS IN MICROBIOLOGY, 2014, 5
  • [35] Annotation of Human Exome Gene Variants with Consensus Pathogenicity
    Jaravine, Victor
    Balmford, James
    Metzger, Patrick
    Boerries, Melanie
    Binder, Harald
    Boeker, Martin
    GENES, 2020, 11 (09) : 1 - 18
  • [36] Bioinformatics assisted gene discovery and annotation of human genome
    Wang, W
    Wang, YH
    Li, W
    CHEMICAL RESEARCH IN CHINESE UNIVERSITIES, 2002, 18 (04) : 491 - 494
  • [38] Manual Gene Ontology annotation workflow at the Mouse Genome Informatics Database
    Drabkin, Harold J.
    Blake, Judith A.
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
  • [39] Reference based annotation with GeneMapper
    Sourav Chatterji
    Lior Pachter
    Genome Biology, 7
  • [40] Reference based annotation with GeneMapper
    Chatterji, Sourav
    Pachter, Lior
    GENOME BIOLOGY, 2006, 7 (04)