GENCODE 2025: reference gene annotation for human and mouse

被引:3
|
作者
Mudge, Jonathan M. [1 ]
Carbonell-Sala, Silvia [2 ]
Diekhans, Mark [3 ]
Martinez, Jose Gonzalez [1 ]
Hunt, Toby [1 ]
Jungreis, Irwin [4 ,5 ]
Loveland, Jane E. [1 ]
Arnan, Carme [2 ]
Barnes, If [1 ]
Bennett, Ruth [1 ]
Berry, Andrew [1 ]
Bignell, Alexandra [1 ]
Cerdan-Velez, Daniel [6 ]
Cochran, Kelly [7 ]
Cortes, Lucas T. [1 ]
Davidson, Claire [1 ]
Donaldson, Sarah [1 ]
Dursun, Cagatay [8 ,9 ]
Fatima, Reham [1 ]
Hardy, Matthew [1 ]
Hebbar, Prajna [3 ]
Hollis, Zoe [1 ]
James, Benjamin T. [4 ,5 ]
Jiang, Yunzhe [8 ,9 ]
Johnson, Rory [10 ,11 ]
Kaur, Gazaldeep [2 ]
Kay, Mike [1 ]
Mangan, Riley J. [4 ,5 ,12 ]
Maquedano, Miguel [6 ]
Martinez Gomez, Laura [6 ]
Mathlouthi, Nourhen [1 ]
Merritt, Ryan [1 ]
Ni, Pengyu [8 ,9 ]
Palumbo, Emilio [2 ]
Perteghella, Tamara [2 ,13 ]
Pozo, Fernando [6 ]
Raj, Shriya [1 ]
Sisu, Cristina [9 ,14 ]
Steed, Emily [1 ]
Sumathipala, Dulika [1 ]
Suner, Marie-Marthe [1 ]
Uszczynska-Ratajczak, Barbara [15 ]
Wass, Elizabeth [1 ]
Yang, Yucheng T. [9 ,16 ]
Zhang, Dingyao [8 ,9 ]
Finn, Robert D. [1 ]
Gerstein, Mark [8 ,9 ]
Guigo, Roderic [2 ,13 ]
Hubbard, Tim J. P. [17 ,18 ]
Kellis, Manolis [4 ,5 ]
机构
[1] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Cambridge CB10 1SD, England
[2] Barcelona Inst Sci & Technol, Ctr Genom Regulat CRG, Dr Aiguader 88, Barcelona 08003, Catalonia, Spain
[3] Univ Calif Santa Cruz, Genom Inst, 2300 Delaware Ave, Santa Cruz, CA 95060 USA
[4] MIT, Comp Sci & Artificial Intelligence Lab, 32 Vassar St, Cambridge, MA 02139 USA
[5] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02142 USA
[6] Spanish Natl Canc Res Ctr CNIO, Bioinformat Unit, Calle Melchor Fernandez Almagro 3, Madrid 28029, Spain
[7] Stanford Univ, Dept Comp Sci, 353 Jane Stanford Way, Stanford, CA 94305 USA
[8] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[9] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[10] Bern Univ Hosp, Dept Med Oncol, Murtenstr 35, CH-3008 Bern, Switzerland
[11] Univ Coll Dublin, Sch Biol & Environm Sci, Dublin D04 V1W8 4, Ireland
[12] Harvard Med Sch, Genet Training Program, Boston, MA 02115 USA
[13] Univ Pompeu Fabra, Dept Ciencies Expt & Salut, Carrer Merce 12, Barcelona 08002, Spain
[14] Brunel Univ London, Dept Life Sci, Kingston Lane, London UB8 3PH, England
[15] Polish Acad Sci, Inst Bioorgan Chem, Dept Computat Biol Noncoding RNA, Noskowskiego 12-14, PL-61704 Poznan, Poland
[16] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, 220 Handan Rd, Shanghai 200433, Peoples R China
[17] Kings Coll London, Guys Hosp, Dept Med & Mol Genet, London SE1 9RT, England
[18] ELIXIR Hub, Wellcome Genome Campus, Cambridge CB10 1SD, England
[19] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
基金
美国国家卫生研究院; 英国惠康基金;
关键词
SEQUENCE;
D O I
10.1093/nar/gkae1078
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies and methodologies allow us to catalog the genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables us to identify large numbers of missing transcripts and to substantially improve existing models, and our long non-coding RNA catalogs have undergone a dramatic expansion and reconfiguration as a result. Meanwhile, we are incorporating data from state-of-the-art proteomics and Ribo-seq experiments to fine-tune our annotation of translated sequences, while further insights into function can be gained from multi-genome alignments that grow richer as more species' genomes are sequenced. Such methodologies are combined into a fully integrated annotation workflow. However, the increasing complexity of our resources can present usability challenges, and we are resolving these with the creation of filtered genesets such as MANE Select and GENCODE Primary. The next challenge is to propagate annotations throughout multiple human and mouse genomes, as we enter the pangenome era. Our resources are freely available at our web portal www.gencodegenes.org, and via the Ensembl and UCSC genome browsers. [GRAPHICS] .
引用
收藏
页码:D966 / D975
页数:10
相关论文
共 50 条
  • [21] Assembly and annotation of an Ashkenazi human reference genome
    Alaina Shumate
    Aleksey V. Zimin
    Rachel M. Sherman
    Daniela Puiu
    Justin M. Wagner
    Nathan D. Olson
    Mihaela Pertea
    Marc L. Salit
    Justin M. Zook
    Steven L. Salzberg
    Genome Biology, 21
  • [22] Automating Gene Expression Annotation for Mouse Embryo
    Han, Liangxiu
    van Hemert, Jano
    Baldock, Richard
    Atkinson, Malcolm
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 469 - +
  • [23] AUTOMATED GENE EXPRESSION PATTERN ANNOTATION IN THE MOUSE BRAIN
    Yang, Tao
    Zhao, Xinlin
    Lin, Binbin
    Zeng, Tao
    Ji, Shuiwang
    Ye, Jieping
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2015 (PSB), 2015, : 144 - 155
  • [24] The GENCODE exome: sequencing the complete human exome
    Coffey, Alison J.
    Kokocinski, Felix
    Calafato, Maria S.
    Scott, Carol E.
    Palta, Priit
    Drury, Eleanor
    Joyce, Christopher J.
    LeProust, Emily M.
    Harrow, Jen
    Hunt, Sarah
    Lehesjoki, Anna-Elina
    Turner, Daniel J.
    Hubbard, Tim J.
    Palotie, Aarno
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2011, 19 (07) : 827 - 831
  • [25] 'Deep dive' disease gene re-annotation in GENCODE: identifying and reporting new variant interpretations of likely clinical relevance.
    Mudge, J. M.
    Hunt, T.
    Gonzalez, J. M.
    Frankish, A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 668 - 669
  • [26] Mouse phenogenomics, toolbox for functional annotation of human genome
    Kim, Il Yong
    Shin, Jae Hoon
    Seong, Je Kyung
    BMB REPORTS, 2010, 43 (02) : 79 - 90
  • [27] The Gene Wiki: community intelligence applied to human gene annotation
    Huss, Jon W., III
    Lindenbaum, Pierre
    Martone, Michael
    Roberts, Donabel
    Pizarro, Angel
    Valafar, Faramarz
    Hogenesch, John B.
    Su, Andrew I.
    NUCLEIC ACIDS RESEARCH, 2010, 38 : D633 - D639
  • [28] Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
    Amid, Clara
    Rehaume, Linda M.
    Brown, Kelly L.
    Gilbert, James G. R.
    Dougan, Gordon
    Hancock, Robert E. W.
    Harrow, Jennifer L.
    BMC GENOMICS, 2009, 10
  • [29] Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome
    Clara Amid
    Linda M Rehaume
    Kelly L Brown
    James GR Gilbert
    Gordon Dougan
    Robert EW Hancock
    Jennifer L Harrow
    BMC Genomics, 10
  • [30] Curation and annotation of planarian gene expression patterns with segmented reference morphologies
    Roy, Joy
    Cheung, Eric
    Bhatti, Junaid
    Muneem, Abraar
    Lobo, Daniel
    BIOINFORMATICS, 2020, 36 (09) : 2881 - 2887