Gene3D: expanding the utility of domain assignments

被引:43
|
作者
Lam, Su Datt [1 ]
Dawson, Natalie L. [1 ]
Das, Sayoni [1 ]
Sillitoe, Ian [1 ]
Ashford, Paul [1 ]
Lee, David [1 ]
Lehtinen, Sonja [1 ,2 ]
Orengo, Christine A. [1 ]
Lees, Jonathan G. [1 ]
机构
[1] UCL, Inst Struct & Mol Biol, Div Biosci, Gower St, London WC1E 6BT, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Infect Dis Epidemiol, St Marys Campus,Norfolk Pl, London W2 1PG, England
基金
英国生物技术与生命科学研究理事会;
关键词
PROTEIN; SUPERFAMILIES; ANNOTATIONS; DATABASE;
D O I
10.1093/nar/gkv1231
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene3D http://gene3d.biochem.ucl.ac.uk is a database of domain annotations of Ensembl and UniProtKB protein sequences. Domains are predicted using a library of profile HMMs representing 2737 CATH superfamilies. Gene3D has previously featured in the Database issue of NAR and here we report updates to the website and database. The current Gene3D (v14) release has expanded its domain assignments to similar to 20 000 cellular genomes and over 43 million unique protein sequences, more than doubling the number of protein sequences since our last publication. Amongst other updates, we have improved our Functional Family annotation method. We have also improved the quality and coverage of our 3D homology modelling pipeline of predicted CATH domains. Additionally, the structural models have been expanded to include an extra model organism (Drosophila melanogaster). We also document a number of additional visualization tools in the Gene3D website.
引用
收藏
页码:D404 / D409
页数:6
相关论文
共 50 条
  • [1] Gene3D: structural assignments for the biologist and bioinformaticist alike
    Buchan, DWA
    Rison, SCG
    Bray, JE
    Lee, D
    Pearl, F
    Thornton, JM
    Orengo, CA
    NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 469 - 473
  • [2] Gene3D: Multi-domain annotations for protein sequence and comparative genome analysis
    Lees, Jonathan G.
    Lee, David
    Studer, Romain A.
    Dawson, Natalie L.
    Sillitoe, Ian
    Das, Sayoni
    Yeats, Corin
    Dessailly, Benoit H.
    Rentzsch, Robert
    Orengo, Christine A.
    NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D240 - D245
  • [3] Gene3D: comprehensive structural and functional annotation of genomes
    Yeats, Corin
    Lees, Jonathan
    Reid, Adam
    Kellam, Paul
    Martin, Nigel
    Liu, Xinhui
    Orengo, Christine
    NUCLEIC ACIDS RESEARCH, 2008, 36 : D414 - D418
  • [4] Gene3D: modelling protein structure, function and evolution
    Yeats, Corin
    Maibaum, Michael
    Marsden, Russell
    Dibley, Mark
    Lee, David
    Addou, Sarah
    Orengo, Christine A.
    NUCLEIC ACIDS RESEARCH, 2006, 34 : D281 - D284
  • [5] Gene3D: merging structure and function for a Thousand genomes
    Lees, Jonathan
    Yeats, Corin
    Redfern, Oliver
    Clegg, Andrew
    Orengo, Christine
    NUCLEIC ACIDS RESEARCH, 2010, 38 : D296 - D300
  • [6] Gene3D: Structural assignment for whole genes and genomes using the CATH domain structure database
    Buchan, DWA
    Shepherd, AJ
    Lee, D
    Pearl, FMG
    Rison, SCG
    Thornton, JM
    Orengo, CA
    GENOME RESEARCH, 2002, 12 (03) : 503 - 514
  • [7] The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis
    Pearl, F
    Todd, A
    Sillitoe, I
    Dibley, M
    Redfern, O
    Lewis, T
    Bennett, C
    Marsden, R
    Grant, A
    Lee, D
    Akpor, A
    Maibaum, M
    Harrison, A
    Dallman, T
    Reeves, G
    Diboun, I
    Addou, S
    Lise, S
    Johnston, C
    Sillero, A
    Thornton, J
    Orengo, C
    NUCLEIC ACIDS RESEARCH, 2005, 33 : D247 - D251
  • [8] Gene3D: a domain-based resource for comparative genomics, functional annotation and protein network analysis
    Lees, Jonathan
    Yeats, Corin
    Perkins, James
    Sillitoe, Ian
    Rentzsch, Robert
    Dessailly, Benoit H.
    Orengo, Christine
    NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D465 - D471
  • [9] Identification and distribution of protein families in 120 completed genomes using Gene3D
    Lee, D
    Grant, A
    Marsden, RL
    Orengo, C
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 59 (03) : 603 - 615
  • [10] Gene3D: Extensive prediction of globular domains in proteins (vol 46, pg 435, 2017)
    Lewis, Tony E.
    Sillitoe, Ian
    Dawson, Natalie
    Lam, Su Datt
    Clarke, Tristan
    Lee, David
    Orengo, Christine
    Lees, Jonathan
    NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) : D1282 - D1282