The TIGRFAMs database of protein families

Cited by: 627
Authors
Haft, DH [1 ]
Selengut, JD [1 ]
White, O [1 ]
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
DOI
10.1093/nar/gkg128
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
TIGRFAMs is a collection of manually curated protein families consisting of hidden Markov models (HMMs), multiple sequence alignments, commentary, Gene Ontology (GO) assignments, literature references and pointers to related TIGRFAMs, Pfam and InterPro models. These models are designed to support both automated and manually curated annotation of genomes. TIGRFAMs contains models of full-length proteins and shorter regions at the levels of superfamilies, subfamilies and equivalogs, where equivalogs are sets of homologous proteins conserved with respect to function since their last common ancestor. The scope of each model is set by raising or lowering cutoff scores and choosing members of the seed alignment to group proteins sharing specific function (equivalog) or more general properties. The overall goal is to provide information with maximum utility for the annotation process. TIGRFAMs is thus complementary to Pfam, whose models typically achieve broad coverage across distant homologs but end at the boundaries of conserved structural domains. The database currently contains over 1600 protein families. TIGRFAMs is available for searching or downloading at www.tigr.org/TIGRFAMs.
Pages: 371-373
Page count: 3