The TIGRFAMs database of protein families

Cited by: 627
Authors
Haft, DH [1 ]
Selengut, JD [1 ]
White, O [1 ]
Affiliations
[1] Inst Genom Res, Rockville, MD 20850 USA
DOI
10.1093/nar/gkg128
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
TIGRFAMs is a collection of manually curated protein families consisting of hidden Markov models (HMMs), multiple sequence alignments, commentary, Gene Ontology (GO) assignments, literature references and pointers to related TIGRFAMs, Pfam and InterPro models. These models are designed to support both automated and manually curated annotation of genomes. TIGRFAMs contains models of full-length proteins and shorter regions at the levels of superfamilies, subfamilies and equivalogs, where equivalogs are sets of homologous proteins conserved with respect to function since their last common ancestor. The scope of each model is set by raising or lowering cutoff scores and choosing members of the seed alignment to group proteins sharing specific function (equivalog) or more general properties. The overall goal is to provide information with maximum utility for the annotation process. TIGRFAMs is thus complementary to Pfam, whose models typically achieve broad coverage across distant homologs but end at the boundaries of conserved structural domains. The database currently contains over 1600 protein families. TIGRFAMs is available for searching or downloading at www.tigr.org/TIGRFAMs.
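The abstract's point that a model's scope is set by its cutoff score can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `FamilyModel` class, the `assign_families` function, the model names and all numeric scores are hypothetical and are not taken from TIGRFAMs or from any search tool's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FamilyModel:
    """Hypothetical stand-in for one curated family model."""
    name: str
    level: str     # "superfamily", "subfamily" or "equivalog"
    cutoff: float  # minimum score for a hit to count as a member


def assign_families(scores, models):
    """Return the names of models whose cutoff the protein's score meets.

    `scores` maps model name -> score of one protein against that model.
    A model with no score for this protein is treated as a miss.
    """
    return [
        m.name
        for m in models
        if scores.get(m.name, float("-inf")) >= m.cutoff
    ]


# Illustrative values only: a permissive cutoff gives a broad model,
# a strict cutoff restricts a model to proteins sharing a specific function.
models = [
    FamilyModel("broad_superfamily", "superfamily", cutoff=50.0),
    FamilyModel("specific_equivalog", "equivalog", cutoff=300.0),
]
scores = {"broad_superfamily": 120.0, "specific_equivalog": 180.0}

print(assign_families(scores, models))
```

With these made-up numbers the protein is placed in the broad family but not in the equivalog; lowering the equivalog cutoff below 180 would widen that model's scope to include it.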
Pages: 371-373
Page count: 3
Related papers
50 in total
  • [21] The Lipase Engineering Database: a navigation and analysis tool for protein families
    Fischer, M
    Pleiss, J
    NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 319 - 321
  • [22] ProtBuD: a database of biological unit structures of protein families and superfamilies
    Xu, Qifang
    Canutescu, Adrian
    Obradovic, Zoran
    Dunbrack, Roland L., Jr.
    BIOINFORMATICS, 2006, 22 (23) : 2876 - 2882
  • [23] A DATABASE OF PROTEIN-STRUCTURE FAMILIES WITH COMMON FOLDING MOTIFS
    HOLM, L
    OUZOUNIS, C
    SANDER, C
    TUPAREV, G
    VRIEND, G
    PROTEIN SCIENCE, 1992, 1 (12) : 1691 - 1698
  • [24] The Pfam protein families database: towards a more sustainable future
    Finn, Robert D.
    Coggill, Penelope
    Eberhardt, Ruth Y.
    Eddy, Sean R.
    Mistry, Jaina
    Mitchell, Alex L.
    Potter, Simon C.
    Punta, Marco
    Qureshi, Matloob
    Sangrador-Vegas, Amaia
    Salazar, Gustavo A.
    Tate, John
    Bateman, Alex
    NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) : D279 - D285
  • [25] EyeSite: a semi-automated database of protein families in the eye
    Lee, DA
    Fefeu, S
    Edo-Ukeh, AA
    Orengo, CA
    Slingsby, C
    NUCLEIC ACIDS RESEARCH, 2004, 32 : D148 - D152
  • [26] TIGRFAMs and Genome Properties in 2013
    Haft, Daniel H.
    Selengut, Jeremy D.
    Richter, Roland A.
    Harkins, Derek
    Basu, Malay K.
    Beck, Erin
    NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D387 - D395
  • [27] sORFdb - a database for sORFs, small proteins, and small protein families in bacteria
    Hahnfeld, Julian M.
    Schwengers, Oliver
    Jelonek, Lukas
    Diedrich, Sonja
    Cemic, Franz
    Goesmann, Alexander
    BMC GENOMICS, 2025, 26 (01):
  • [28] NMPFamsDB: a database of novel protein families from microbial metagenomes and metatranscriptomes
    Baltoumas, Fotis A.
    Karatzas, Evangelos
    Liu, Sirui
    Ovchinnikov, Sergey
    Sofianatos, Yorgos
    Chen, I-Min
    Kyrpides, Nikos C.
    Pavlopoulos, Georgios A.
    NUCLEIC ACIDS RESEARCH, 2024, 52 (D1) : D502 - D512
  • [29] The InterPro protein families database: the classification resource after 15 years
    Mitchell, Alex
    Chang, Hsin-Yu
    Daugherty, Louise
    Fraser, Matthew
    Hunter, Sarah
    Lopez, Rodrigo
    McAnulla, Craig
    McMenamin, Conor
    Nuka, Gift
    Pesseat, Sebastien
    Sangrador-Vegas, Amaia
    Scheremetjew, Maxim
    Rato, Claudia
    Yong, Siew-Yit
    Bateman, Alex
    Punta, Marco
    Attwood, Teresa K.
    Sigrist, Christian J. A.
    Redaschi, Nicole
    Rivoire, Catherine
    Xenarios, Ioannis
    Kahn, Daniel
    Guyot, Dominique
    Bork, Peer
    Letunic, Ivica
    Gough, Julian
    Oates, Matt
    Haft, Daniel
    Huang, Hongzhan
    Natale, Darren A.
    Wu, Cathy H.
    Orengo, Christine
    Sillitoe, Ian
    Mi, Huaiyu
    Thomas, Paul D.
    Finn, Robert D.
    NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D213 - D221
  • [30] Pfam: A comprehensive database of protein domain families based on seed alignments
    Sonnhammer, ELL
    Eddy, SR
    Durbin, R
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1997, 28 (03) : 405 - 420