Sequence-structure patterns: Discovery and applications

被引:0
|
作者
Milledge, T [1 ]
Khuri, S [1 ]
Wei, X [1 ]
Yang, C [1 ]
Zheng, G [1 ]
Narasimhan, G [1 ]
机构
[1] Florida Int Univ, Sch Comp Sci, BioRG, Miami, FL 33199 USA
关键词
pattern discovery; sequence alignment; structure alignment; sequence-structure patterns;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein sequence data is being generated at a tremendous rate; however, functional annotation of these proteins is proceeding at a much slower pace. Biologists rely on computational biology and pattern recognition to predict the functionality of proteins. This is based on the fact that proteins that share a similar function often exhibit conserved sequence patterns. Such sequence patterns, or motifs, are derived from multiple sequence alignments and have been collected in databases such as PROSITE, PRINTS, SPAT, and eMOTIF. These patterns help to classify proteins into families where the exact function may or may not be known. Research has shown that these domain signatures often exhibit specific three-dimensional structures. In this paper, we show how starting from a seed sequence pattern from any of the existing sequence pattern databases, and using information from the protein structure databases, it is possible to design biologically meaningful sequencestructure patterns (SSPs). An important by-product of our method to generate sequence-structure patterns is an improved sequence alignment as well as an improved structural alignment of proteins belonging to a family and containing that pattern. Validation was performed by matching the resulting SSPs to domains in the ASTRAL compendium associated with a family or super-family designation in the SCOP database. SSPs generated by this method were frequently either fully specific (no false positives), fully sensitive (no false negatives), or both (diagnostic).
引用
收藏
页码:1282 / 1285
页数:4
相关论文
共 50 条
  • [1] Discovery of sequence-structure patterns across diverse proteins
    Berger, B
    BIOPHYSICAL JOURNAL, 2003, 84 (02) : 2A - 2A
  • [2] TRILOGY: Discovery of sequence-structure patterns across diverse proteins
    Bradley, P
    Kim, PS
    Berger, B
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (13) : 8500 - 8505
  • [3] Discovering sequence-structure patterns in proteins with variable secondary structure
    Milledge, Tom
    Zheng, Gaolin
    Narasimhan, Giri
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 702 - 709
  • [4] Clustering of Protein Substructures for Discovery of a Novel Class of Sequence-Structure Fragments
    Rudolfova, Ivana
    Zendulka, Jaroslav
    Lexa, Matej
    INFORMATION TECHNOLOGY IN BIO- AND MEDICAL INFORMATICS, 2010, 6266 : 94 - 101
  • [5] Sequence-structure relationships in proteins
    Elber, R
    Qiu, J
    Meyerguz, L
    Kleinberg, J
    Soft Condensed Matter Physics in Molecular and Cell Biology, 2006, : 201 - 224
  • [6] Sequence-structure relations of biopolymers
    Barrett, Christopher
    Huang, Fenix W.
    Reidys, Christian M.
    BIOINFORMATICS, 2017, 33 (03) : 382 - 389
  • [7] STATISTICS OF SEQUENCE-STRUCTURE THREADING
    BRYANT, SH
    ALTSCHUL, SF
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) : 236 - 244
  • [8] The Boltzmann Sequence-Structure Channel
    Magner, Abram
    Kihara, Daisuke
    Szpankowski, Wojciech
    2016 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2016, : 255 - 259
  • [9] A Bayes-optimal sequence-structure theory that unifies protein sequence-structure recognition and alignment
    Richard H. Lathrop
    Robert G. Rogers
    Temple F. Smith
    James V. White
    Bulletin of Mathematical Biology, 1998, 60 (6) : 1039 - 1071
  • [10] A Bayes-optimal sequence-structure theory that unifies protein sequence-structure recognition and alignment
    Lathrop, RH
    Rogers, RG
    Smith, TF
    White, JV
    BULLETIN OF MATHEMATICAL BIOLOGY, 1998, 60 (06) : 1039 - 1071