Sequence-structure patterns: Discovery and applications

被引:0
|
作者
Milledge, T [1 ]
Khuri, S [1 ]
Wei, X [1 ]
Yang, C [1 ]
Zheng, G [1 ]
Narasimhan, G [1 ]
机构
[1] Florida Int Univ, Sch Comp Sci, BioRG, Miami, FL 33199 USA
关键词
pattern discovery; sequence alignment; structure alignment; sequence-structure patterns;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein sequence data is being generated at a tremendous rate; however, functional annotation of these proteins is proceeding at a much slower pace. Biologists rely on computational biology and pattern recognition to predict the functionality of proteins. This is based on the fact that proteins that share a similar function often exhibit conserved sequence patterns. Such sequence patterns, or motifs, are derived from multiple sequence alignments and have been collected in databases such as PROSITE, PRINTS, SPAT, and eMOTIF. These patterns help to classify proteins into families where the exact function may or may not be known. Research has shown that these domain signatures often exhibit specific three-dimensional structures. In this paper, we show how starting from a seed sequence pattern from any of the existing sequence pattern databases, and using information from the protein structure databases, it is possible to design biologically meaningful sequencestructure patterns (SSPs). An important by-product of our method to generate sequence-structure patterns is an improved sequence alignment as well as an improved structural alignment of proteins belonging to a family and containing that pattern. Validation was performed by matching the resulting SSPs to domains in the ASTRAL compendium associated with a family or super-family designation in the SCOP database. SSPs generated by this method were frequently either fully specific (no false positives), fully sensitive (no false negatives), or both (diagnostic).
引用
收藏
页码:1282 / 1285
页数:4
相关论文
共 50 条
  • [31] The sequence-structure relationship and protein function prediction
    Sadowski, M. I.
    Jones, D. T.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2009, 19 (03) : 357 - 362
  • [32] Insertions and deletions in the RNA sequence-structure map
    Martin, Nora S.
    Ahnert, Sebastian E.
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2021, 18 (183)
  • [33] Model identification for DNA sequence-structure relationships
    Hawley, Stephen Dwyer
    Chiu, Anita
    Chizeck, Howard Jay
    MATHEMATICAL BIOSCIENCES, 2006, 204 (01) : 119 - 131
  • [34] Generic properties of the sequence-structure relations of biopolymers
    Stadler, PF
    EXOBIOLOGY: MATTER, ENERGY, AND INFORMATION IN THE ORIGIN AND EVOLUTION OF LIFE IN THE UNIVERSE, 1998, : 149 - 156
  • [35] Servers for sequence-structure relationship analysis and prediction
    Dosztányi, Z
    Magyar, C
    Tusnády, GE
    Cserzo, M
    Fiser, A
    Simon, I
    NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3359 - 3363
  • [36] Exploring the sequence-structure relationship for amyloid peptides
    Morris, Kyle L.
    Rodger, Alison
    Hicks, Matthew R.
    Debulpaep, Maya
    Schymkowitz, Joost
    Rousseau, Frederic
    Serpell, Louise C.
    BIOCHEMICAL JOURNAL, 2013, 450 : 275 - 283
  • [37] JOY: protein sequence-structure representation and analysis
    Mizuguchi, K
    Deane, CM
    Blundell, TL
    Johnson, MS
    Overingon, JP
    BIOINFORMATICS, 1998, 14 (07) : 617 - 623
  • [38] Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns
    Meyer, Fernando
    Kurtz, Stefan
    Beckstette, Michael
    BMC BIOINFORMATICS, 2013, 14
  • [39] Use of residue pairs in protein sequence-sequence and sequence-structure alignments
    Jung, JS
    Lee, B
    PROTEIN SCIENCE, 2000, 9 (08) : 1576 - 1588
  • [40] Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns
    Fernando Meyer
    Stefan Kurtz
    Michael Beckstette
    BMC Bioinformatics, 14