Sequence-structure patterns: Discovery and applications

Authors
Milledge, T [1]
Khuri, S [1]
Wei, X [1]
Yang, C [1]
Zheng, G [1]
Narasimhan, G [1]
Affiliation
[1] Florida Int Univ, Sch Comp Sci, BioRG, Miami, FL 33199 USA
Keywords
pattern discovery; sequence alignment; structure alignment; sequence-structure patterns
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Protein sequence data is being generated at a tremendous rate; however, functional annotation of these proteins is proceeding at a much slower pace. Biologists rely on computational biology and pattern recognition to predict protein function. This approach rests on the observation that proteins sharing a similar function often exhibit conserved sequence patterns. Such sequence patterns, or motifs, are derived from multiple sequence alignments and have been collected in databases such as PROSITE, PRINTS, SPAT, and eMOTIF. These patterns help to classify proteins into families whose exact function may or may not be known. Research has shown that these domain signatures often exhibit specific three-dimensional structures. In this paper, we show that, starting from a seed sequence pattern taken from any of the existing sequence pattern databases and using information from the protein structure databases, it is possible to design biologically meaningful sequence-structure patterns (SSPs). An important by-product of our method for generating sequence-structure patterns is an improved sequence alignment, as well as an improved structural alignment, of the proteins that belong to a family and contain that pattern. Validation was performed by matching the resulting SSPs to domains in the ASTRAL compendium associated with a family or superfamily designation in the SCOP database. SSPs generated by this method were frequently either fully specific (no false positives), fully sensitive (no false negatives), or both (diagnostic).
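The validation vocabulary in the abstract (fully specific, fully sensitive, diagnostic) can be illustrated with a minimal Python sketch. It covers only the sequence half of the idea: a PROSITE-style seed pattern is translated to a regular expression and matched against labelled sequences. The function names `prosite_to_regex` and `evaluate`, the toy pattern, and the toy sequences are all hypothetical and are not taken from the paper; the structural component of an SSP and the PROSITE anchors `<` and `>` are not modelled.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a simplified PROSITE-style pattern into a Python regex.

    Supported elements: single residues, x (any residue), [ABC] (allowed set),
    {ABC} (forbidden set), and repetition suffixes such as (3) or (2,4).
    """
    parts = []
    for element in pattern.rstrip(".").split("-"):
        # Separate the element core from an optional repetition suffix.
        m = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, lo, hi = m.group(1), m.group(2), m.group(3)
        if core == "x":
            token = "."
        elif core.startswith("[") and core.endswith("]"):
            token = core
        elif core.startswith("{") and core.endswith("}"):
            token = "[^" + core[1:-1] + "]"
        else:
            token = re.escape(core)
        if lo is not None:
            token += "{" + lo + ("," + hi if hi else "") + "}"
        parts.append(token)
    return "".join(parts)


def evaluate(pattern: str, sequences: dict, family: set) -> dict:
    """Match a pattern against named sequences and classify the outcome.

    `family` holds the names of sequences known to belong to the family.
    """
    regex = re.compile(prosite_to_regex(pattern))
    hits = {name for name, seq in sequences.items() if regex.search(seq)}
    false_pos = hits - family   # matched, but not family members
    false_neg = family - hits   # family members that were missed
    return {
        "false_positives": false_pos,
        "false_negatives": false_neg,
        "fully_specific": not false_pos,
        "fully_sensitive": not false_neg,
        "diagnostic": not false_pos and not false_neg,
    }


# Toy data, invented for illustration only.
toy_sequences = {
    "p1": "AACGGCAAALKK",
    "p2": "MCTTCQQQVAA",
    "p3": "MKKLLAGG",
}
toy_family = {"p1", "p2"}
result = evaluate("C-x(2)-C-x(3)-[LIVM]", toy_sequences, toy_family)
print(result["diagnostic"])
```

In this toy case the pattern matches both family members and nothing else, so it would count as diagnostic in the abstract's sense; adding a non-family sequence that matches would make it fully sensitive but no longer fully specific.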
Pages: 1282-1285
Page count: 4