Mining super-secondary structure motifs from 3D protein structures: A sequence order independent approach

被引:0
|
作者
Aung, Zeyar [1 ]
Li, Jinyan [2 ]
机构
[1] Inst Infocomm Res, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
关键词
3D protein structure; super-secondary structure; structural motifs mining; DISCOVERY; PACKING; ALGORITHM;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Super-Secondary structure elements (super-SSEs) are the structurally conserved ensembles of secondary structure elements (SSEs) within a protein. They are of great biological interest. In this work, we present a method to formally represent and mine the sequence order independent super-SSE motifs that occur repeatedly in large data sets of protein structures. We represent a protein structure as a graph, and mine the common cliques from a set of protein graphs in order to find the motifs. We mine two categories of super-SSE motifs: the generic motifs that occur frequently across the entire database of protein structures, and the fold-preferential motifs that are concentrated in particular protein fold types. From the experimental data set of 600 proteins belonging to 15 large SCOP Folds, we have discovered 21 generic motifs and 75 fold-preferential motifs that are both statistically significant and biologically relevant. A number of the discovered motifs (both generic and fold-preferential) resemble the well-known super-SSE motifs in the literature such as beta hairpins, Greek keys, zinc fingers, etc. Some of the discovered motifs are of novel shapes that have not been documented yet. Our method is time-efficient where it can discover all the motifs across the 600 proteins in less than 14 minutes on a standalone PC. The discovered motifs are reported in our project webpage: http://www1.i2r.a-star.edu.sg/similar to azeyar/SuperSSE/.
引用
收藏
页码:15 / +
页数:3
相关论文
共 50 条
  • [41] A NOVEL-APPROACH FOR IDENTIFYING POTENTIAL ANTIGENIC DETERMINANTS FROM THE 3D STRUCTURE OF A PROTEIN
    THORNTON, JM
    EDWARDS, MS
    TAYLOR, WR
    BARLOW, DJ
    JOURNAL OF MOLECULAR GRAPHICS, 1985, 3 (03): : 105 - 106
  • [42] A new bioinformatic approach to detect common 3D sites in protein structures
    Jambon, M
    Imberty, A
    Deléage, G
    Geourjon, C
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 52 (02) : 137 - 145
  • [43] Unsupervised Discovery of Geometrically Common Structural Motifs and Long-Range Contacts in Protein 3D Structures
    Kaiser, Florian
    Labudde, Dirk
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 671 - 680
  • [44] The protein data bank - Bridging the gap between the sequence and 3D structure world
    Sussman, JL
    Abola, EE
    Lin, D
    Jiang, J
    Manning, NO
    Prilusky, J
    GENETICA, 1999, 106 (1-2) : 149 - 158
  • [45] 3dswap-pred: Prediction of 3D Domain Swapping from Protein Sequence Using Random Forest Approach
    Shameer, Khader
    Pugalenthi, Ganesan
    Kandaswamy, Krishna Kumar
    Sowdhamini, Ramanathan
    PROTEIN AND PEPTIDE LETTERS, 2011, 18 (10): : 1010 - 1020
  • [46] Modeling the structures of antibody-carbo hydrate complexes: From protein sequence to 3-D structure.
    Dyekiwaer, JD
    Cutler, JE
    Woods, RJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2004, 227 : U264 - U264
  • [47] iPBAvizu: a PyMOL plugin for an efficient 3D protein structure superimposition approach
    Faure, Guilhem
    Joseph, Agnel Praveen
    Craveur, Pierrick
    Narwani, Tarun J.
    Srinivasan, Narayanaswamy
    Gelly, Jean-Christophe
    Rebehmed, Joseph
    de Brevern, Alexandre G.
    SOURCE CODE FOR BIOLOGY AND MEDICINE, 2019, 14 (01):
  • [48] Towards 3D structures of G protein-coupled receptors:: A multidisciplinary approach
    Müller, G
    CURRENT MEDICINAL CHEMISTRY, 2000, 7 (09) : 861 - 888
  • [49] A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence
    Rice, DW
    Eisenberg, D
    JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (04) : 1026 - 1038
  • [50] ProFunc: a server for predicting protein function from 3D structure
    Laskowski, RA
    Watson, JD
    Thornton, JM
    NUCLEIC ACIDS RESEARCH, 2005, 33 : W89 - W93