Mining super-secondary structure motifs from 3D protein structures: A sequence order independent approach

被引:0
|
作者
Aung, Zeyar [1 ]
Li, Jinyan [2 ]
机构
[1] Inst Infocomm Res, 21 Heng Mui Keng Terrace, Singapore 119613, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
关键词
3D protein structure; super-secondary structure; structural motifs mining; DISCOVERY; PACKING; ALGORITHM;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Super-Secondary structure elements (super-SSEs) are the structurally conserved ensembles of secondary structure elements (SSEs) within a protein. They are of great biological interest. In this work, we present a method to formally represent and mine the sequence order independent super-SSE motifs that occur repeatedly in large data sets of protein structures. We represent a protein structure as a graph, and mine the common cliques from a set of protein graphs in order to find the motifs. We mine two categories of super-SSE motifs: the generic motifs that occur frequently across the entire database of protein structures, and the fold-preferential motifs that are concentrated in particular protein fold types. From the experimental data set of 600 proteins belonging to 15 large SCOP Folds, we have discovered 21 generic motifs and 75 fold-preferential motifs that are both statistically significant and biologically relevant. A number of the discovered motifs (both generic and fold-preferential) resemble the well-known super-SSE motifs in the literature such as beta hairpins, Greek keys, zinc fingers, etc. Some of the discovered motifs are of novel shapes that have not been documented yet. Our method is time-efficient where it can discover all the motifs across the 600 proteins in less than 14 minutes on a standalone PC. The discovered motifs are reported in our project webpage: http://www1.i2r.a-star.edu.sg/similar to azeyar/SuperSSE/.
引用
收藏
页码:15 / +
页数:3
相关论文
共 50 条
  • [21] 3D structure from uncalibrated image sequence
    Fu, Dan
    Qiu, Zhiqiang
    Yu, Qifeng
    ISSCAA 2006: 1st International Symposium on Systems and Control in Aerospace and Astronautics, Vols 1and 2, 2006, : 224 - 228
  • [22] Modeling the 3D structure of GPCRs from sequence
    Shacham, S
    Topf, M
    Avisar, N
    Glaser, F
    Marantz, Y
    Bar-Haim, S
    Noiman, S
    Naor, Z
    Becker, OM
    MEDICINAL RESEARCH REVIEWS, 2001, 21 (05) : 472 - 483
  • [23] Amalgamation of 3D structure and sequence information for protein–protein interaction prediction
    Kanchan Jha
    Sriparna Saha
    Scientific Reports, 10
  • [24] Towards 3D structure prediction of large RNA molecules: an integer programming framework to insert local 3D motifs in RNA secondary structure
    Reinharz, Vladimir
    Major, Francois
    Waldispuehl, Jerome
    BIOINFORMATICS, 2012, 28 (12) : I207 - I214
  • [25] GRAPHLET DATA MINING OF ENERGETICAL INTERACTION PATTERNS IN PROTEIN 3D STRUCTURES
    Henneges, Carsten
    Roettig, Marc
    Kohlbacher, Oliver
    Zell, Andreas
    ICFC 2010/ ICNC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION AND INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION, 2010, : 190 - 195
  • [26] ARTEMIS: a method for topology-independent superposition of RNA 3D structures and structure-based sequence alignment
    Bohdan, Davyd R.
    Bujnicki, Janusz M.
    Baulin, Eugene F.
    NUCLEIC ACIDS RESEARCH, 2024, 52 (18) : 10850 - 10861
  • [27] Concurrent prediction of RNA secondary structures with pseudoknots and local 3D motifs in an integer programming framework
    Loyer, Gabriel
    Reinharz, Vladimir
    BIOINFORMATICS, 2024, 40 (02)
  • [28] Spiral mining using attributes from 3D molecular structures
    Okada, T
    Yamakawa, M
    Niitsuma, H
    ACTIVE MINING, 2005, 3430 : 287 - 302
  • [29] Generation of 3D model with super resolved texture from image sequence
    Nakamura, K
    Saito, H
    Ozawa, S
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 1406 - 1411
  • [30] Amalgamation of 3D structure and sequence information for protein-protein interaction prediction
    Jha, Kanchan
    Saha, Sriparna
    SCIENTIFIC REPORTS, 2020, 10 (01)