FAMCS: Finding All Maximal Common Substructures in Proteins

被引:0
|
作者
Anthony K. H. Tung [1 ]
机构
[1] Department of Computer Science National University of Singapore.Singapore 117543
关键词
protein structure; maximal common substructures; secondary structure element (SSE);
D O I
暂无
中图分类号
Q51 [蛋白质];
学科分类号
071010 ; 081704 ;
摘要
Finding the common substructures shared by two proteins is considered as one of the central issues in computational biology because of its usefulness in understand- ing the structure-function relationship and application in drug and vaccine design. In this paper, we propose a novel algorithm called FAMCS (Finding All Maximal Common Substructures) for the common substructure identification problem. Our method works initially at the protein secondary structural element (SSE) level and starts with the identification of all structurally similar SSE pairs. These SSE pairs are then merged into sets using a modified Apriori algorithm, which will test the similarity of various sets of SSE pairs incrementally until all the maximal sets of SSE pairs that deemed to be similar are found. The maximal common substructures of the two proteins will be formed from these maximal sets. A refinement algorithm is also proposed to fine tune the alignment from the SSE level to the residue level. Comparison of FAMCS with other methods on various proteins shows that FAMCS can address all four requirements and infer interesting biological discoveries.
引用
收藏
页码:107 / 119
页数:13
相关论文
共 50 条
  • [1] IDENTIFICATION OF MAXIMAL COMMON SUBSTRUCTURES IN STRUCTURE ACTIVITY STUDIES
    YUAN, SG
    ZHENG, CZ
    ZHAO, X
    ZENG, FY
    ANALYTICA CHIMICA ACTA, 1990, 235 (01) : 239 - 241
  • [2] Finding Largest Common Substructures of Molecules in Quadratic Time
    Droschinsky, Andre
    Kriege, Nils
    Mutzel, Petra
    SOFSEM 2017: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2017, 10139 : 309 - 321
  • [3] ALGORITHMS FOR THE IDENTIFICATION OF 3-DIMENSIONAL MAXIMAL COMMON SUBSTRUCTURES
    BRINT, AT
    WILLETT, P
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1987, 27 (04): : 152 - 158
  • [4] A Parallel Algorithm for Finding All Pairs k-Mismatch Maximal Common Substrings
    Chockalingam, Sriram P.
    Thankachan, Sharma V.
    Aluru, Srinivas
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 784 - 794
  • [5] An Efficient Algorithm for Finding All Pairs k-Mismatch Maximal Common Substrings
    Thankachan, Sharma V.
    Chockalingam, Sriram P.
    Aluru, Srinivas
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2016, 2016, 9683 : 3 - 14
  • [6] Finding a Summary for All Maximal Cliques
    Li, Xiaofan
    Zhou, Rui
    Chen, Lu
    Zhang, Yong
    Liu, Chengfei
    He, Qiang
    Yang, Yun
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 1344 - 1355
  • [7] An algorithm for finding the largest approximately common substructures of two trees
    Wang, JTL
    Shapiro, BA
    Shasha, D
    Zhang, KZ
    Currey, KM
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 889 - 895
  • [8] Algorithm for finding the largest approximately common substructures of two trees
    New Jersey Inst of Technology, Newark, United States
    IEEE Trans Pattern Anal Mach Intell, 8 (889-895):
  • [9] MOLECULAR-STRUCTURE COMPARISON PROGRAM FOR IDENTIFICATION OF MAXIMAL COMMON SUBSTRUCTURES
    CONE, MM
    VENKATARAGHAVAN, R
    MCLAFFERTY, FW
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1977, 99 (23) : 7668 - 7671
  • [10] DETECTION OF COMMON 3-DIMENSIONAL SUBSTRUCTURES IN PROTEINS
    VRIEND, G
    SANDER, C
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1991, 11 (01) : 52 - 58