Extended Graph-Based Models for Enhanced Similarity Search in Cavbase

被引:8
|
作者
Krotzky, Timo [1 ]
Fober, Thomas [2 ]
Huellermeier, Eyke [3 ]
Klebe, Gerhard [1 ]
机构
[1] Univ Marburg, Dept Pharmaceut Chem, D-35032 Marburg, Germany
[2] Univ Marburg, Dept Math & Comp Sci, D-35032 Marburg, Germany
[3] Univ Paderborn, Dept Comp Sci, D-33098 Paderborn, Germany
关键词
Cavbase; protein binding site; similarity measure; distance; structural alignment; maximum common subgraph; PROTEIN-STRUCTURE; BINDING-SITES; FUNCTIONAL-RELATIONSHIPS; STRUCTURE ALIGNMENT; CLASSIFICATION; SEQUENCE; DATABASE; RECOGNITION; DESIGN;
D O I
10.1109/TCBB.2014.2325020
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
To calculate similarities between molecular structures, measures based on the maximum common subgraph are frequently applied. For the comparison of protein binding sites, these measures are not fully appropriate since graphs representing binding sites on a detailed atomic level tend to get very large. In combination with an NP-hard problem, a large graph leads to a computationally demanding task. Therefore, for the comparison of binding sites, a less detailed coarse graph model is used building upon so-called pseudocenters. Consistently, a loss of structural data is caused since many atoms are discarded and no information about the shape of the binding site is considered. This is usually resolved by performing subsequent calculations based on additional information. These steps are usually quite expensive, making the whole approach very slow. The main drawback of a graph-based model solely based on pseudocenters, however, is the loss of information about the shape of the protein surface. In this study, we propose a novel and efficient modeling formalism that does not increase the size of the graph model compared to the original approach, but leads to graphs containing considerably more information assigned to the nodes. More specifically, additional descriptors considering surface characteristics are extracted from the local surface and attributed to the pseudocenters stored in Cavbase. These properties are evaluated as additional node labels, which lead to a gain of information and allow for much faster but still very accurate comparisons between different structures.
引用
收藏
页码:878 / 890
页数:13
相关论文
共 50 条
  • [1] Extended graph-based models for enhanced similarity retrieval in Cavbase
    Timo Krotzky
    Thomas Fober
    Marco Mernberger
    Gerhard Klebe
    Eyke Hüllermeier
    [J]. Journal of Cheminformatics, 5 (Suppl 1)
  • [2] High Quality Graph-Based Similarity Search
    Yu, Weiren
    McCann, Julie A.
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 83 - 92
  • [3] ELPIS: Graph-Based Similarity Search for Scalable Data Science
    Azizi, Ilias
    Echihabi, Karima
    Palpanas, Themis
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (06): : 1548 - 1559
  • [4] An Enhanced Graph-based Infrastructure for Software Search Engines
    Schumacher, Marcus
    Atkinson, Colin
    [J]. 12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 386 - 390
  • [5] A graph-based cache for large-scale similarity search engines
    Gil-Costa, Veronica
    Marin, Mauricio
    Bonacic, Carolina
    Solar, Roberto
    [J]. JOURNAL OF SUPERCOMPUTING, 2018, 74 (05): : 2006 - 2034
  • [6] A graph-based cache for large-scale similarity search engines
    Veronica Gil-Costa
    Mauricio Marin
    Carolina Bonacic
    Roberto Solar
    [J]. The Journal of Supercomputing, 2018, 74 : 2006 - 2034
  • [7] A Review of Graph-Based Models for Entity-Oriented Search
    Devezas J.
    Nunes S.
    [J]. SN Computer Science, 2021, 2 (6)
  • [8] A meta-learning configuration framework for graph-based similarity search indexes
    Oyamada, Rafael S.
    Shimomura, Larissa C.
    Barbon Jr, Sylvio
    Kaster, Daniel S.
    [J]. INFORMATION SYSTEMS, 2023, 112
  • [9] Efficient Graph-Based Document Similarity
    Paul, Christian
    Rettinger, Achim
    Mogadala, Aditya
    Knoblock, Craig A.
    Szekely, Pedro
    [J]. SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, 2016, 9678 : 334 - 349
  • [10] Graph-Based Shape Similarity of Petroglyphs
    Seidl, Markus
    Wieser, Ewald
    Zeppelzauer, Matthias
    Pinz, Axel
    Breiteneder, Christian
    [J]. COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 133 - 148