Explaining protein-protein interactions with knowledge graph-based semantic similarity

被引:3
|
作者
Sousa, Rita T. [1 ]
Silva, Sara [1 ]
Pesquita, Catia [1 ]
机构
[1] Univ Lisbon, LASIGE, Fac Ciencias, Lisbon, Portugal
关键词
Machine learning; Explainable artificial intelligence; Knowledge graph; Semantic similarity; Protein-protein interaction prediction;
D O I
10.1016/j.compbiomed.2024.108076
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The application of artificial intelligence and machine learning methods for several biomedical applications, such as protein-protein interaction prediction, has gained significant traction in recent decades. However, explainability is a key aspect of using machine learning as a tool for scientific discovery. Explainable artificial intelligence approaches help clarify algorithmic mechanisms and identify potential bias in the data. Given the complexity of the biomedical domain, explanations should be grounded in domain knowledge which can be achieved by using ontologies and knowledge graphs. These knowledge graphs express knowledge about a domain by capturing different perspectives of the representation of real -world entities. However, the most popular way to explore knowledge graphs with machine learning is through using embeddings, which are not explainable. As an alternative, knowledge graph -based semantic similarity offers the advantage of being explainable. Additionally, similarity can be computed to capture different semantic aspects within the knowledge graph and increasing the explainability of predictive approaches. We propose a novel method to generate explainable vector representations, KGsim2vec, that uses aspectoriented semantic similarity features to represent pairs of entities in a knowledge graph. Our approach employs a set of machine learning models, including decision trees, genetic programming, random forest and eXtreme gradient boosting, to predict relations between entities. The experiments reveal that considering multiple semantic aspects when representing the similarity between two entities improves explainability and predictive performance. KGsim2vec performs better than black -box methods based on knowledge graph embeddings or graph neural networks. Moreover, KGsim2vec produces global models that can capture biological phenomena and elucidate data biases.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Semantic Similarity Measure with Conceptual Graph-Based Image Annotations
    Chinpanthana, Nutchanun
    2012 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2012, : 203 - 208
  • [32] Knowledge Graph-Based Hierarchical Text Semantic Representation
    Wu, Yongliang
    Pan, Xiao
    Li, Jinghui
    Dou, Shimao
    Dong, Jiahao
    Wei, Dan
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [33] Graph-based information diffusion method for prioritizing functionally related genes in protein-protein interaction networks
    Minh Pham
    Lichtarge, Olivier
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020, 2020, : 439 - 450
  • [34] Community detection for graph-based similarity: Application to protein binding pockets classification
    Mallek, Sabrine
    Boukhris, Imen
    Elouedi, Zied
    PATTERN RECOGNITION LETTERS, 2015, 62 : 49 - 54
  • [35] Predicting False Positives of Protein-Protein Interaction Data by Semantic Similarity Measures
    Montanez, George
    Cho, Young-Rae
    CURRENT BIOINFORMATICS, 2013, 8 (03) : 339 - 346
  • [36] Protein-Protein Interaction Identification Using a Similarity-Constrained Graph Model
    Niu, Yun
    Wu, Hongmei
    Wang, Yuwei
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 607 - 616
  • [37] Transferring network topological knowledge for predicting protein-protein interactions
    Xu, Qian
    Xiang, Evan Wei
    Yang, Qiang
    PROTEOMICS, 2011, 11 (19) : 3818 - 3825
  • [38] Prediction of Protein-Protein Interactions Based on Domain
    Li, Xue
    Yang, Lifeng
    Zhang, Xiaopan
    Jiao, Xiong
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2019, 2019
  • [39] Protein-protein interactions
    Creeth, J. M.
    NATURE, 1981, 294 (5839) : 384 - 384
  • [40] Automatic extraction of protein-protein interactions using grammatical relationship graph
    Yu, Kaixian
    Lung, Pei-Yau
    Zhao, Tingting
    Zhao, Peixiang
    Tseng, Yan-Yuan
    Zhang, Jinfeng
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18