CLEP: a hybrid data- and knowledge-driven framework for generating patient representations

Cited by: 6
Authors
Bharadhwaj, Vinay Srinivas [1, 2]
Ali, Mehdi [3, 4, 5]
Birkenbihl, Colin [1, 2]
Mubeen, Sarah [1, 2, 6]
Lehmann, Jens [3, 4, 5]
Hofmann-Apitius, Martin [1, 2]
Hoyt, Charles Tapley [1, 3, 6]
Domingo-Fernandez, Daniel [1, 3, 6]
Affiliations
[1] Fraunhofer Inst Algorithms & Sci Comp, Dept Bioinformat, D-53757 St Augustin, Germany
[2] Univ Bonn, Bonn Aachen Int Ctr Informat Technol B IT, D-53115 Bonn, Germany
[3] Rheinische Friedrich Wilhelms Univ Bonn, D-53113 Bonn, Germany
[4] Fraunhofer Inst Intelligent Anal & Informat Syst, Dresden, Germany
[5] Fraunhofer Inst Intelligent Anal & Informat Syst, St Augustin, Germany
[6] Fraunhofer Ctr Machine Learning, Bonn, Germany
Keywords
BETA;
DOI
10.1093/bioinformatics/btab340
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Summary: As machine learning and artificial intelligence find a growing number of applications in the biomedical domain, their utility ultimately depends on the data used to train them. Due to the complexity and high dimensionality of biomedical data, there is a need for approaches that combine prior knowledge of known biological interactions with patient data. Here, we present CLinical Embedding of Patients (CLEP), a novel approach that generates new patient representations by leveraging both prior knowledge and patient-level data. First, given a patient-level dataset and a knowledge graph containing relations across features that can be mapped to the dataset, CLEP incorporates patients into the knowledge graph as new nodes connected to their most characteristic features. Next, CLEP employs knowledge graph embedding models to generate new patient representations that can ultimately be used for a variety of downstream tasks, ranging from clustering to classification. We demonstrate that, for a variety of machine learning models, the new patient representations generated by CLEP significantly improve performance in distinguishing patients from healthy controls compared to the original transcriptomics data. Furthermore, we show how incorporating patients into a knowledge graph can foster the interpretation and identification of biological features characteristic of a specific disease or patient subgroup. Finally, we released CLEP as an open source Python package together with examples and documentation.
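The first step described in the abstract (adding patients to the knowledge graph as nodes linked to their most characteristic features) can be illustrated with a minimal sketch. The abstract does not say how "most characteristic" is determined, so this example assumes a z-score threshold against the control samples. The function name `patient_edges`, the relation labels, the threshold value, and the toy data are all hypothetical and are not taken from the CLEP package.

```python
from statistics import mean, stdev


def patient_edges(patients, controls, threshold=2.0):
    """Return (patient, relation, feature) triples for every feature whose
    value lies at least `threshold` control standard deviations away from
    the control mean. Assumed criterion, not necessarily the one CLEP uses."""
    features = list(next(iter(controls.values())).keys())

    # Per-feature mean and sample standard deviation over the controls.
    stats = {}
    for feature in features:
        values = [sample[feature] for sample in controls.values()]
        stats[feature] = (mean(values), stdev(values))

    triples = []
    for patient_id, sample in patients.items():
        for feature, (mu, sd) in stats.items():
            if sd == 0:
                continue  # feature is constant in controls; z-score undefined
            z = (sample[feature] - mu) / sd
            if z >= threshold:
                triples.append((patient_id, "positively_characterized_by", feature))
            elif z <= -threshold:
                triples.append((patient_id, "negatively_characterized_by", feature))
    return triples


# Toy prior-knowledge graph over features (hypothetical relations).
knowledge_graph = [
    ("GENE_A", "interacts_with", "GENE_B"),
    ("GENE_B", "regulates", "GENE_C"),
]

# Toy expression values (hypothetical).
controls = {
    "c1": {"GENE_A": 1.0, "GENE_B": 2.0, "GENE_C": 3.0},
    "c2": {"GENE_A": 1.2, "GENE_B": 2.1, "GENE_C": 2.9},
    "c3": {"GENE_A": 0.8, "GENE_B": 1.9, "GENE_C": 3.1},
}
patients = {"p1": {"GENE_A": 5.0, "GENE_B": 2.0, "GENE_C": 1.0}}

new_edges = patient_edges(patients, controls)

# The combined triple list is what a knowledge graph embedding model
# would then be trained on to obtain a vector for each patient node.
combined_graph = knowledge_graph + new_edges
```

In this toy case, patient `p1` is linked to `GENE_A` and `GENE_C` but not to `GENE_B`, whose value matches the controls. The embedding step itself is omitted here; it would take `combined_graph` as input.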
Pages: 3311 - 3318
Number of pages: 8
Related papers
50 in total
  • [41] A Knowledge-Driven Anomaly Detection Framework for Social Production System
    Li, Zheng
    Xu, Xiaolong
    Hang, Tian
    Xiang, Haolong
    Cui, Yan
    Qi, Lianyong
    Zhou, Xiaokang
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03) : 3179 - 3192
  • [42] A KNOWLEDGE-DRIVEN FRAMEWORK FOR ECG REPRESENTATION AND INTERPRETATION FOR WEARABLE APPLICATIONS
    Balasubramanian, Ramasubramanian
    Chaspari, Theodora
    Narayanan, Shrikanth S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1018 - 1022
  • [43] A knowledge-driven approach for designing data analytics platforms
    Madhushi Bandara
    Fethi A. Rabhi
    Muneera Bano
    [J]. Requirements Engineering, 2023, 28 : 195 - 212
  • [44] A knowledge-driven approach for designing data analytics platforms
    Bandara, Madhushi
    Rabhi, Fethi A.
    Bano, Muneera
    [J]. REQUIREMENTS ENGINEERING, 2023, 28 (02) : 195 - 212
  • [45] Semantic Water Data Translation: A Knowledge-driven Approach
    Shu, Yanfeng
    Ratcliffe, David
    Taylor, Kerry
    Wu, Jemma
    Ackland, Ross
    Terhorst, Andrew
    [J]. PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 52 - 60
  • [46] Domain Knowledge-Driven Generation of Synthetic Healthcare Data
    Hashemi, Atiye Sadat
    Soliman, Amira
    Lundstrom, Jens
    Etminani, Kobra
    [J]. CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 352 - 353
  • [47] Interaction of knowledge-driven and data-driven processing in category learning
    Vandierendonck, A
    Rosseel, Y
    [J]. EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2000, 12 (01) : 37 - 63
  • [48] The EMPWR Platform: Data and Knowledge-Driven Processes for the Knowledge Graph Lifecycle
    Yip, Hong Yung
    Sheth, Amit
    [J]. IEEE INTERNET COMPUTING, 2024, 28 (01) : 61 - 69
  • [49] Event-triggered data- and knowledge-driven adaptive quality iterative learning control with uncertainty for a pharmaceutical cyber-physical system
    Wang, Zhengsong
    Tang, Shengnan
    Guo, Ge
    Yang, Yanqiu
    Han, Meng
    Yang, Le
    He, Dakuo
    [J]. CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2023, 101 (10) : 5844 - 5857
  • [50] From a Data-Driven Towards a Knowledge-Driven Society: Making Sense of Data
    Portmann, Edy
    Reimer, Ulrich
    Wilke, Gwendolin
    [J]. APPLICATION OF FUZZY LOGIC FOR MANAGERIAL DECISION MAKING PROCESSES: LATEST RESEARCH AND CASE STUDIES, 2017, : 93 - 98