Prioritizing genomic variants through neuro-symbolic, knowledge-enhanced learning

被引:0
|
作者
Althagafi, Azza [1 ,2 ,3 ]
Zhapa-Camacho, Fernando [1 ,2 ]
Hoehndorf, Robert [1 ,2 ,4 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, 4700 KAUST, Thuwal 23955, Saudi Arabia
[2] King Abdullah Univ Sci & Technol KAUST, Comp Sci Program, Comp Elect & Math Sci & Engn Div CEMSE, 4700 KAUST, Thuwal 23955, Saudi Arabia
[3] Taif Univ, Coll Comp & Informat Technol, Comp Sci Dept, Taif 26571, Saudi Arabia
[4] King Abdullah Univ Sci & Technol KAUST, SDAIA KAUST Ctr Excellence Data Sci & Artificial I, 4700 KAUST, Thuwal 23955, Saudi Arabia
关键词
STRUCTURAL VARIATION; DIAGNOSTICS; CHALLENGES; PREDICTION; GENETICS; GENES; SCORE;
D O I
10.1093/bioinformatics/btae301
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Whole-exome and genome sequencing have become common tools in diagnosing patients with rare diseases. Despite their success, this approach leaves many patients undiagnosed. A common argument is that more disease variants still await discovery, or the novelty of disease phenotypes results from a combination of variants in multiple disease-related genes. Interpreting the phenotypic consequences of genomic variants relies on information about gene functions, gene expression, physiology, and other genomic features. Phenotype-based methods to identify variants involved in genetic diseases combine molecular features with prior knowledge about the phenotypic consequences of altering gene functions. While phenotype-based methods have been successfully applied to prioritizing variants, such methods are based on known gene-disease or gene-phenotype associations as training data and are applicable to genes that have phenotypes associated, thereby limiting their scope. In addition, phenotypes are not assigned uniformly by different clinicians, and phenotype-based methods need to account for this variability.Results We developed an Embedding-based Phenotype Variant Predictor (EmbedPVP), a computational method to prioritize variants involved in genetic diseases by combining genomic information and clinical phenotypes. EmbedPVP leverages a large amount of background knowledge from human and model organisms about molecular mechanisms through which abnormal phenotypes may arise. Specifically, EmbedPVP incorporates phenotypes linked to genes, functions of gene products, and the anatomical site of gene expression, and systematically relates them to their phenotypic effects through neuro-symbolic, knowledge-enhanced machine learning. We demonstrate EmbedPVP's efficacy on a large set of synthetic genomes and genomes matched with clinical information.Availability and implementation EmbedPVP and all evaluation experiments are freely available at https://github.com/bio-ontology-research-group/EmbedPVP.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Neuro-Symbolic Integration for Reasoning and Learning on Knowledge Graphs
    Werner, Luisa
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23429 - 23430
  • [2] Neuro-symbolic representation learning on biological knowledge graphs
    Alshahrani, Mona
    Khan, Mohammad Asif
    Maddouri, Omar
    Kinjo, Akira R.
    Queralt-Rosinach, Nuria
    Hoehndorf, Robert
    [J]. BIOINFORMATICS, 2017, 33 (17) : 2723 - 2730
  • [3] Neuro-Symbolic Class Expression Learning
    Demir, Caglar
    Ngomo, Axel-Cyrille Ngonga
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 3624 - 3632
  • [4] Knowledge-Infused Learning: A Sweet Spot in Neuro-Symbolic AI
    Gaur, Manas
    Gunaratna, Kalpa
    Bhatt, Shreyansh
    Sheth, Amit
    [J]. IEEE INTERNET COMPUTING, 2022, 26 (04) : 5 - 11
  • [5] Learning Neuro-Symbolic Abstractions for Robot Planning and Learning
    Shah, Naman
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23417 - 23418
  • [6] Reliable Neuro-Symbolic Abstractions for Planning and Learning
    Shah, Naman
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 7093 - 7094
  • [7] Neuro-symbolic Learning Yielding Logical Constraints
    Li, Zenan
    Huang, Yunpeng
    Li, Zhaoyu
    Yao, Yuan
    Xu, Jingwei
    Chen, Taolue
    Ma, Xiaoxing
    Lu, Jian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Semantic Probabilistic Layers for Neuro-Symbolic Learning
    Ahmed, Kareem
    Teso, Stefano
    Chang, Kai-Wei
    Van den Broeck, Guy
    Vergari, Antonio
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [9] A Neuro-Symbolic Approach for Enhanced Human Motion Prediction
    Mghames, Sariah
    Castri, Luca
    Hanheide, Marc
    Bellotto, Nicola
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] APPLYING DIFFERENT LEARNING RULES IN NEURO-SYMBOLIC INTEGRATION
    Sathasivam, Saratha
    [J]. MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 716 - 720