Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification

被引:0
|
作者
Tan, Chia Yan [1 ]
Ong, Huey Fang [1 ]
Lim, Chern Hong [1 ]
Tan, Mei Sze [1 ]
Ooi, Ean Hin [2 ]
Wong, Koksheik [1 ]
机构
[1] Monash Univ Malaysia, Sch Informat Technol, Petaling Jaya 47500, Selangor, Malaysia
[2] Monash Univ Malaysia, Sch Engn, Petaling Jaya 47500, Selangor, Malaysia
来源
BMC BIOINFORMATICS | 2025年 / 26卷 / 01期
关键词
Graph neural network; Association rule mining; Graph classification; Multi-omics; Prior knowledge; GENE-EXPRESSION; SURVIVAL;
D O I
10.1186/s12859-025-06111-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advent of high-throughput sequencing technologies, such as DNA microarray and DNA sequencing, has enabled effective analysis of cancer subtypes and targeted treatment. Furthermore, numerous studies have highlighted the capability of graph neural networks (GNN) to model complex biological systems and capture non-linear interactions in high-throughput data. GNN has proven to be useful in leveraging multiple types of omics data, including prior biological knowledge from various sources, such as transcriptomics, genomics, proteomics, and metabolomics, to improve cancer classification. However, current works do not fully utilize the non-linear learning potential of GNN and lack of the integration ability to analyse high-throughput multi-omics data simultaneously with prior biological knowledge. Nevertheless, relying on limited prior knowledge in generating gene graphs might lead to less accurate classification due to undiscovered significant gene-gene interactions, which may require expert intervention and can be time-consuming. Hence, this study proposes a graph classification model called associative multi-omics graph embedding learning (AMOGEL) to effectively integrate multi-omics datasets and prior knowledge through GNN coupled with association rule mining (ARM). AMOGEL employs an early fusion technique using ARM to mine intra-omics and inter-omics relationships, forming a multi-omics synthetic information graph before the model training. Moreover, AMOGEL introduces multi-dimensional edges, with multi-omics gene associations or edges as the main contributors and prior knowledge edges as auxiliary contributors. Additionally, it uses a gene ranking technique based on attention scores, considering the relationships between neighbouring genes. Several experiments were performed on BRCA and KIPAN cancer subtypes to demonstrate the integration of multi-omics datasets (miRNA, mRNA, and DNA methylation) with prior biological knowledge of protein-protein interactions, KEGG pathways and Gene Ontology. The experimental results showed that the AMOGEL outperformed the current state-of-the-art models in terms of classification accuracy, F1 score and AUC score. The findings of this study represent a crucial step forward in advancing the effective integration of multi-omics data and prior knowledge to improve cancer subtype classification.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data
    Jing Xu
    Peng Wu
    Yuehui Chen
    Qingfang Meng
    Hussain Dawood
    Hassan Dawood
    BMC Bioinformatics, 20
  • [32] A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data
    Xu, Jing
    Wu, Peng
    Chen, Yuehui
    Meng, Qingfang
    Dawood, Hussain
    Dawood, Hassan
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [33] Classifying Breast Cancer Subtypes Using Deep Neural Networks Based on Multi-Omics Data
    Lin, Yuqi
    Zhang, Wen
    Cao, Huanshen
    Li, Gaoyang
    Du, Wei
    GENES, 2020, 11 (08) : 1 - 18
  • [34] Multi-layer Convolutional Neural Network Model Based on Prior Knowledge of Knowledge Graph for Text Classification
    Meng, Yining
    Wang, Guoyin
    Liu, Qun
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2019, : 618 - 624
  • [35] Multi-Class and Multi-Label Classification Using Associative Pulsing Neural Networks
    Horzyk, Adrian
    Starzyk, Janusz A.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 427 - 434
  • [36] Biomarkers Identification of Hepatocellular Carcinoma Based on Multi-omics Data Integration and Graph-embedded Deep Neural Network
    Yan, Chaokun
    Li, Mengyuan
    Suo, Zhihao
    Zhang, Jun
    Wang, Jianlin
    Zhang, Ge
    Liang, Wenjuan
    Luo, Huimin
    CURRENT BIOINFORMATICS, 2023, 18 (06) : 459 - 471
  • [37] Integrative machine learning model using multi-omics data for early prediction and biomarker identification of hepatocellular carcinoma
    Spooner, Annette
    Behary, Jason
    Moridani, Mohammad Karimi
    Safarchi, Azadeh
    Sowmya, Arcot
    Zekry, Amany
    Vafaee, Fatemeh
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2024, 39 : 108 - 108
  • [38] Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
    Park, Min-Koo
    Lim, Jin-Muk
    Jeong, Jinwoo
    Jang, Yeongjae
    Lee, Ji-Won
    Lee, Jeong-Chan
    Kim, Hyungyu
    Koh, Euiyul
    Hwang, Sung-Joo
    Kim, Hong-Gee
    Kim, Keun-Cheol
    BIOMOLECULES, 2022, 12 (12)
  • [39] Biologically interpretable neural networks for phenotype prediction using population-cohort multi-omics data
    van Hilten, Arno
    Rooij, Jeroen G. J.
    Niessen, Wiro J.
    van Meurs, Joyce B. J.
    Roshchupkin, Gennady V.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 493 - 494
  • [40] PRESSNET: Patient stratification and biomarker discovery using multi-modal knowledge graph framework
    Bulusu, Krishna C.
    Cohen-Setton, Jake
    Kagiampakis, Ioannis
    Goncalves, Miguel
    Edwards, Gavin
    Jayabalan, Sanddhya
    Shikare, Shruti
    Tsang, Kelvin
    Sidders, Ben
    Jacob, Etai
    CANCER RESEARCH, 2024, 84 (06)