Amogel: a multi-omics classification framework using associative graph neural networks with prior knowledge for biomarker identification

被引:0
|
作者
Tan, Chia Yan [1 ]
Ong, Huey Fang [1 ]
Lim, Chern Hong [1 ]
Tan, Mei Sze [1 ]
Ooi, Ean Hin [2 ]
Wong, Koksheik [1 ]
机构
[1] Monash Univ Malaysia, Sch Informat Technol, Petaling Jaya 47500, Selangor, Malaysia
[2] Monash Univ Malaysia, Sch Engn, Petaling Jaya 47500, Selangor, Malaysia
来源
BMC BIOINFORMATICS | 2025年 / 26卷 / 01期
关键词
Graph neural network; Association rule mining; Graph classification; Multi-omics; Prior knowledge; GENE-EXPRESSION; SURVIVAL;
D O I
10.1186/s12859-025-06111-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advent of high-throughput sequencing technologies, such as DNA microarray and DNA sequencing, has enabled effective analysis of cancer subtypes and targeted treatment. Furthermore, numerous studies have highlighted the capability of graph neural networks (GNN) to model complex biological systems and capture non-linear interactions in high-throughput data. GNN has proven to be useful in leveraging multiple types of omics data, including prior biological knowledge from various sources, such as transcriptomics, genomics, proteomics, and metabolomics, to improve cancer classification. However, current works do not fully utilize the non-linear learning potential of GNN and lack of the integration ability to analyse high-throughput multi-omics data simultaneously with prior biological knowledge. Nevertheless, relying on limited prior knowledge in generating gene graphs might lead to less accurate classification due to undiscovered significant gene-gene interactions, which may require expert intervention and can be time-consuming. Hence, this study proposes a graph classification model called associative multi-omics graph embedding learning (AMOGEL) to effectively integrate multi-omics datasets and prior knowledge through GNN coupled with association rule mining (ARM). AMOGEL employs an early fusion technique using ARM to mine intra-omics and inter-omics relationships, forming a multi-omics synthetic information graph before the model training. Moreover, AMOGEL introduces multi-dimensional edges, with multi-omics gene associations or edges as the main contributors and prior knowledge edges as auxiliary contributors. Additionally, it uses a gene ranking technique based on attention scores, considering the relationships between neighbouring genes. Several experiments were performed on BRCA and KIPAN cancer subtypes to demonstrate the integration of multi-omics datasets (miRNA, mRNA, and DNA methylation) with prior biological knowledge of protein-protein interactions, KEGG pathways and Gene Ontology. The experimental results showed that the AMOGEL outperformed the current state-of-the-art models in terms of classification accuracy, F1 score and AUC score. The findings of this study represent a crucial step forward in advancing the effective integration of multi-omics data and prior knowledge to improve cancer subtype classification.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Graph Neural Networks With Multiple Prior Knowledge for Multi-Omics Data Analysis
    Xiao, Shunxin
    Lin, Huibin
    Wang, Conghao
    Wang, Shiping
    Rajapakse, Jagath C.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (09) : 4591 - 4600
  • [2] MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification
    Wang, Tongxin
    Shao, Wei
    Huang, Zhi
    Tang, Haixu
    Zhang, Jie
    Ding, Zhengming
    Huang, Kun
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [3] MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification
    Tongxin Wang
    Wei Shao
    Zhi Huang
    Haixu Tang
    Jie Zhang
    Zhengming Ding
    Kun Huang
    Nature Communications, 12
  • [4] Integration of multi-omics data using adaptive graph learning and attention mechanism for patient classification and biomarker identification
    Ouyang, Dong
    Liang, Yong
    Li, Le
    Ai, Ning
    Lu, Shanghui
    Yu, Mingkun
    Liu, Xiaoying
    Xie, Shengli
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [5] MOGAT: A Multi-Omics Integration Framework Using Graph Attention Networks for Cancer Subtype Prediction
    Tanvir, Raihanul Bari
    Islam, Md Mezbahul
    Sobhan, Masrur
    Luo, Dongsheng
    Mondal, Ananda Mohan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (05)
  • [6] Cancer Molecular Subtype Classification by Graph Convolutional Networks on Multi-omics Data
    Li, Bingjun
    Wang, Tianyu
    Nabavi, Sheida
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [7] moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks
    Joung Min Choi
    Heejoon Chae
    BMC Bioinformatics, 24
  • [8] moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks
    Choi, Joung Min
    Chae, Heejoon
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [9] Integration of Multi-Omics Data Using Probabilistic Graph Models and External Knowledge
    Tripp, Bridget A.
    Otu, Hasan H.
    CURRENT BIOINFORMATICS, 2022, 17 (01) : 37 - 47
  • [10] Geometric graph neural networks on multi-omics data to predict cancer survival outcomes
    Zhu, Jiening
    Oh, Jung Hun
    Simhal, Anish K.
    Elkin, Rena
    Norton, Larry
    Deasy, Joseph O.
    Tannenbaum, Allen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163