Graph embedding and ensemble learning for predicting gene-disease associations

被引:0
|
作者
Wang, Haorui [1 ,2 ]
Wang, Xiaochan [2 ]
Yu, Zhouxin [1 ]
Zhang, Wen [1 ,3 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Hubei, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430070, Hubei, Peoples R China
[3] Hubei Engn Technol Res Ctr Agr Big Data, Wuhan 430070, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
gene-disease association; heterogeneous network; graph embedding; MUTATIONS;
D O I
10.1504/IJDMB.2020.108704
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The discovery of gene-disease associations is important for preventing, diagnosing, and treating diseases. In this paper, we propose two heterogeneous network-based methods that enhance gene-disease association prediction by using graph embedding and ensemble learning, abbreviated as 'HNEEM' and 'HNEEM-PLUS'. We integrate gene-disease associations, gene-chemical associations, gene-gene associations and disease-chemical associations to construct a heterogeneous network, and adopt six graph embedding methods respectively to learn the representative vectors of genes and diseases from the network. We build individual prediction models using each graph embedding representation and random forest, and then combine them by average scoring to construct the ensemble model HNEEM. To increase the diversity of base predictors, we further introduce the multilayer perceptron as an additional classifier and generate more base predictors, and thus propose an extended method named 'HNEEM-PLUS'. Computational experiments show that HNEEM has better results than individual methods and HNEEM-PLUS makes more improvement than HNEEM.
引用
收藏
页码:360 / 379
页数:20
相关论文
共 50 条
  • [31] Relation-aware Ensemble Learning for Knowledge Graph Embedding
    Yue, Ling
    Zhang, Yongqi
    Yao, Quanming
    Li, Yong
    Wu, Xian
    Zhang, Ziheng
    Lin, Zhenxi
    Zheng, Yefeng
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16620 - 16631
  • [32] Reporting, appraising, and integrating data on genotype prevalence and gene-disease associations
    Little, J
    Bradley, L
    Bray, MS
    Clyne, M
    Dorman, J
    Ellsworth, DL
    Hanson, J
    Khoury, M
    Lau, J
    O'Brien, TR
    Rothman, N
    Stroup, D
    Taioli, E
    Thomas, D
    Vainio, H
    Wacholder, S
    Weinberg, C
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2002, 156 (04) : 300 - 310
  • [33] Predicting gene disease associations with knowledge graph embeddings for diseases with curtailed information
    Gualdi, Francesco
    Oliva, Baldomero
    Pinero, Janet
    NAR GENOMICS AND BIOINFORMATICS, 2024, 6 (02)
  • [34] reg2gene: predicting enhancer-gene associations using ensemble learning approaches
    Patarcic, I.
    Franke, V.
    Akalin, A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 1157 - 1158
  • [35] Predicting MiRNA-disease associations by multiple meta-paths fusion graph embedding model
    Lei Zhang
    Bailong Liu
    Zhengwei Li
    Xiaoyan Zhu
    Zhizhen Liang
    Jiyong An
    BMC Bioinformatics, 21
  • [36] Predicting MiRNA-disease associations by multiple meta-paths fusion graph embedding model
    Zhang, Lei
    Liu, Bailong
    Li, Zhengwei
    Zhu, Xiaoyan
    Liang, Zhizhen
    An, Jiyong
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [37] Genetic interaction mapping of AMD identifies potential gene-disease associations
    Kiang, Lee
    Huang, Jillian
    Tsuchida, Ryan
    Jayasundera, Kanishka
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2013, 54 (15)
  • [38] Predicting miRNA-disease associations using an ensemble learning framework with resampling method
    Dai, Qiguo
    Wang, Zhaowei
    Liu, Ziqiang
    Duan, Xiaodong
    Song, Jinmiao
    Guo, Maozu
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [39] GGAECDA: Predicting circRNA-disease associations using graph autoencoder based on graph representation learning
    Li, Guanghui
    Lin, Yawei
    Luo, Jiawei
    Xiao, Qiu
    Liang, Cheng
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 99
  • [40] Self-supervised graph representation learning integrates multiple molecular networks and decodes gene-disease relationships
    Wang, Yi
    Sun, Zijun
    He, Qiushun
    Li, Jiwei
    Ni, Ming
    Yang, Meng
    PATTERNS, 2022, 4 (01):