Graph embedding and ensemble learning for predicting gene-disease associations

被引:0
|
作者
Wang, Haorui [1 ,2 ]
Wang, Xiaochan [2 ]
Yu, Zhouxin [1 ]
Zhang, Wen [1 ,3 ]
机构
[1] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Hubei, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430070, Hubei, Peoples R China
[3] Hubei Engn Technol Res Ctr Agr Big Data, Wuhan 430070, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
gene-disease association; heterogeneous network; graph embedding; MUTATIONS;
D O I
10.1504/IJDMB.2020.108704
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The discovery of gene-disease associations is important for preventing, diagnosing, and treating diseases. In this paper, we propose two heterogeneous network-based methods that enhance gene-disease association prediction by using graph embedding and ensemble learning, abbreviated as 'HNEEM' and 'HNEEM-PLUS'. We integrate gene-disease associations, gene-chemical associations, gene-gene associations and disease-chemical associations to construct a heterogeneous network, and adopt six graph embedding methods respectively to learn the representative vectors of genes and diseases from the network. We build individual prediction models using each graph embedding representation and random forest, and then combine them by average scoring to construct the ensemble model HNEEM. To increase the diversity of base predictors, we further introduce the multilayer perceptron as an additional classifier and generate more base predictors, and thus propose an extended method named 'HNEEM-PLUS'. Computational experiments show that HNEEM has better results than individual methods and HNEEM-PLUS makes more improvement than HNEEM.
引用
收藏
页码:360 / 379
页数:20
相关论文
共 50 条
  • [21] A deep learning framework for predicting disease-gene associations with functional modules and graph augmentation
    Jia, Xianghu
    Luo, Weiwen
    Li, Jiaqi
    Xing, Jieqi
    Sun, Hongjie
    Wu, Shunyao
    Su, Xiaoquan
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [22] An integrated approach to inferring gene-disease associations in humans
    Radivojac, Predrag
    Peng, Kang
    Clark, Wyatt T.
    Peters, Brandon J.
    Mohan, Amrita
    Boyle, Sean M.
    Mooney, Sean D.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 72 (03) : 1030 - 1037
  • [23] Improving the identification of miRNA-disease associations with multi-task learning on gene-disease networks
    He, Qiang
    Qiao, Wei
    Fang, Hui
    Bao, Yang
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [24] Selection of SNPs for evaluating gene-disease associations using haplotypes
    Li, N
    Li, M
    GENETIC EPIDEMIOLOGY, 2005, 29 (03) : 263 - 263
  • [25] Multilocus Bayesian Meta-analysis of Gene-disease Associations
    Newcombe, P. J.
    Verzilli, C.
    Pablo-Casas, J.
    Hingorani, Aroon
    Smeeth, L.
    Whittaker, J.
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 828 - 828
  • [26] Investigations of Gene-Disease Associations: Costs and Benefits of Environmental Data
    Luo, Hao
    Burstyn, Igor
    Gustafson, Paul
    EPIDEMIOLOGY, 2013, 24 (04) : 562 - 568
  • [27] Selection bias in meta-analyses of gene-disease associations
    Tang, JL
    PLOS MEDICINE, 2005, 2 (12): : 1226 - 1227
  • [28] Multilocus Bayesian Meta-Analysis of Gene-Disease Associations
    Newcombe, Paul J.
    Verzilli, Claudio
    Casas, Juan P.
    Hingorani, Aroon D.
    Smeeth, Liam
    Whittaker, John C.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (05) : 567 - 580
  • [29] Graph Ensemble Networks for Semi-supervised Embedding Learning
    Tang, Hui
    Liang, Xun
    Wu, Bo
    Guan, Zhenyu
    Guo, Yuhui
    Zheng, Xiangping
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 408 - 420
  • [30] Graph Embedding-Based Ensemble Learning for Image Clustering
    Luo, Xiaohui
    Zhang, Li
    Li, Fanzhang
    Wang, Bangjun
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 213 - 218