EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction

被引:218
|
作者
Chen, Xing [1 ]
Huang, Li [2 ]
Xie, Di [3 ]
Zhao, Qi [4 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
[2] Natl Univ Singapore, Business Analyt Ctr, Singapore 119613, Singapore
[3] Liaoning Univ, Sch Math, Shenyang 110036, Liaoning, Peoples R China
[4] Res Ctr Comp Simulating & Informat Proc Biomacrom, Shenyang 110036, Liaoning, Peoples R China
来源
CELL DEATH & DISEASE | 2018年 / 9卷
基金
中国国家自然科学基金;
关键词
HUMAN MICRORNA; TUMOR-SUPPRESSOR; EXPRESSION; CANCER; IDENTIFICATION; RECEPTOR; GENES; LYMPHOMA; DATABASE; TARGETS;
D O I
10.1038/s41419-017-0003-x
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Associations between microRNAs (miRNAs) and human diseases have been identified by increasing studies and discovering new ones is an ongoing process in medical laboratories. To improve experiment productivity, researchers computationally infer potential associations from biological data, selecting the most promising candidates for experimental verification. Predicting potential miRNA-disease association has become a research area of growing importance. This paper presents a model of Extreme Gradient Boosting Machine for MiRNA-Disease Association (EGBMMDA) prediction by integrating the miRNA functional similarity, the disease semantic similarity, and known miRNA-disease associations. The statistical measures, graph theoretical measures, and matrix factorization results for each miRNA-disease pair were calculated and used to form an informative feature vector. The vector for known associated pairs obtained from the HMDD v2.0 database was used to train a regression tree under the gradient boosting framework. EGBMMDA was the first decision tree learning-based model used for predicting miRNA-disease associations. Respectively, AUCs of 0.9123 and 0.8221 in global and local leave-one-out cross-validation proved the model's reliable performance. Moreover, the 0.9048 +/- 0.0012 AUC in fivefold cross-validation confirmed its stability. We carried out three different types of case studies of predicting potential miRNAs related to Colon Neoplasms, Lymphoma, Prostate Neoplasms, Breast Neoplasms, and Esophageal Neoplasms. The results indicated that, respectively, 98%, 90%, 98%, 100%, and 98% of the top 50 predictions for the five diseases were confirmed by experiments. Therefore, EGBMMDA appears to be a useful computational resource for miRNA-disease association prediction.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction
    Xing Chen
    Li Huang
    Di Xie
    Qi Zhao
    Cell Death & Disease, 9
  • [2] Review of MiRNA-Disease Association Prediction
    Jiang, Lei
    Zhu, Ji
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2020, 21 (11) : 1044 - 1053
  • [3] miRNA-Disease Association Prediction with Collaborative Matrix Factorization
    Shen, Zhen
    Zhang, You-Hua
    Han, Kyungsook
    Nandi, Asoke K.
    Honig, Barry
    Huang, De-Shuang
    COMPLEXITY, 2017,
  • [4] HAMDA: Hybrid Approach for MiRNA-Disease Association prediction
    Chen, Xing
    Niu, Ya-Wei
    Wang, Guang-Hui
    Yan, Gui-Ying
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 76 : 50 - 58
  • [5] A Novel Computational Method for MiRNA-Disease Association Prediction
    Jiang, Zhi-Chao
    Shen, Zhen
    Bao, Wenzheng
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 539 - 547
  • [6] NMCMDA: neural multicategory MiRNA-disease association prediction
    Wang, Jingru
    Li, Jin
    Yue, Kun
    Wang, Li
    Ma, Yuyun
    Li, Qing
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [7] MCMDA: Matrix completion for MiRNA-disease association prediction
    Li, Jian-Qiang
    Rong, Zhi-Hao
    Chen, Xing
    Yan, Gui-Ying
    You, Zhu-Hong
    ONCOTARGET, 2017, 8 (13) : 21187 - 21199
  • [8] GRMDA: Graph Regression for MiRNA-Disease Association Prediction
    Chen, Xing
    Yang, Jing-Ru
    Guan, Na-Na
    Li, Jian-Qiang
    FRONTIERS IN PHYSIOLOGY, 2018, 9
  • [9] Combined embedding model for MiRNA-disease association prediction
    Liu, Bailong
    Zhu, Xiaoyan
    Zhang, Lei
    Liang, Zhizheng
    Li, Zhengwei
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [10] Combined embedding model for MiRNA-disease association prediction
    Bailong Liu
    Xiaoyan Zhu
    Lei Zhang
    Zhizheng Liang
    Zhengwei Li
    BMC Bioinformatics, 22