A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs

被引:5
|
作者
Xiao, Yubin [1 ,3 ]
Xiao, Zheng [2 ]
Feng, Xiang [1 ]
Chen, Zhiping [1 ]
Kuang, Linai [3 ]
Wang, Lei [1 ,3 ]
机构
[1] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410001, Peoples R China
[2] Univ South China, Canc Res Inst, Hunan Prov Key Lab Tumor Cellular & Mol Pathol, Hengyang 421001, Hunan, Peoples R China
[3] Xiangtan Univ, Key Lab Hunan Prov Internet Things & Informat Sec, Xiangtan 411105, Peoples R China
基金
中国国家自然科学基金;
关键词
LncRNA-disease association prediction; Features; Random walk; Multiple linear regression; Artificial neural network; LONG NONCODING RNAS; FUNCTIONAL SIMILARITY; RANDOM-WALK; REGRESSION-ANALYSIS; COMPLEX DISEASES; NEURAL-NETWORK; DATABASE;
D O I
10.1186/s12859-020-03906-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Accumulating evidence has demonstrated that long non-coding RNAs (lncRNAs) are closely associated with human diseases, and it is useful for the diagnosis and treatment of diseases to get the relationships between lncRNAs and diseases. Due to the high costs and time complexity of traditional bio-experiments, in recent years, more and more computational methods have been proposed by researchers to infer potential lncRNA-disease associations. However, there exist all kinds of limitations in these state-of-the-art prediction methods as well. Results In this manuscript, a novel computational model named FVTLDA is proposed to infer potential lncRNA-disease associations. In FVTLDA, its major novelty lies in the integration of direct and indirect features related to lncRNA-disease associations such as the feature vectors of lncRNA-disease pairs and their corresponding association probability fractions, which guarantees that FVTLDA can be utilized to predict diseases without known related-lncRNAs and lncRNAs without known related-diseases. Moreover, FVTLDA neither relies solely on known lncRNA-disease nor requires any negative samples, which guarantee that it can infer potential lncRNA-disease associations more equitably and effectively than traditional state-of-the-art prediction methods. Additionally, to avoid the limitations of single model prediction techniques, we combine FVTLDA with the Multiple Linear Regression (MLR) and the Artificial Neural Network (ANN) for data analysis respectively. Simulation experiment results show that FVTLDA with MLR can achieve reliable AUCs of 0.8909, 0.8936 and 0.8970 in 5-Fold Cross Validation (fivefold CV), 10-Fold Cross Validation (tenfold CV) and Leave-One-Out Cross Validation (LOOCV), separately, while FVTLDA with ANN can achieve reliable AUCs of 0.8766, 0.8830 and 0.8807 in fivefold CV, tenfold CV, and LOOCV respectively. Furthermore, in case studies of gastric cancer, leukemia and lung cancer, experiment results show that there are 8, 8 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with MLR, and 8, 7 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with ANN, having been verified by recent literature. Comparing with the representative prediction model of KATZLDA, comparison results illustrate that FVTLDA with MLR and FVTLDA with ANN can achieve the average case study contrast scores of 0.8429 and 0.8515 respectively, which are both notably higher than the average case study contrast score of 0.6375 achieved by KATZLDA. Conclusion The simulation results show that FVTLDA has good prediction performance, which is a good supplement to future bioinformatics research.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
    Yubin Xiao
    Zheng Xiao
    Xiang Feng
    Zhiping Chen
    Linai Kuang
    Lei Wang
    [J]. BMC Bioinformatics, 21
  • [2] A random forest based computational model for predicting novel lncRNA-disease associations
    Dengju Yao
    Xiaojuan Zhan
    Xiaorong Zhan
    Chee Keong Kwoh
    Peng Li
    Jinke Wang
    [J]. BMC Bioinformatics, 21
  • [3] A random forest based computational model for predicting novel lncRNA-disease associations
    Yao, Dengju
    Zhan, Xiaojuan
    Zhan, Xiaorong
    Kwoh, Chee Keong
    Li, Peng
    Wang, Jinke
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [4] LLCLPLDA: a novel model for predicting lncRNA-disease associations
    Xie, Guobo
    Huang, Shuhuang
    Luo, Yu
    Ma, Lei
    Lin, Zhiyi
    Sun, Yuping
    [J]. MOLECULAR GENETICS AND GENOMICS, 2019, 294 (06) : 1477 - 1486
  • [5] A Novel Model for Predicting LncRNA-disease Associations Based on the LncRNA-MiRNA-disease Interactive Network
    Wang, Lei
    Xuan, Zhanwei
    Zhou, Shunxian
    Kuang, Linai
    Pei, Tingrui
    [J]. CURRENT BIOINFORMATICS, 2019, 14 (03) : 269 - 278
  • [6] HEGANLDA: A Computational Model for Predicting Potential Lncrna-Disease Associations Based On Multiple Heterogeneous Networks
    Li, Jianwei
    Wang, Duanyang
    Yang, Zhenwu
    Liu, Ming
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 388 - 398
  • [7] IDLDA: An Improved Diffusion Model for Predicting LncRNA-Disease Associations
    Wang, Qi
    Yan, Guiying
    [J]. FRONTIERS IN GENETICS, 2019, 10
  • [8] A Novel Method for LncRNA-Disease Association Prediction Based on an lncRNA-Disease Association Network
    Ping, Pengyao
    Wang, Lei
    Kuang, Linai
    Ye, Songtao
    Iqbal, Muhammad Faisal Buland
    Pei, Tingrui
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 688 - 693
  • [9] WLDAP: A computational model of weighted lncRNA-disease associations prediction
    Xie, Guobo
    Wu, Lifeng
    Lin, Zhiyi
    Cui, Ji
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 558
  • [10] Neighborhood based computational approaches for the prediction of lncRNA-disease associations
    Bonomo, Mariella
    Rombo, Simona E.
    [J]. BMC BIOINFORMATICS, 2024, 25 (01):