Predicting Human lncRNA-Disease Associations Based on Geometric Matrix Completion

被引:37
|
作者
Lu, Chengqian [1 ,2 ]
Yang, Mengyun [1 ,2 ]
Li, Min [1 ,2 ]
Li, Yaohang [3 ]
Wu, Fang-Xiang [4 ,5 ]
Wang, Jianxin [1 ,2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Cent South Univ, Hunan Prov Key Lab Bioinformat, Changsha 410083, Peoples R China
[3] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
[4] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada
[5] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada
基金
中国国家自然科学基金;
关键词
Long non-coding; disease; geometric; matrix completion; LONG NONCODING RNA; RENAL-CELL CARCINOMA; IDENTIFICATION; EXPRESSION; SIMILARITY; ONTOLOGY; CANCER;
D O I
10.1109/JBHI.2019.2958389
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, increasing evidences reveal that dysregulations of long non-coding RNAs (lncRNAs) are relevant to diverse diseases. However, the number of experimentally verified lncRNA-disease associations is limited. Prioritizing potential associations is beneficial not only for disease diagnosis, but also disease treatment, more important apprehending disease mechanisms at lncRNA level. Various computational methods have been proposed, but precise prediction and full use of data's intrinsic structure are still challenging. In this work, we design a new method, denominated GMCLDA (Geometric Matrix Completion lncRNA-Disease Association), to infer underlying associations based on geometric matrix completion. Utilizing association patterns among functionally similar lncRNAs and phenotypically similar diseases, GMCLCA makes use of the intrinsic structure embedded in the association matrix. Besides, limiting the scope of the predicted values gives rise to a certain sparsity in computation and enhances the robustness of GMCLDA. GMCLDA computes disease semantic similarity according to the Disease Ontology (DO) hierarchy and lncRNA Gaussian interaction profile kernel similarity according to known interaction profiles. Then, GMCLDA measures lncRNA sequence similarity using Needleman-Wunsch algorithm. For a new lncRNA, GMCLDA prefills interaction profile on account of its K-nearest neighbors defined by sequence similarity. Finally, GMCLDA estimates the missing entries of the association matrix based on geometric matrix completion model. Compared with state-of-the-art methods, GMCLDA can provide more accurate lncRNA-disease prediction. Further case studies prove that GMCLDA is able to correctly infer possible lncRNAs for renal cancer.
引用
收藏
页码:2420 / 2429
页数:10
相关论文
共 50 条
  • [1] Prediction of lncRNA-disease associations based on inductive matrix completion
    Lu, Chengqian
    Yang, Mengyun
    Luo, Feng
    Wu, Fang-Xiang
    Li, Min
    Pan, Yi
    Li, Yaohang
    Wang, Jianxin
    [J]. BIOINFORMATICS, 2018, 34 (19) : 3357 - 3364
  • [2] HOPMCLDA: predicting lncRNA-disease associations based on high-order proximity and matrix completion
    Xie, Guobo
    Zhu, Yinting
    Lin, Zhiyi
    Sun, Yuping
    Gu, Guosheng
    Wang, Weiming
    Chen, Hui
    [J]. MOLECULAR OMICS, 2021, 17 (05) : 760 - 768
  • [3] NFMCLDA: Predicting miRNA-based lncRNA-disease associations by network fusion and matrix completion
    Ma, Yibing
    Shi, Yongle
    Chen, Xiang
    Zhang, Bai
    Wu, Hanwen
    Gao, Jie
    [J]. Computers in Biology and Medicine, 2024, 174
  • [4] Inferring LncRNA-disease associations based on graph autoencoder matrix completion
    Wu, Ximin
    Lan, Wei
    Chen, Qingfeng
    Dong, Yi
    Liu, Jin
    Peng, Wei
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 87
  • [5] BRWMC: Predicting lncRNA-disease associations based on bi-random walk and matrix completion on disease and lncRNA networks
    Zhang, Guo-Zheng
    Gao, Ying-Lian
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2023, 103
  • [6] Weighted matrix factorization based data fusion for predicting lncRNA-disease associations
    Yu, Guoxian
    Wang, Yuehui
    Wang, Jun
    Fu, Guangyuan
    Guo, Maozu
    Domeniconi, Carlotta
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 572 - 577
  • [7] ALSBMF: Predicting lncRNA-Disease Associations by Alternating Least Squares Based on Matrix Factorization
    Zhu, Wen
    Huang, Kaimei
    Xiao, Xiaofang
    Liao, Bo
    Yao, Yuhua
    Wu, Fang-Xiang
    [J]. IEEE ACCESS, 2020, 8 : 26190 - 26198
  • [8] Geometric complement heterogeneous information and random forest for predicting lncRNA-disease associations
    Yao, Dengju
    Zhang, Tao
    Zhan, Xiaojuan
    Zhang, Shuli
    Zhan, Xiaorong
    Zhang, Chao
    [J]. FRONTIERS IN GENETICS, 2022, 13
  • [9] LncDisease: a sequence based bioinformatics tool for predicting lncRNA-disease associations
    Wang, Junyi
    Ma, Ruixia
    Ma, Wei
    Chen, Ji
    Yang, Jichun
    Xi, Yaguang
    Cui, Qinghua
    [J]. NUCLEIC ACIDS RESEARCH, 2016, 44 (09)
  • [10] Prediction of lncRNA-disease associations based on matrix factorization and neural network
    Hu, Xiaocao
    Wu, Haoyang
    Liu, Yuxin
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2765 - 2770