EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction

被引:24
|
作者
Wang, Jingjing [1 ]
Zhao, Yanpeng [1 ]
Gong, Weikang [1 ]
Liu, Yang [1 ]
Wang, Mei [1 ]
Huang, Xiaoqian [1 ]
Tan, Jianjun [1 ]
机构
[1] Beijing Univ Technol, Dept Biomed Engn, Fac Environm & Life, Beijing Int Sci & Technol Cooperat Base Intellige, Beijing 100124, Peoples R China
基金
北京市自然科学基金;
关键词
ncRNA-protein interactions; Multi-scale features combination; Conjoint k-mer; Ensemble deep learning; Independent test; ncRNA-protein networks; LONG NONCODING RNAS; SECONDARY STRUCTURE; BINDING PROTEINS; NEURAL-NETWORKS; ACCURATE PREDICTION; GENE-EXPRESSION; WEB SERVER; SEQUENCE; DNA; IDENTIFICATION;
D O I
10.1186/s12859-021-04069-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Non-coding RNA (ncRNA) and protein interactions play essential roles in various physiological and pathological processes. The experimental methods used for predicting ncRNA-protein interactions are time-consuming and labor-intensive. Therefore, there is an increasing demand for computational methods to accurately and efficiently predict ncRNA-protein interactions. Results: In this work, we presented an ensemble deep learning-based method, EDLMFC, to predict ncRNA-protein interactions using the combination of multi-scale features, including primary sequence features, secondary structure sequence features, and tertiary structure features. Conjoint k-mer was used to extract protein/ncRNA sequence features, integrating tertiary structure features, then fed into an ensemble deep learning model, which combined convolutional neural network (CNN) to learn dominating biological information with bi-directional long short-term memory network (BLSTM) to capture long-range dependencies among the features identified by the CNN. Compared with other state-of-the-art methods under five-fold cross-validation, EDLMFC shows the best performance with accuracy of 93.8%, 89.7%, and 86.1% on RPI1807, NPlnter v2.0, and RP1488 datasets, respectively. The results of the independent test demonstrated that EDLMFC can effectively predict potential ncRNA-protein interactions from different organisms. Furtherly, EDLMFC is also shown to predict hub ncRNAs and proteins presented in ncRNA-protein networks of Mus musculus successfully. Conclusions: In general, our proposed method EDLMFC improved the accuracy of ncRNA-protein interaction predictions and anticipated providing some helpful guidance on ncRNA functions research. The source code of EDLMFC and the datasets used in this work are available at https://github.com/JingjingWang-87/EDLMFC.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA–protein interaction prediction
    Jingjing Wang
    Yanpeng Zhao
    Weikang Gong
    Yang Liu
    Mei Wang
    Xiaoqian Huang
    Jianjun Tan
    [J]. BMC Bioinformatics, 22
  • [2] RPITER: A Hierarchical Deep Learning Framework for ncRNA-Protein Interaction Prediction
    Peng, Cheng
    Han, Siyu
    Zhang, Hui
    Li, Ying
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (05)
  • [3] RPI-EDLCN: An Ensemble Deep Learning Framework Based on Capsule Network for ncRNA-Protein Interaction Prediction
    Li, Xiaoyi
    Qu, Wenyan
    Yan, Jing
    Tan, Jianjun
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 64 (07) : 2221 - 2235
  • [4] A Stacked Ensemble Learning Framework with Heterogeneous Feature Combinations for Predicting ncRNA-Protein Interaction
    Dai, Qiguo
    Wang, Zhaowei
    Song, Jinmiao
    Duan, Xiaodong
    Guo, Maozu
    Tian, Zhen
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 67 - 71
  • [5] A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information
    Yi, Hai-Cheng
    You, Zhu-Hong
    Huang, De-Shuang
    Li, Xiao
    Jiang, Tong-Hai
    Li, Li-Ping
    [J]. MOLECULAR THERAPY-NUCLEIC ACIDS, 2018, 11 : 337 - 344
  • [6] RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information
    Yi, Hai-Cheng
    You, Zhu-Hong
    Wang, Mei-Neng
    Guo, Zhen-Hao
    Wang, Yan-Bin
    Zhou, Ji-Ren
    [J]. BMC BIOINFORMATICS, 2020, 21 (01)
  • [7] RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information
    Hai-Cheng Yi
    Zhu-Hong You
    Mei-Neng Wang
    Zhen-Hao Guo
    Yan-Bin Wang
    Ji-Ren Zhou
    [J]. BMC Bioinformatics, 21
  • [8] Efficient Framework for Predicting ncRNA-Protein Interactions Based on Sequence Information by Deep Learning
    Zhan, Zhao-Hui
    You, Zhu-Hong
    Zhou, Yong
    Li, Li-Ping
    Li, Zheng-Wei
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 337 - 344
  • [9] BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information
    Zhan, Zhao-Hui
    Jia, Li-Na
    Zhou, Yong
    Li, Li-Ping
    Yi, Hai-Cheng
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (04)
  • [10] GATLGEMF: A graph attention model with line graph embedding multi-complex features for ncRNA-protein interactions prediction
    Yan, Jing
    Qu, Wenyan
    Li, Xiaoyi
    Wang, Ruobing
    Tan, Jianjun
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 108