RPIPCM: A deep network model for predicting lncRNA-protein interaction based on sequence feature encoding

被引:0
|
作者
Gong, Lejun [1 ]
Chen, Jingmei [1 ]
Cui, Xiong [1 ]
Liu, Yang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
基金
中国博士后科学基金;
关键词
lncRNA-protein interaction; Sequence; Feature encoding; Deep network; NCRNA; MECHANISMS; DATABASE;
D O I
10.1016/j.compbiomed.2023.107366
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
LncRNA-protein interactionplays an important regulatory role in biological processes. In this paper, the proposed RPIPCM based on a novel deep network model uses the sequence feature encoding of both RNA and protein to predict lncRNA-protein interactions (LPIs). A negative sampling of sliding window method is proposed for solving the problem of unbalanced between positive and negative samples. The proposed negative sampling method is effective and helpful to solve the problem of data imbalance in the existing LPIs research by comparative experiments. Experimental results also show that the proposed sequence feature encoding method has good performance in predicting LPIs for different datasets of different sizes and types. In the RPI488 dataset related to animal, compared with the direct original sequence encoding model, the accuracy of sequence feature encoding model increased by 1.02%, the recall increased by 4.08%, and the value of MCC increased by 1.67%. In the case of the plant dataset ATH948, the sequence feature-based encoding demonstrated a 1.58% higher accuracy, a 1.53% higher recall, a 1.62% higher specificity, a 1.62% higher precision, and a 3.16% higher value of MCC compared to the direct original sequence-based encoding. Compared with the latest prediction work in the ZEA22133 dataset, RPIPCM is shown to be more effective with the accuracy increased by 2.23%, the recall increased by 1.78%, the specificity increased by 2.67%, the precision increased by 2.52%, and the value of MCC increased by 4.43%, which also proves the effectiveness and robustness of RPIPCM. In conclusion, RPIPCM of deep network model based on sequence feature encoding can automatically mine the hidden feature information of the sequence in the lncRNA-protein interaction without relying on external features or prior biomedical knowledge, and its low cost and high efficiency can provide a reference for biomedical researchers.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure
    Tian, Xiongfei
    Shen, Ling
    Wang, Zhenwu
    Zhou, Liqian
    Peng, Lihong
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [22] LPI-CSFFR: Combining serial fusion with feature reuse for predicting LncRNA-protein interactions
    Huang, Xiaoqian
    Shi, Yi
    Yan, Jing
    Qu, Wenyan
    Li, Xiaoyi
    Tan, Jianjun
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 99
  • [23] Predicting lncRNA-protein interactions with bipartite graph embedding and deep graph neural networks
    Ma, Yuzhou
    Zhang, Han
    Jin, Chen
    Kang, Chuanze
    FRONTIERS IN GENETICS, 2023, 14
  • [24] A Comparison Study of Predicting lncRNA-Protein Interactions via Representative Network Embedding Methods
    Zhao, Guoqing
    Li, Pengpai
    Liu, Zhi-Ping
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 3 - 13
  • [25] A text feature-based approach for literature mining of lncRNA-protein interactions
    Li, Ao
    Zang, Qiguang
    Sun, Dongdong
    Wang, Minghui
    NEUROCOMPUTING, 2016, 206 : 73 - 80
  • [26] RLF-LPI: An ensemble learning framework using sequence information for predicting lncRNA-protein interaction based on AE-ResLSTM and fuzzy decision
    Song, Jinmiao
    Tian, Shengwei
    Yu, Long
    Yang, Qimeng
    Dai, Qiguo
    Wang, Yuanxu
    Wu, Weidong
    Duan, Xiaodong
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (05) : 4749 - 4764
  • [27] Cross-domain contrastive graph neural network for lncRNA-protein interaction prediction
    Li, Hui
    Wu, Bin
    Sun, Miaomiao
    Zhu, Zhenfeng
    Chen, Kuisheng
    Ge, Hong
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [28] LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification
    Zhou, Liqian
    Duan, Qi
    Tian, Xiongfei
    Xu, He
    Tang, Jianxin
    Peng, Lihong
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [29] LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification
    Liqian Zhou
    Qi Duan
    Xiongfei Tian
    He Xu
    Jianxin Tang
    Lihong Peng
    BMC Bioinformatics, 22
  • [30] IEssLnc: quantitativeestimation of lncRNA gene essentialities with meta- path-guided random walks on the lncRNA-protein interaction network
    Zhang, Ying-Yin
    Liang, De-Min
    Du, Pu-Feng
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)