RPIPCM: A deep network model for predicting lncRNA-protein interaction based on sequence feature encoding

被引:0
|
作者
Gong, Lejun [1 ]
Chen, Jingmei [1 ]
Cui, Xiong [1 ]
Liu, Yang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
基金
中国博士后科学基金;
关键词
lncRNA-protein interaction; Sequence; Feature encoding; Deep network; NCRNA; MECHANISMS; DATABASE;
D O I
10.1016/j.compbiomed.2023.107366
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
LncRNA-protein interactionplays an important regulatory role in biological processes. In this paper, the proposed RPIPCM based on a novel deep network model uses the sequence feature encoding of both RNA and protein to predict lncRNA-protein interactions (LPIs). A negative sampling of sliding window method is proposed for solving the problem of unbalanced between positive and negative samples. The proposed negative sampling method is effective and helpful to solve the problem of data imbalance in the existing LPIs research by comparative experiments. Experimental results also show that the proposed sequence feature encoding method has good performance in predicting LPIs for different datasets of different sizes and types. In the RPI488 dataset related to animal, compared with the direct original sequence encoding model, the accuracy of sequence feature encoding model increased by 1.02%, the recall increased by 4.08%, and the value of MCC increased by 1.67%. In the case of the plant dataset ATH948, the sequence feature-based encoding demonstrated a 1.58% higher accuracy, a 1.53% higher recall, a 1.62% higher specificity, a 1.62% higher precision, and a 3.16% higher value of MCC compared to the direct original sequence-based encoding. Compared with the latest prediction work in the ZEA22133 dataset, RPIPCM is shown to be more effective with the accuracy increased by 2.23%, the recall increased by 1.78%, the specificity increased by 2.67%, the precision increased by 2.52%, and the value of MCC increased by 4.43%, which also proves the effectiveness and robustness of RPIPCM. In conclusion, RPIPCM of deep network model based on sequence feature encoding can automatically mine the hidden feature information of the sequence in the lncRNA-protein interaction without relying on external features or prior biomedical knowledge, and its low cost and high efficiency can provide a reference for biomedical researchers.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Extracting LncRNA-protein Interactions from Literature Using a Text Feature-based Approach
    Zang, Qiguang
    Sun, Dongdong
    Feng, Huanqing
    Li, Ao
    IFAC PAPERSONLINE, 2015, 48 (28): : 22 - 26
  • [32] Predicting lncRNA-protein interactions through deep learning framework employing multiple features and random forest algorithm
    Liang, Ying
    Yin, XingRui
    Zhang, YangSen
    Guo, You
    Wang, YingLong
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [33] Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture
    Peng, Lihong
    Wang, Chang
    Tian, Xiongfei
    Zhou, Liqian
    Li, Keqin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3456 - 3468
  • [34] Capsule-LPI: a LncRNA–protein interaction predicting tool based on a capsule network
    Ying Li
    Hang Sun
    Shiyao Feng
    Qi Zhang
    Siyu Han
    Wei Du
    BMC Bioinformatics, 22
  • [35] LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification
    Zhou, Liqian
    Wang, Zhao
    Tian, Xiongfei
    Peng, Lihong
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [36] LPI-FKLGCN: Predicting LncRNA-Protein Interactions Through Fast Kernel Learning and Graph Convolutional Network
    Li, Wen
    Wang, Shulin
    Guo, Hu
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 227 - 238
  • [37] LPI-IBWA: Predicting lncRNA-protein interactions based on an improved Bi-Random walk
    Xie, Minzhu
    Xie, Ruijie
    Wang, Hao
    METHODS, 2023, 220 : 98 - 105
  • [38] EnANNDeep: An Ensemble-based lncRNA-protein Interaction Prediction Framework with Adaptive k-Nearest Neighbor Classifier and Deep Models
    Peng, Lihong
    Tan, Jingwei
    Tian, Xiongfei
    Zhou, Liqian
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (01) : 209 - 232
  • [39] Multi-feature Fusion Method Based on Linear Neighborhood Propagation Predict Plant LncRNA-Protein Interactions
    Jia, Lijuan
    Luan, Yushi
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2022, 14 (02) : 545 - 554
  • [40] HNSPPI: a hybrid computational model combing network and sequence information for predicting protein-protein interaction
    Xie, Shijie
    Xie, Xiaojun
    Zhao, Xin
    Liu, Fei
    Wang, Yiming
    Ping, Jihui
    Ji, Zhiwei
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)