SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information

被引:4
|
作者
Ren, Zhong-Hao [1 ]
Yu, Chang-Qing [1 ]
Li, Li-Ping [1 ]
You, Zhu-Hong [2 ]
Guan, Yong-Jian [1 ]
Li, Yue-Chao [1 ]
Pan, Jie [1 ]
机构
[1] Xijing Univ, Sch Informat Engn, Xian, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
ncRNA-protein interactions; ncRNA; ensemble learning; sequence analysis; natural language processing; AMINO-ACID-COMPOSITION; LONG NONCODING RNAS; SPECIFICITIES; LNCRNAS;
D O I
10.3389/fgene.2022.839540
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Non-coding RNAs (ncRNAs) take essential effects on biological processes, like gene regulation. One critical way of ncRNA executing biological functions is interactions between ncRNA and RNA binding proteins (RBPs). Identifying proteins, involving ncRNA-protein interactions, can well understand the function ncRNA. Many high-throughput experiment have been applied to recognize the interactions. As a consequence of these approaches are time- and labor-consuming, currently, a great number of computational methods have been developed to improve and advance the ncRNA-protein interactions research. However, these methods may be not available to all RNAs and proteins, particularly processing new RNAs and proteins. Additionally, most of them cannot process well with long sequence. In this work, a computational method SAWRPI is proposed to make prediction of ncRNA-protein through sequence information. More specifically, the raw features of protein and ncRNA are firstly extracted through the k-mer sparse matrix with SVD reduction and learning nucleic acid symbols by natural language processing with local fusion strategy, respectively. Then, to classify easily, Hilbert Transformation is exploited to transform raw feature data to the new feature space. Finally, stacking ensemble strategy is adopted to learn high-level abstraction features automatically and generate final prediction results. To confirm the robustness and stability, three different datasets containing two kinds of interactions are utilized. In comparison with state-of-the-art methods and other results classifying or feature extracting strategies, SAWRPI achieved high performance on three datasets, containing two kinds of lncRNA-protein interactions. Upon our finding, SAWRPI is a trustworthy, robust, yet simple and can be used as a beneficial supplement to the task of predicting ncRNA-protein interactions.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] NPI-RGCNAE: Fast Predicting ncRNA-Protein Interactions Using the Relational Graph Convolutional Network Auto-Encoder
    Yu, Han
    Shen, Zi-Ang
    Du, Pu-Feng
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (04) : 1861 - 1871
  • [22] MHAM-NPI: Predicting ncRNA-protein interactions based on multi-head attention mechanism
    Zhou, Zhecheng
    Du, Zhenya
    Wei, Jinhang
    Zhuo, Linlin
    Pan, Shiyao
    Fu, Xiangzheng
    Lian, Xinze
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 163
  • [23] HeadTailTransfer: An efficient sampling method to improve the performance of graph neural network method in predicting sparse ncRNA-protein interactions
    Wei, Jinhang
    Zhuo, Linlin
    Pan, Shiyao
    Lian, Xinze
    Yao, Xiaojun
    Fu, Xiangzheng
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 157
  • [24] Predicting RNA-Protein Interactions Using Only Sequence Information
    Usha K Muppirala
    Vasant G Honavar
    Drena Dobbs
    [J]. BMC Bioinformatics, 12
  • [25] Predicting RNA-Protein Interactions Using Only Sequence Information
    Muppirala, Usha K.
    Honavar, Vasant G.
    Dobbs, Drena
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [26] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [27] NPI-DCGNN: An Accurate Tool for Identifying ncRNA-Protein Interactions Using a Dual-Channel Graph Neural Network
    Zhang, Xin
    Zhao, Liangwei
    Chai, Ziyi
    Wu, Hao
    Yang, Wei
    Li, Chen
    Jiang, Yu
    Liu, Quanzhong
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (08) : 742 - 756
  • [28] Predicting Protein-Protein Interactions Using Sequence and Network Information via Variational Graph Autoencoder
    Luo, Xin
    Wang, Liwei
    Hu, Pengwei
    Hu, Lun
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3182 - 3194
  • [29] Predicting protein structure using only sequence information
    Karplus, K
    Barrett, C
    Cline, M
    Diekhans, M
    Grate, L
    Hughey, R
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1999, : 121 - 125
  • [30] Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
    Zhan, Xinke
    Xiao, Mang
    You, Zhuhong
    Yan, Chenggang
    Guo, Jianxin
    Wang, Liping
    Sun, Yaoqi
    Shang, Bingwan
    [J]. BIOLOGY-BASEL, 2022, 11 (07):