MRSLpred-a hybrid approach for predicting multi-label subcellular localization of mRNA at the genome scale

Cited by: 2
Authors
Choudhury, Shubham [1]
Bajiya, Nisha [1]
Patiyal, Sumeet [1]
Raghava, Gajendra P. S. [1]
Affiliations
[1] Indraprastha Institute of Information Technology, Department of Computational Biology, New Delhi, India
Source
FRONTIERS IN BIOINFORMATICS | 2024 / Volume 4
Keywords
subcellular localization; multi-label; motif search; messenger RNA; machine learning; GLOBAL ANALYSIS; RNALOCATE; TRANSPORT; RESOURCE; REVEALS;
DOI
10.3389/fbinf.2024.1341479
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Several methods have been developed for predicting the single-label subcellular localization of messenger RNA (mRNA). However, only a few methods are designed to predict the multi-label subcellular localization of mRNA. Furthermore, the existing methods are slow and cannot be applied at transcriptome scale. In this study, a fast and reliable method has been developed for predicting the multi-label subcellular localization of mRNA that can be applied at genome scale. Machine learning-based methods were developed using mRNA sequence composition, where the XGBoost-based classifier achieved an average area under the receiver operating characteristic curve (AUROC) of 0.709 (0.668-0.732). In addition to these alignment-free methods, we developed alignment-based methods using motif search techniques. Finally, a hybrid technique that combines the XGBoost model and the motif-based approach was developed, achieving an average AUROC of 0.742 (0.708-0.816). Our method, MRSLpred, outperforms the existing state-of-the-art classifier in terms of both performance and computational efficiency. A publicly accessible webserver and a standalone tool have been developed to support researchers (webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
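The idea of combining a composition-based classifier with a motif search can be illustrated with a minimal Python sketch. It is not the MRSLpred implementation: the abstract does not say which composition features are used or how the two components are combined, so the k-mer frequencies, the additive `bonus` rule, the location names, and the motif strings below are all assumptions made for illustration. The per-location probabilities stand in for the output of a trained classifier such as XGBoost.

```python
from itertools import product


def kmer_composition(seq, k=2):
    """Return the normalized frequency of every k-mer over the alphabet ACGT."""
    seq = seq.upper().replace("U", "T")
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows containing ambiguous bases such as N
            counts[window] += 1
            total += 1
    return {km: (counts[km] / total if total else 0.0) for km in kmers}


def hybrid_scores(ml_probs, seq, motifs, bonus=0.5):
    """Raise a location's score when one of its motifs occurs in the sequence.

    ml_probs: location -> probability from a classifier (hypothetical input).
    motifs:   location -> list of motif strings (hypothetical input).
    bonus:    assumed additive weight for a motif hit; not taken from the paper.
    """
    seq = seq.upper().replace("U", "T")
    scores = {}
    for location, prob in ml_probs.items():
        hit = any(m in seq for m in motifs.get(location, []))
        scores[location] = min(1.0, prob + bonus) if hit else prob
    return scores


def predict_labels(scores, threshold=0.5):
    """Multi-label decision: keep every location whose score reaches the threshold."""
    return sorted(loc for loc, s in scores.items() if s >= threshold)


if __name__ == "__main__":
    sequence = "AAACGTTT"
    features = kmer_composition(sequence, k=2)  # would be the classifier's input
    ml_probs = {"nucleus": 0.3, "cytosol": 0.6}  # placeholder classifier output
    motifs = {"nucleus": ["ACGT"]}  # placeholder motif set
    print(predict_labels(hybrid_scores(ml_probs, sequence, motifs)))
```

Because each location is thresholded independently, a single sequence can receive several labels, which is what distinguishes multi-label from single-label prediction.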
Pages: 11