MRSLpred-a hybrid approach for predicting multi-label subcellular localization of mRNA at the genome scale

被引:2
|
作者
Choudhury, Shubham [1 ]
Bajiya, Nisha [1 ]
Patiyal, Sumeet [1 ]
Raghava, Gajendra P. S. [1 ]
机构
[1] Indraprastha Inst Informat Technol, Dept Computat Biol, New Delhi, India
来源
FRONTIERS IN BIOINFORMATICS | 2024年 / 4卷
关键词
subcellular localization; multi-label; motif search; messenger RNA; machine learning; GLOBAL ANALYSIS; RNALOCATE; TRANSPORT; RESOURCE; REVEALS;
D O I
10.3389/fbinf.2024.1341479
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the past, several methods have been developed for predicting the single-label subcellular localization of messenger RNA (mRNA). However, only limited methods are designed to predict the multi-label subcellular localization of mRNA. Furthermore, the existing methods are slow and cannot be implemented at a transcriptome scale. In this study, a fast and reliable method has been developed for predicting the multi-label subcellular localization of mRNA that can be implemented at a genome scale. Machine learning-based methods have been developed using mRNA sequence composition, where the XGBoost-based classifier achieved an average area under the receiver operator characteristic (AUROC) of 0.709 (0.668-0.732). In addition to alignment-free methods, we developed alignment-based methods using motif search techniques. Finally, a hybrid technique that combines the XGBoost model and the motif-based approach has been developed, achieving an average AUROC of 0.742 (0.708-0.816). Our method-MRSLpred-outperforms the existing state-of-the-art classifier in terms of performance and computation efficiency. A publicly accessible webserver and a standalone tool have been developed to facilitate researchers (webserver: https://webs.iiitd.edu.in/raghava/mrslpred/).
引用
收藏
页数:11
相关论文
共 50 条
  • [21] ENSEMBLE RANDOM PROJECTION FOR MULTI-LABEL CLASSIFICATION WITH APPLICATION TO PROTEIN SUBCELLULAR LOCALIZATION
    Wan, Shibiao
    Mak, Man-Wai
    Zhang, Bai
    Wang, Yue
    Kung, Sun-Yuan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [22] Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
    Xiaotong Guo
    Fulin Liu
    Ying Ju
    Zhen Wang
    Chunyu Wang
    Scientific Reports, 6
  • [23] Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier
    Guo, Xiaotong
    Liu, Fulin
    Ju, Ying
    Wang, Zhen
    Wang, Chunyu
    SCIENTIFIC REPORTS, 2016, 6
  • [24] iLoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites
    Wu, Zhi-Cheng
    Xiao, Xuan
    Chou, Kuo-Chen
    MOLECULAR BIOSYSTEMS, 2011, 7 (12) : 3287 - 3297
  • [25] Predicting subcellular localization of multisite proteins using differently weighted multi-label k-nearest neighbors sets
    Jiang, Zhongting
    Wang, Dong
    Wu, Peng
    Chen, Yuehui
    Shang, Huijie
    Wang, Luyao
    Xie, Huichun
    TECHNOLOGY AND HEALTH CARE, 2019, 27 : S185 - S193
  • [26] A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
    Xiao, Xuan
    Wu, Zhi-Cheng
    Chou, Kuo-Chen
    PLOS ONE, 2011, 6 (06):
  • [27] Predicting the multi-label protein subcellular localization through multi-information fusion and MLSI dimensionality reduction based on MLFE classifier
    Liu, Yushuang
    Jin, Shuping
    Gao, Hongli
    Wang, Xue
    Wang, Congjing
    Zhou, Weifeng
    Yu, Bin
    BIOINFORMATICS, 2022, 38 (05) : 1223 - 1230
  • [28] DeepLoc 2.0: multi-label subcellular localization prediction using protein language models
    Thumuluri, Vineet
    Armenteros, Jose Juan Almagro
    Johansen, Alexander Rosenberg
    Nielsen, Henrik
    Winther, Ole
    NUCLEIC ACIDS RESEARCH, 2022, 50 (W1) : W228 - W234
  • [29] Semi-supervised Predictive Clustering Trees for Multi-label Protein Subcellular Localization
    Alcantara, Leonardo U.
    Triguero, Isaac
    Cerri, Ricardo
    INTELLIGENT SYSTEMS, BRACIS 2024, PT II, 2025, 15413 : 384 - 399
  • [30] ADAPTIVE THRESHOLDING FOR MULTI-LABEL SVM CLASSIFICATION WITH APPLICATION TO PROTEIN SUBCELLULAR LOCALIZATION PREDICTION
    Wan, Shibiao
    Mak, Man-Wai
    Kung, Sun-Yuan
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3547 - 3551