Unified mRNA Subcellular Localization Predictor based on machine learning techniques

被引:2
|
作者
Musleh, Saleh [1 ]
Arif, Muhammad [1 ]
Alajez, Nehad M. [2 ,3 ]
Alam, Tanvir [1 ]
机构
[1] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
[2] Hamad Bin Khalifa Univ, Qatar Biomed Res Inst QBRI, Translat Canc & Immun Ctr TCIC, Doha, Qatar
[3] Hamad Bin Khalifa Univ, Coll Hlth & Life Sci, Doha, Qatar
来源
BMC GENOMICS | 2024年 / 25卷 / 01期
关键词
Multiclass classification; mRNA; Subcellular Localization; Machine learning; SEQUENCES; PROTEINS;
D O I
10.1186/s12864-024-10077-9
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundThe mRNA subcellular localization bears substantial impact in the regulation of gene expression, cellular migration, and adaptation. However, the methods employed for experimental determination of this localization are arduous, time-intensive, and come with a high cost.MethodsIn this research article, we tackle the essential challenge of predicting the subcellular location of messenger RNAs (mRNAs) through Unified mRNA Subcellular Localization Predictor (UMSLP), a machine learning (ML) based approach. We embrace an in silico strategy that incorporate four distinct feature sets: kmer, pseudo k-tuple nucleotide composition, nucleotide physicochemical attributes, and the 3D sequence depiction achieved via Z-curve transformation for predicting subcellular localization in benchmark dataset across five distinct subcellular locales, encompassing nucleus, cytoplasm, extracellular region (ExR), mitochondria, and endoplasmic reticulum (ER).ResultsThe proposed ML model UMSLP attains cutting-edge outcomes in predicting mRNA subcellular localization. On independent testing dataset, UMSLP ahcieved over 87% precision, 94% specificity, and 94% accuracy. Compared to other existing tools, UMSLP outperformed mRNALocator, mRNALoc, and SubLocEP by 11%, 21%, and 32%, respectively on average prediction accuracy for all five locales. SHapley Additive exPlanations analysis highlights the dominance of k-mer features in predicting cytoplasm, nucleus, ER, and ExR localizations, while Z-curve based features play pivotal roles in mitochondria subcellular localization detection.AvailabilityWe have shared datasets, code, Docker API for users in GitHub at: https://github.com/smusleh/UMSLP.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] MSLP: mRNA subcellular localization predictor based on machine learning techniques
    Musleh, Saleh
    Islam, Mohammad Tariqul
    Qureshi, Rizwan
    Alajez, Nihad
    Alam, Tanvir
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [2] MSLP: mRNA subcellular localization predictor based on machine learning techniques
    Saleh Musleh
    Mohammad Tariqul Islam
    Rizwan Qureshi
    Nehad M. Alajez
    Tanvir Alam
    [J]. BMC Bioinformatics, 24
  • [3] Correction: MSLP: mRNA subcellular localization predictor based on machine learning techniques
    Saleh Musleh
    Mohammad Tariqul Islam
    Rizwan Qureshi
    Nehad M. Alajez
    Tanvir Alam
    [J]. BMC Bioinformatics, 24
  • [4] MSLP: mRNA subcellular localization predictor based on machine learning techniques (vol 24, 109, 2023)
    Musleh, Saleh
    Islam, Mohammad Tariqul
    Qureshi, Rizwan
    Alajez, Nehad M. M.
    Alam, Tanvir
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [5] SubLocEP: a novel ensemble predictor of subcellular localization of eukaryotic mRNA based on machine learning
    Li, Jing
    Zhang, Lichao
    He, Shida
    Guo, Fei
    Zou, Quan
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [6] DeepmRNALoc: A Novel Predictor of Eukaryotic mRNA Subcellular Localization Based on Deep Learning
    Wang, Shihang
    Shen, Zhehan
    Liu, Taigang
    Long, Wei
    Jiang, Linhua
    Peng, Sihua
    [J]. MOLECULES, 2023, 28 (05):
  • [7] mRNALoc: a novel machine-learning based in-silico tool to predict mRNA subcellular localization
    Garg, Anjali
    Singhal, Neelja
    Kumar, Ravindra
    Kumar, Manish
    [J]. NUCLEIC ACIDS RESEARCH, 2020, 48 (W1) : W239 - W243
  • [8] Prediction of subcellular localization of proteins using machine learning techniques and evolutionary information
    Raghava, G. P. S.
    [J]. AMINO ACIDS, 2007, 33 (03) : X - XI
  • [9] Extreme Learning Machine Based Bacterial Protein Subcellular Localization Prediction
    Lan, Yuan
    Soh, Yeng Chai
    Huang, Guang-Bin
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1859 - 1863
  • [10] Prediction of Protein Subcellular Localization using Machine Learning
    Upama, Paramita Basak
    Akhter, Shahin
    Bin Asad, Mohammad Imam Hasan
    [J]. 2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,