MSLP: mRNA subcellular localization predictor based on machine learning techniques

被引:9
|
作者
Musleh, Saleh [1 ]
Islam, Mohammad Tariqul [2 ]
Qureshi, Rizwan [1 ]
Alajez, Nihad [3 ,4 ]
Alam, Tanvir [1 ]
机构
[1] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
[2] Southern Connecticut State Univ, Comp Sci Dept, New Haven, CT USA
[3] Hamad Bin Khalifa Univ, Qatar Biomed Res Inst QBRI, Translat Canc & Immun Ctr TC, Doha, Qatar
[4] Hamad Bin Khalifa Univ, Coll Hlth & Life Sci, Doha, Qatar
关键词
RNA; mRNA; Machine learning; Sequence analysis; Localization prediction; Subcellular localization; NERVOUS-SYSTEM; RNALOCATE; SEQUENCES; RESOURCE;
D O I
10.1186/s12859-023-05232-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Subcellular localization of messenger RNA (mRNAs) plays a pivotal role in the regulation of gene expression, cell migration as well as in cellular adaptation. Experiment techniques for pinpointing the subcellular localization of mRNAs are laborious, time-consuming and expensive. Therefore, in silico approaches for this purpose are attaining great attention in the RNA community. Methods: In this article, we propose MSLP, a machine learning-based method to predict the subcellular localization of mRNA. We propose a novel combination of four types of features representing k-mer, pseudo k-tuple nucleotide composition (PseKNC), physicochemical properties of nucleotides, and 3D representation of sequences based on Z-curve transformation to feed into machine learning algorithm to predict the subcellular localization of mRNAs. Results: Considering the combination of the above-mentioned features, ennsemble-based models achieved state-of-the-art results in mRNA subcellular localization prediction tasks for multiple benchmark datasets. We evaluated the performance of our method in ten subcellular locations, covering cytoplasm, nucleus, endoplasmic reticulum (ER), extracellular region (ExR), mitochondria, cytosol, pseudopodium, posterior, exosome, and the ribosome. Ablation study highlighted k-mer and PseKNC to be more dominant than other features for predicting cytoplasm, nucleus, and ER localizations. On the other hand, physicochemical properties and Z-curve based features contributed the most to ExR and mitochondria detection. SHAP-based analysis revealed the relative importance of features to provide better insights into the proposed approach. Availability: We have implemented a Docker container and API for end users to run their sequences on our model. Datasets, the code of API and the Docker are shared for the community in GitHub at: https://github.com/smusleh/MSLP.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] DRpred: A Novel Deep Learning-Based Predictor for Multi-Label mRNA Subcellular Localization Prediction by Incorporating Bayesian Inferred Prior Label Relationships
    Wang, Xiao
    Yang, Lixiang
    Wang, Rong
    BIOMOLECULES, 2024, 14 (09)
  • [22] MSlocPRED: deep transfer learning-based identification of multi-label mRNA subcellular localization
    Zuo, Yun
    Zhang, Bangyi
    He, Wenying
    Bi, Yue
    Liu, Xiangrong
    Zeng, Xiangxiang
    Deng, Zhaohong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [23] Combining Machine Learning and Homology-Based Approaches to Accurately Predict Subcellular Localization in Arabidopsis
    Kaundal, Rakesh
    Saini, Reena
    Zhao, Patrick X.
    PLANT PHYSIOLOGY, 2010, 154 (01) : 36 - 54
  • [24] Protein subcellular localization prediction using multiple kernel learning based support vector machine
    Hasan, Md. Al Mehedi
    Ahmad, Shamim
    Molla, Md. Khademul Islam
    MOLECULAR BIOSYSTEMS, 2017, 13 (04) : 785 - 795
  • [25] Subcellular localization of mRNA in neuronal cellsContributions of high-resolutionin situ Hybridization techniques
    Maryann E. Martone
    John A. Pollock
    Mark H. Ellisman
    Molecular Neurobiology, 1998, 18 : 227 - 246
  • [26] The identification and localization of speaker using fusion techniques and machine learning techniques
    Ali, Rasha H.
    Abdullah, Mohammed Najm
    Abed, Buthainah F.
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 133 - 149
  • [27] The identification and localization of speaker using fusion techniques and machine learning techniques
    Rasha H. Ali
    Mohammed Najm Abdullah
    Buthainah F. Abed
    Evolutionary Intelligence, 2024, 17 : 133 - 149
  • [28] Subcellular localization of expansin mRNA in xylem cells
    Im, KH
    Cosgrove, DJ
    Jones, AM
    PLANT PHYSIOLOGY, 2000, 123 (02) : 463 - 470
  • [29] Subcellular localization of mRNA in neuronal cells -: Contributions of high-resolution in situ hybridization techniques
    Martone, ME
    Pollock, JA
    Ellisman, MH
    MOLECULAR NEUROBIOLOGY, 1998, 18 (03) : 227 - 246
  • [30] Comparison of Machine Learning Techniques Based Brain Source Localization Using EEG Signals
    Jatoi, Munsif Ali
    Dharejo, Fayaz Ali
    Teevino, Sadam Hussain
    CURRENT MEDICAL IMAGING, 2021, 17 (01) : 64 - 72