Identification of plant vacuole proteins by exploiting deep representation learning features

被引:7
|
作者
Jiao, Shihu [1 ]
Zou, Quan [1 ,2 ,3 ]
机构
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Quzhou, Quzhou, Peoples R China
[2] Northeast Forestry Univ, State Key Lab Tree Genet & Breeding, Harbin, Peoples R China
[3] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Peoples R China
关键词
Vacuole proteins; Machine learning; Deep representation learning; Feature selection; Light gradient boosting machine; TRANSPORTERS; BIOGENESIS;
D O I
10.1016/j.csbj.2022.06.002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Plant vacuoles are the most important organelles for plant growth, development, and defense, and they play an important role in many types of stress responses. An important function of vacuole proteins is the transport of various classes of amino acids, ions, sugars, and other molecules. Accurate identification of vacuole proteins is crucial for revealing their biological functions. Several automatic and rapid computational tools have been proposed for the subcellular localization of proteins. Regrettably, they are not specific for the identification of plant vacuole proteins. To the best of our knowledge, there is only one computational software specifically trained for plant vacuolar proteins. Although its accuracy is acceptable, the prediction performance and stability of this method in practical applications can still be improved. Hence, in this study, a new predictor named iPVP-DRLF was developed to identify plant vacuole proteins specifically and effectively. This prediction software is designed using the light gradient boosting machine (LGBM) algorithm and hybrid features composed of classic sequence features and deep representation learning features. iPVP-DRLF achieved fivefold cross-validation and independent test accuracy values of 88.25 % and 87.16 %, respectively, both outperforming previous state-of-the-art predictors. Moreover, the blind dataset test results also showed that the performance of iPVP-DRLF was significantly better than the existing tools. The results of comparative experiments confirmed that deep representation learning features have an advantage over other classic sequence features in the identification of plant vacuole proteins. We believe that iPVP-DRLF would serve as an effective computational technique for plant vacuole protein prediction and facilitate related future research. The online server is freely accessible at https://lab.malab.cn/~acy/iPVP-DRLF. In addition, the source code and datasets are also accessible at https://github.com/jiaoshihu/iPVP-DRLF. (c) 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:2921 / 2927
页数:7
相关论文
共 50 条
  • [1] Deep Learning Based Radio Frequency Fingerprint Identification by Exploiting Spatial Stereoscopic Features
    Zhang, Shunliang
    Li, Jing
    Guo, Xiaolei
    [J]. MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, MOBIQUITOUS 2023, PT II, 2024, 594 : 511 - 519
  • [2] PortPred: Exploiting deep learning embeddings of amino acid sequences for the identification of transporter proteins and their substrates
    Anteghini, Marco
    dos Santos, Vitor A. P. Martins
    Saccenti, Edoardo
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2023, 124 (11) : 1870 - 1885
  • [3] Identification of sub-Golgi protein localization by use of deep representation learning features
    Lv, Zhibin
    Wang, Pingping
    Zou, Quan
    Jiang, Qinghua
    [J]. BIOINFORMATICS, 2021, 36 (24) : 5600 - 5609
  • [4] Identification of sub-Golgi protein localization by use of deep representation learning features
    Lv, Zhibin
    Wang, Pingping
    Zou, Quan
    Jiang, Qinghua
    [J]. BIOINFORMATICS, 2020, 36 (24) : 5600 - 5609
  • [5] Deep learning on chaos game representation for proteins
    Loechel, Hannah F.
    Eger, Dominic
    Sperlea, Theodor
    Heider, Dominik
    [J]. BIOINFORMATICS, 2020, 36 (01) : 272 - 279
  • [6] PLANT VACUOLE DEGRADES EXOGENOUS PROTEINS
    MORIYASU, Y
    TAZAWA, M
    [J]. PROTOPLASMA, 1986, 130 (2-3) : 214 - 215
  • [7] Targeting defence proteins to the plant vacuole
    Anderson, MA
    Miller, EA
    Chen, L
    Johnson, ED
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2000, 11 : 526A - 526A
  • [8] Identification of plant vacuole proteins by using graph neural network and contact maps
    Sui, Jianan
    Chen, Jiazi
    Chen, Yuehui
    Iwamori, Naoki
    Sun, Jin
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [9] Identification of plant vacuole proteins by using graph neural network and contact maps
    Jianan Sui
    Jiazi Chen
    Yuehui Chen
    Naoki Iwamori
    Jin Sun
    [J]. BMC Bioinformatics, 24
  • [10] Deep Contextual Representation Learning for Identifying Essential Proteins via Integrating Multisource Protein Features
    Li Weihua
    Liu Wenyang
    Guo Yanbu
    Wang Bingyi
    Qing Hua
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (04) : 868 - 881