Deep Forest-based Prediction of Protein Subcellular Localization

被引:12
|
作者
Zhao, Lingling [1 ]
Wang, Junjie [1 ]
Nabil, Mahieddine Mohammed [1 ]
Zhang, Jun [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, POB 320,92 West Dazhi St, Harbin, Heilongjiang, Peoples R China
[2] Heilongjiang Prov Land Reclamat Headquarters Gen, Dept Rehabil, Harbin, Heilongjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein subcellular location; Machine learning; Deep forest; Sequence information; UniProt; Algorithm's;
D O I
10.2174/1566523218666180913110949
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Motivation: Knowledge of the correct protein subcellular localization is necessary for understanding the function of a protein and revealing the mechanism of many human diseases due to protein subcellular mislocalization, which is required before approaching gene therapy to treat a disease. In addition, it is well-known that the gene therapy is an effective way to overcome disease by targeting a gene therapy product to a specific subcellular compartment. Deep neural networks to predict protein function have become increasingly popular due to large increases in the available genomics data due to its strong superiority in the non-linear classification ability. However, they still have some drawbacks such as too many hyper-parameters and sufficient amount of labeled data. Results: We present a deep forest-based protein location algorithm relying on sequence information. The prediction model uses a random forest network with a multi-layered structure to identify the subcellular regions of protein. The model was trained and tested on a latest UniProt releases protein dataset, and we demonstrate that our deep forest predict the subcellular location of proteins given only the protein sequence with high accuracy, outperforming the current state-of-art algorithms. Meanwhile, unlike the deep neural networks, it has a significantly smaller number of parameters and is much easier to train.
引用
收藏
页码:268 / 274
页数:7
相关论文
共 50 条
  • [1] Forest-based Deep Recommender
    Feng, Chao
    Lian, Defu
    Liu, Zheng
    Xie, Xing
    Wu, Le
    Chen, Enhong
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 523 - 532
  • [2] DeepLoc: prediction of protein subcellular localization using deep learning
    Armenteros, Jose Juan Almagro
    Sonderby, Casper Kaae
    Sonderby, Soren Kaae
    Nielsen, Henrik
    Winther, Ole
    [J]. BIOINFORMATICS, 2017, 33 (21) : 3387 - 3395
  • [3] A DEEP NEURAL NETWORK APPROACH FOR THE PREDICTION OF PROTEIN SUBCELLULAR LOCALIZATION
    Samson, A. B. P.
    Chandra, S. R. A.
    Manikant, M.
    [J]. NEURAL NETWORK WORLD, 2021, 31 (01) : 29 - 45
  • [4] Protein subcellular and secreted localization prediction using deep learning
    Zidoum, Hamza
    Magdy, Mennatollah
    [J]. PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES AND ENGINEERING (ICCSE), 2018,
  • [5] Prediction of human protein subcellular localization using deep learning
    Wei, Leyi
    Ding, Yijie
    Su, Ran
    Tang, Jijun
    Zou, Quan
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 212 - 217
  • [6] Prediction of protein subcellular localization
    Yu, Chin-Sheng
    Chen, Yu-Ching
    Lu, Chih-Hao
    Hwang, Jenn-Kang
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (03) : 643 - 651
  • [7] Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
    Zhu, Jie
    Shan, Ying
    Mao, J. C.
    Yu, Dong
    Rahmanian, Holakou
    Zhang, Yi
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1703 - 1711
  • [8] Deep Forest-based Disease Prediction and Diagnosis under the Concept of Digital Health
    Mei, Xiangxiang
    Shen, Hao
    Wu, Fang
    Cai, Xiaodan
    Chen, Hongyun
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [9] Protein subcellular localization prediction tools
    Gillani, Maryam
    Pollastri, Gianluca
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 1796 - 1807
  • [10] Review of Protein Subcellular Localization Prediction
    Wang, Zhen
    Zou, Quan
    Jiang, Yi
    Ju, Ying
    Zeng, Xiangxiang
    [J]. CURRENT BIOINFORMATICS, 2014, 9 (03) : 331 - 342