Deep Protein Subcellular Localization Predictor Enhanced with Transfer Learning of GO Annotation

Cited by: 3
Authors
Yuan, Xin [1]
Pang, Erli [2]
Lin, Kui [2]
Hu, Jinglu [1]
Affiliations
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan
[2] Beijing Normal Univ, Coll Life Sci, 19 Xinjiekou Outer St, Beijing, Peoples R China
Keywords
subcellular localization prediction; GO annotation; deep feature extractor; deep neural network; transfer learning; support vector machines; similarity
DOI
10.1002/tee.23330
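Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract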
Now that large-scale protein sequence data are available, it has become possible to apply deep neural networks to mine better features from the sequences. Eukaryotic protein subcellular localization prediction, which contributes to the understanding of many biological processes, relies on protein sequences in many automated prediction methods. Moreover, gene ontology (GO) annotation has been shown to improve the accuracy of subcellular localization prediction. However, experimentally annotated proteins are not always available; they exist mainly for certain species such as human, mouse, and Arabidopsis thaliana. This motivates performing deep learning of GO annotations on the available experimentally annotated proteins and transferring what is learned to subcellular localization prediction for other species. In this paper, we propose a deep protein subcellular localization predictor consisting of a linear classifier and a deep convolutional neural network (CNN) feature extractor. The deep CNN feature extractor is first shared and pre-trained in a deep GO annotation predictor, and is then transferred to the subcellular localization predictor and fine-tuned using protein localization samples. In this way, we obtain a deep protein subcellular localization predictor enhanced with transfer learning of GO annotation. The proposed method performs well on the Swiss-Prot datasets when transfer learning uses protein samples from both within and outside the target species. Moreover, it outperforms state-of-the-art traditional methods on benchmark datasets. © 2021 Institute of Electrical Engineers of Japan. Published by Wiley Periodicals LLC.
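The two-stage arrangement described in the abstract (a CNN feature extractor shared with a GO annotation predictor, then reused under a linear localization classifier) can be sketched structurally as follows. This is a minimal illustration only, not the authors' implementation: the class names, the single convolution layer, the filter count, and the numbers of GO terms and localization classes are all hypothetical, and both training loops (pre-training and fine-tuning) are omitted.

```python
import copy
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}

N_FILTERS = 8     # hypothetical feature dimension
N_GO_TERMS = 5    # hypothetical number of GO terms
N_LOCATIONS = 4   # hypothetical number of localization classes


class ConvFeatureExtractor:
    """One 1D convolution + ReLU + global max pooling (illustrative only)."""

    def __init__(self, n_filters=N_FILTERS, width=3, seed=0):
        rng = random.Random(seed)
        self.width = width
        # weights[f][k][a]: filter f, window offset k, amino acid index a
        self.weights = [[[rng.uniform(-0.1, 0.1) for _ in AMINO_ACIDS]
                         for _ in range(width)] for _ in range(n_filters)]
        self.bias = [0.0] * n_filters

    def __call__(self, seq):
        idx = [AA_INDEX[aa] for aa in seq]
        feats = []
        for w, b in zip(self.weights, self.bias):
            best = 0.0  # ReLU followed by max pooling is never below zero
            for start in range(len(idx) - self.width + 1):
                act = b + sum(w[k][idx[start + k]] for k in range(self.width))
                best = max(best, act)
            feats.append(best)
        return feats


class LinearHead:
    """Linear classifier mapping extracted features to class scores."""

    def __init__(self, n_in, n_out, seed=0):
        rng = random.Random(seed)
        self.weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                        for _ in range(n_out)]
        self.bias = [0.0] * n_out

    def __call__(self, feats):
        return [b + sum(w * x for w, x in zip(row, feats))
                for row, b in zip(self.weights, self.bias)]


# Stage 1: the extractor sits under a GO annotation head
# (pre-training on GO-annotated proteins would happen here; omitted).
extractor = ConvFeatureExtractor()
go_head = LinearHead(N_FILTERS, N_GO_TERMS, seed=1)

# Stage 2: copy the extractor's weights and attach a new localization head
# (fine-tuning on localization samples would happen here; omitted).
loc_extractor = copy.deepcopy(extractor)
loc_head = LinearHead(N_FILTERS, N_LOCATIONS, seed=2)

seq = "MKTAYIAKQR"  # made-up example sequence
go_scores = go_head(extractor(seq))
loc_scores = loc_head(loc_extractor(seq))
```

Before any fine-tuning, `loc_extractor` produces the same features as `extractor`; only the classifier head differs between the two predictors, which is the point of the transfer step.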
Pages: 559-567
Number of pages: 9