Protein transfer learning improves identification of heat shock protein families

被引:14
|
作者
Min, Seonwoo [1 ]
Kim, HyunGi [1 ]
Lee, Byunghan [2 ]
Yoon, Sungroh [1 ,3 ,4 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Seoul Natl Univ Sci & Technol, Dept Elect & IT Media Engn, Seoul, South Korea
[3] Seoul Natl Univ, Dept Biol Sci, Interdisciplinary Program Artificial Intelligence, Interdisciplinary Program Bioinformat,ASRI,INMC, Seoul, South Korea
[4] Seoul Natl Univ, Inst Engn Res, Seoul, South Korea
来源
PLOS ONE | 2021年 / 16卷 / 05期
基金
新加坡国家研究基金会;
关键词
D O I
10.1371/journal.pone.0251865
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Heat shock proteins (HSPs) play a pivotal role as molecular chaperones against unfavorable conditions. Although HSPs are of great importance, their computational identification remains a significant challenge. Previous studies have two major limitations. First, they relied heavily on amino acid composition features, which inevitably limited their prediction performance. Second, their prediction performance was overestimated because of the independent two-stage evaluations and train-test data redundancy. To overcome these limitations, we introduce two novel deep learning algorithms: (1) time-efficient DeepHSP and (2) high-performance DeeperHSP. We propose a convolutional neural network (CNN)-based DeepHSP that classifies both non-HSPs and six HSP families simultaneously. It outperforms state-of-the-art algorithms, despite taking 14-15 times less time for both training and inference. We further improve the performance of DeepHSP by taking advantage of protein transfer learning. While DeepHSP is trained on raw protein sequences, DeeperHSP is trained on top of pre-trained protein representations. Therefore, DeeperHSP remarkably outperforms state-of-the-art algorithms increasing F1 scores in both cross-validation and independent test experiments by 20% and 10%, respectively. We envision that the proposed algorithms can provide a proteome-wide prediction of HSPs and help in various downstream analyses for pathology and clinical research.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Small Heat Shock Protein 22 Improves Cognition and Learning in the Tauopathic Brain
    Rodriguez Ospina, Santiago
    Blazier, Danielle M.
    Criado-Marrero, Marangelie
    Gould, Lauren A.
    Gebru, Niat T.
    Beaulieu-Abdelahad, David
    Wang, Xinming
    Remily-Wood, Elizabeth
    Chaput, Dale
    Stevens, Stanley, Jr.
    Uversky, Vladimir N.
    Bickford, Paula C.
    Dickey, Chad A.
    Blair, Laura J.
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (02)
  • [2] Bioresistive identification of heat shock protein 90
    Chandrasekaran, Arvind
    Pakkiriswami, Shanmugasundaram
    You, Jian Liang
    Acharya, Ashwin
    Packirisamy, Muthukumaran
    Maxwell, Denis
    [J]. BIOMICROFLUIDICS, 2008, 2 (03)
  • [3] MAMMALIAN HEAT-SHOCK PROTEIN FAMILIES - EXPRESSION AND FUNCTIONS
    BUREL, C
    MEZGER, V
    PINTO, M
    RALLU, M
    TRIGON, S
    MORANGE, M
    [J]. EXPERIENTIA, 1992, 48 (07): : 629 - 634
  • [4] Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC
    Ahmad, Saeed
    Kabir, Muhammad
    Hayat, Maqsood
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2015, 122 (02) : 165 - 174
  • [5] Heat shock response improves heterologous protein secretion in Saccharomyces cerevisiae
    Hou, Jin
    Osterlund, Tobias
    Liu, Zihe
    Petranovic, Dina
    Nielsen, Jens
    [J]. APPLIED MICROBIOLOGY AND BIOTECHNOLOGY, 2013, 97 (08) : 3559 - 3568
  • [6] Heat shock protein improves cold preserved small bowel grafts
    Tarumi, K
    Yagihashi, A
    Tsuruma, T
    Sakawaki, T
    Sasaki, KS
    Hirata, K
    [J]. TRANSPLANTATION PROCEEDINGS, 1998, 30 (07) : 3455 - 3458
  • [7] Heat shock response improves heterologous protein secretion in Saccharomyces cerevisiae
    Jin Hou
    Tobias Österlund
    Zihe Liu
    Dina Petranovic
    Jens Nielsen
    [J]. Applied Microbiology and Biotechnology, 2013, 97 : 3559 - 3568
  • [8] Targeting the 90 kDa heat shock protein improves photodynamic therapy
    Ferrario, Angela
    Gomer, Charles J.
    [J]. CANCER LETTERS, 2010, 289 (02) : 188 - 194
  • [9] Identification of small molecules that modify the protein folding activity of heat shock protein 70
    Wisen, Susanne
    Gestwicki, Jason E.
    [J]. ANALYTICAL BIOCHEMISTRY, 2008, 374 (02) : 371 - 377
  • [10] Detection and identification of heat shock protein 10 as a biomarker in colorectal cancer by protein profiling
    Melle, C
    Bogumil, R
    Ernst, G
    Schimmel, B
    Bleul, A
    von Eggeling, F
    [J]. PROTEOMICS, 2006, 6 (08) : 2600 - 2608