Improving protein fold recognition using triplet network and ensemble deep learning

被引:9
|
作者
Liu, Yan [1 ,2 ]
Han, Ke [1 ,2 ]
Zhu, Yi-Heng [1 ,2 ]
Zhang, Ying [1 ,2 ]
Shen, Long-Chen [1 ,2 ]
Song, Jiangning [3 ,4 ,5 ]
Yu, Dong-Jun [1 ,6 ,7 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, 200 Xiaolingwei, Nanjing 210094, Peoples R China
[2] Pattern Recognit & Bioinformat Grp, Delft, Netherlands
[3] Monash Univ, Biomed Discovery Inst, Melbourne, Vic 3800, Australia
[4] Monash Univ, Dept Biochem & Mol Biol, Melbourne, Vic 3800, Australia
[5] Monash Univ, Fac Informat Technol, Monash Ctr Data Sci, Melbourne, Vic, Australia
[6] China Comp Federat, Beijing, Peoples R China
[7] China Assoc Artificial Intelligence, Beijing, Peoples R China
基金
澳大利亚研究理事会; 英国医学研究理事会; 中国国家自然科学基金; 美国国家卫生研究院;
关键词
protein fold recognition; bioinformatics; convolutional neural network; ensemble deep learning; triplet loss; HIDDEN MARKOV-MODELS; SEQUENCE; SECONDARY; CLASSIFICATION; DYNAMICS; INFORMATION; PREDICTION; SERVER;
D O I
10.1093/bib/bbab248
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein fold recognition is a critical step toward protein structure and function prediction, aiming at providing the most likely fold type of the query protein. In recent years, the development of deep learning (DL) technique has led to massive advances in this important field, and accordingly, the sensitivity of protein fold recognition has been dramatically improved. Most DL-based methods take an intermediate bottleneck layer as the feature representation of proteins with new fold types. However, this strategy is indirect, inefficient and conditional on the hypothesis that the bottleneck layer's representation is assumed as a good representation of proteins with new fold types. To address the above problem, in this work, we develop a new computational framework by combining triplet network and ensemble DL. We first train a DL-based model, termed FoldNet, which employs triplet loss to train the deep convolutional network. FoldNet directly optimizes the protein fold embedding itself, making the proteins with the same fold types be closer to each other than those with different fold types in the new protein embedding space. Subsequently, using the trained FoldNet, we implement a new residue-residue contact-assisted predictor, termed FoldTR, which improves protein fold recognition. Furthermore, we propose a new ensemble DL method, termed FSD_XGBoost, which combines protein fold embedding with the other two discriminative fold-specific features extracted by two DL-based methods SSAfold and DeepFR. The Top 1 sensitivity of FSD_XGBoost increases to 74.8% at the fold level, which is similar to 9% higher than that of the state-of-the-art method. Together, the results suggest that fold-specific features extracted by different DL methods complement with each other, and their combination can further improve fold recognition at the fold level. The implemented web server of FoldTR and benchmark datasets are publicly available at http://csbio.njust.edu.cn/bioinf/foldtr/.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Improving protein fold recognition by random forest
    Jo, Taeho
    Cheng, Jianlin
    BMC BIOINFORMATICS, 2014, 15
  • [22] Deep Learning Ensemble for Melanoma Recognition
    Osowski, Stanislaw
    Les, Tomasz
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [23] Deep Learning Proteins using a Triplet-BERT network
    Lennox, Mark
    Robertson, Neil
    Devereux, Barry
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 4341 - 4347
  • [24] Improving a Deep Learning based RGB-D Object Recognition Model by Ensemble Learning
    Aakerberg, Andreas
    Nasrollahi, Kamal
    Heder, Thomas
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [25] Margin-based ensemble classifier for protein fold recognition
    Yang, Tao
    Kecman, Vojislav
    Cao, Longbing
    Zhang, Chengqi
    Huang, Joshua Zhexue
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 12348 - 12355
  • [26] Deep Ensemble Learning for Human Activity Recognition Using Smart hone
    Zhu, Ran
    Xiao, Zhuoling
    Cheng, Mo
    Zhou, Liang
    Yan, Bo
    Lin, Shuisheng
    Wen, HongKai
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [27] Virtual facial expression recognition using deep CNN with ensemble learning
    Venkata Rami Reddy Chirra
    Srinivasulu Reddy Uyyala
    Venkata Krishna Kishore Kolli
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 10581 - 10599
  • [28] Virtual facial expression recognition using deep CNN with ensemble learning
    Chirra, Venkata Rami Reddy
    Uyyala, Srinivasulu Reddy
    Kolli, Venkata Krishna Kishore
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (12) : 10581 - 10599
  • [29] DEEP NETWORK ENSEMBLE FOR SURVEILLANCE FACE RECOGNITION
    Shi, Hailin
    Zhu, Xiangyu
    Liao, Shengcai
    Lei, Zhen
    Li, Stan Z.
    IEEE INTELLIGENT SYSTEMS, 2018, 33 (03) : 50 - 53
  • [30] Emotion Recognition on Multimodal with Deep Learning and Ensemble
    Dharma, David Adi
    Zahra, Amalia
    International Journal of Advanced Computer Science and Applications, 2022, 13 (12): : 656 - 663