DeepSF: deep convolutional neural network for mapping protein sequences to folds

被引:106
|
作者
Hou, Jie [1 ]
Adhikari, Badri [2 ]
Cheng, Jianlin [1 ,3 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[2] Univ Missouri St Louis, Dept Math & Comp Sci, St Louis, MO 63121 USA
[3] Univ Missouri, Inst Informat, Columbia, MO 65211 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
SECONDARY STRUCTURE; HOMOLOGY DETECTION; RECOGNITION; PREDICTION; DATABASE; CLASSIFICATION; CATH; SCOP;
D O I
10.1093/bioinformatics/btx780
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein fold recognition is an important problem in structural bioinformatics. Almost all traditional fold recognition methods use sequence (homology) comparison to indirectly predict the fold of a target protein based on the fold of a template protein with known structure, which cannot explain the relationship between sequence and fold. Only a few methods had been developed to classify protein sequences into a small number of folds due to methodological limitations, which are not generally useful in practice. Results: We develop a deep 1D-convolution neural network (DeepSF) to directly classify any protein sequence into one of 1195 known folds, which is useful for both fold recognition and the study of sequence-structure relationship. Different from traditional sequence alignment (comparison) based methods, our method automatically extracts fold-related features from a protein sequence of any length and maps it to the fold space. We train and test our method on the datasets curated from SCOP1.75, yielding an average classification accuracy of 75.3%. On the independent testing dataset curated from SCOP2.06, the classification accuracy is 73.0%. We compare our method with a top profile-profile alignment method-HHSearch on hard template-based and template-free modeling targets of CASP9-12 in terms of fold recognition accuracy. The accuracy of our method is 12.63-26.32% higher than HHSearch on template-free modeling targets and 3.39-17.09% higher on hard template-based modeling targets for top 1, 5 and 10 predicted folds. The hidden features extracted from sequence by our method is robust against sequence mutation, insertion, deletion and truncation, and can be used for other protein pattern recognition problems such as protein clustering, comparison and ranking.
引用
收藏
页码:1295 / 1303
页数:9
相关论文
共 50 条
  • [31] Deep Convolutional and LSTM Neural Network Architectures on Leap Motion Hand Tracking Data Sequences
    Kritsis, Kosmas
    Kaliakatsos-Papakostas, Maximos
    Katsouros, Vassilis
    Pikrakis, Aggelos
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [32] Deep Convolutional Generative Adversarial Network and Convolutional Neural Network for Smoke Detection
    Yin, Hang
    Wei, Yurong
    Liu, Hedan
    Liu, Shuangyin
    Liu, Chuanyun
    Gao, Yacui
    [J]. COMPLEXITY, 2020, 2020
  • [33] Deep Convolutional Generative Adversarial Network and Convolutional Neural Network for Smoke Detection
    Yin, Hang
    Wei, Yurong
    Liu, Hedan
    Liu, Shuangyin
    Liu, Chuanyun
    Gao, Yacui
    [J]. Liu, Shuangyin (hdlsyxlq@126.com), 1600, Hindawi Limited (2020):
  • [34] Obstacle Detection with Deep Convolutional Neural Network
    Yu, Hong
    Hong, Ruxia
    Huang, XiaoLei
    Wang, Zhengyou
    [J]. 2013 SIXTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2013, : 265 - 268
  • [35] Deep Convolutional Neural Network for Image Deconvolution
    Xu, Li
    Ren, Jimmy S. J.
    Liu, Ce
    Jia, Jiaya
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [36] DeepTAM: Deep Tracking and Mapping with Convolutional Neural Networks
    Zhou, Huizhong
    Ummenhofer, Benjamin
    Brox, Thomas
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) : 756 - 769
  • [37] Breeds Classification with Deep Convolutional Neural Network
    Zhang, Yicheng
    Gao, Jipeng
    Zhou, Haolin
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 145 - 151
  • [38] Numerosity representation in a deep convolutional neural network
    Zhou, Cihua
    Xu, Wei
    Liu, Yujie
    Xue, Zhichao
    Chen, Rui
    Zhou, Ke
    Liu, Jia
    [J]. JOURNAL OF PACIFIC RIM PSYCHOLOGY, 2021, 15
  • [39] Overview of Deep Convolutional Neural Network Pruning
    Li, Guang
    Liu, Fang
    Xia, Yuping
    [J]. 2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [40] Relative Attributes with Deep Convolutional Neural Network
    Kim, Dong-Jin
    Yoo, Donggeun
    Im, Sunghoon
    Kim, Namil
    Sirinukulwattana, Tharatch
    Kweon, In So
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2015, : 157 - 158