Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks

被引:43
|
作者
Guo, Yanbu [1 ]
Wang, Bingyi [2 ]
Li, Weihua [1 ]
Yang, Bei [3 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, 2 North Cuihu Rd, Kunming 650091, Yunnan, Peoples R China
[2] Chinese Acad Forestry, Res Inst Resource Insects, Kunming 650224, Yunnan, Peoples R China
[3] Second Peoples Hosp Yunnan Prov, Cardiol Dept, 176 Qingnian Rd, Kunming 650021, Yunnan, Peoples R China
基金
美国国家科学基金会;
关键词
Bioinformatics; protein secondary structure predication (PSSP); convolutional neural networks (CNNs); recurrent neural networks (RNNs); long short-term memory (LSTM); gated recurrent units (GRUs);
D O I
10.1142/S021972001850021X
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein secondary structure prediction (PSSP) is an important research field in bioinformatics. The representation of protein sequence features could be treated as a matrix, which includes the amino-acid residue (time-step) dimension and the feature vector dimension. Common approaches to predict secondary structures only focus on the amino-acid residue dimension. However, the feature vector dimension may also contain useful information for PSSP. To integrate the information on both dimensions of the matrix, we propose a hybrid deep learning framework, two-dimensional convolutional bidirectional recurrent neural network (2C-BRNN), for improving the accuracy of 8-class secondary structure prediction. The proposed hybrid framework is to extract the discriminative local interactions between amino-acid residues by two-dimensional convolutional neural networks (2DCNNs), and then further capture long-range interactions between amino-acid residues by bidirectional gated recurrent units (BGRUs) or bidirectional long short-term memory (BLSTM). Specifically, our proposed 2C-BRNNs framework consists of four models: 2DConv-BGRUs, 2DCNN-BGRUs, 2DConv-BLSTM and 2DCNN-BLSTM. Among these four models, the 2DConv- models only contain two-dimensional (2D) convolution operations. Moreover, the 2DCNN- models contain 2D convolutional and pooling operations. Experiments are conducted on four public datasets. The experimental results show that our proposed 2DConv-BLSTM model performs significantly better than the benchmark models. Furthermore, the experiments also demonstrate that the proposed models can extract more meaningful features from the matrix of proteins, and the feature vector dimension is also useful for PSSP. The codes and datasets of our proposed methods are available at https://github.com/guoyanb/JBCB2018/.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Protein Secondary Structure Prediction Based on Two Dimensional Deep Convolutional Neural Networks
    Liu, Yihui
    Cheng, Jinyong
    Ma, Yuming
    Chen, Yehong
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1995 - 1999
  • [2] Two-Dimensional Convolutional Recurrent Neural Networks for Speech Activity Detection
    Vafeiadis, Anastasios
    Fanioudakis, Eleftherios
    Potamitis, Ilyas
    Votis, Konstantinos
    Giakoumis, Dimitrios
    Tzovaras, Dimitrios
    Chen, Liming
    Hamzaoui, Raouf
    [J]. INTERSPEECH 2019, 2019, : 2045 - 2049
  • [3] RNA secondary structure prediction with convolutional neural networks
    Mehdi Saman Booy
    Alexander Ilin
    Pekka Orponen
    [J]. BMC Bioinformatics, 23
  • [4] RNA secondary structure prediction with convolutional neural networks
    Booy, Mehdi Saman
    Ilin, Alexander
    Orponen, Pekka
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [5] Cascaded bidirectional recurrent neural networks for protein secondary structure prediction
    Chen, Jinmiao
    Chaudhari, Narendra S.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (04) : 572 - 582
  • [6] IGPRED: Combination of convolutional neural and graph convolutional networks for protein secondary structure prediction
    Gormez, Yasin
    Sabzekar, Mostafa
    Aydin, Zafer
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (10) : 1277 - 1288
  • [7] Rainfall Prediction using Spatial Convolutional Neural Networks and Recurrent Neural Networks
    Lestari, Nadia Dwi Puji
    Djamal, Esmeralda Contessa
    [J]. 2022 International Conference on Data Science and Its Applications, ICoDSA 2022, 2022, : 12 - 17
  • [8] Rainfall Prediction using Spatial Convolutional Neural Networks and Recurrent Neural Networks
    Lestari, Nadia Dwi Puji
    Djamal, Esmeralda Contessa
    [J]. 2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 12 - 17
  • [9] Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction
    Torrisi, Mirko
    Kaleel, Manaz
    Pollastri, Gianluca
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [10] Deeper Profiles and Cascaded Recurrent and Convolutional Neural Networks for state-of-the-art Protein Secondary Structure Prediction
    Mirko Torrisi
    Manaz Kaleel
    Gianluca Pollastri
    [J]. Scientific Reports, 9