Deep Ensemble Learning with Atrous Spatial Pyramid Networks for Protein Secondary Structure Prediction

被引:3
|
作者
Guo, Yuzhi [1 ]
Wu, Jiaxiang [2 ]
Ma, Hehuan [1 ]
Wang, Sheng [1 ]
Huang, Junzhou [1 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[2] Tencent, Al Lab, Shenzhen 508929, Peoples R China
关键词
deep learning; protein secondary structure prediction; learning representation; computational biology; NEURAL-NETWORKS; SEGMENTATION;
D O I
10.3390/biom12060774
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The secondary structure of proteins is significant for studying the three-dimensional structure and functions of proteins. Several models from image understanding and natural language modeling have been successfully adapted in the protein sequence study area, such as Long Short-term Memory (LSTM) network and Convolutional Neural Network (CNN). Recently, Gated Convolutional Neural Network (GCNN) has been proposed for natural language processing. It has achieved high levels of sentence scoring, as well as reduced the latency. Conditionally Parameterized Convolution (CondConv) is another novel study which has gained great success in the image processing area. Compared with vanilla CNN, CondConv uses extra sample-dependant modules to conditionally adjust the convolutional network. In this paper, we propose a novel Conditionally Parameterized Convolutional network (CondGCNN) which utilizes the power of both CondConv and GCNN. CondGCNN leverages an ensemble encoder to combine the capabilities of both LSTM and CondGCNN to encode protein sequences by better capturing protein sequential features. In addition, we explore the similarity between the secondary structure prediction problem and the image segmentation problem, and propose an ASP network (Atrous Spatial Pyramid Pooling (ASPP) based network) to capture fine boundary details in secondary structure. Extensive experiments show that the proposed method can achieve higher performance on protein secondary structure prediction task than existing methods on CB513, Casp11, CASP12, CASP13, and CASP14 datasets. We also conducted ablation studies over each component to verify the effectiveness. Our method is expected to be useful for any protein related prediction tasks, which is not limited to protein secondary structure prediction.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] The prediction of protein secondary structure with a Cascade Correlation Learning Architecture of neural networks
    F. Vivarelli
    P. Fariselli
    R. Casadio
    [J]. Neural Computing & Applications, 1997, 6 : 57 - 62
  • [22] Effect of simple ensemble methods on protein secondary structure prediction
    Bouziane, Hafida
    Messabih, Belhadri
    Chouarfia, Abdallah
    [J]. SOFT COMPUTING, 2015, 19 (06) : 1663 - 1678
  • [23] Fusion of BLAST and Ensemble of Classifiers for Protein Secondary Structure Prediction
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    [J]. 2020 33RD SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2020), 2020, : 308 - 315
  • [24] Effect of simple ensemble methods on protein secondary structure prediction
    Hafida Bouziane
    Belhadri Messabih
    Abdallah Chouarfia
    [J]. Soft Computing, 2015, 19 : 1663 - 1678
  • [25] Protein Secondary Structure Prediction Based on Two Dimensional Deep Convolutional Neural Networks
    Liu, Yihui
    Cheng, Jinyong
    Ma, Yuming
    Chen, Yehong
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1995 - 1999
  • [26] Protein-protein interactions prediction based on ensemble deep neural networks
    Zhang, Long
    Yu, Guoxian
    Xia, Dawen
    Wang, Jun
    [J]. NEUROCOMPUTING, 2019, 324 : 10 - 19
  • [27] Nested Densely Atrous Spatial Pyramid Pooling and Deep Dense Short Connection for Skeleton Detection
    Fu, Daoyong
    Zeng, Xiaofei
    Han, Songchen
    Lin, Hanren
    Li, Wei
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (01) : 75 - 84
  • [28] OneHotEncoding and LSTM-based deep learning models for protein secondary structure prediction
    Vamsidhar Enireddy
    C. Karthikeyan
    D. Vijendra Babu
    [J]. Soft Computing, 2022, 26 : 3825 - 3836
  • [29] OneHotEncoding and LSTM-based deep learning models for protein secondary structure prediction
    Enireddy, Vamsidhar
    Karthikeyan, C.
    Babu, D. Vijendra
    [J]. SOFT COMPUTING, 2022, 26 (08) : 3825 - 3836
  • [30] SENSDeep: An Ensemble Deep Learning Method for Protein–Protein Interaction Sites Prediction
    Engin Aybey
    Özgür Gümüş
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2023, 15 : 55 - 87