FUpred: detecting protein domains through deep-learning-based contact map prediction

被引:41
|
作者
Zheng, Wei [1 ]
Zhou, Xiaogen [1 ]
Wuyun, Qiqige [2 ]
Pearce, Robin [1 ]
Li, Yang [1 ,3 ]
Zhang, Yang [1 ,4 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48824 USA
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[4] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
STRUCTURAL CLASSIFICATION;
D O I
10.1093/bioinformatics/btaa217
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein domains are subunits that can fold and function independently. Correct domain boundary assignment is thus a critical step toward accurate protein structure and function analyses. There is, however, no efficient algorithm available for accurate domain prediction from sequence. The problem is particularly challenging for proteins with discontinuous domains, which consist of domain segments that are separated along the sequence. Results: We developed a new algorithm, FUpred, which predicts protein domain boundaries utilizing contact maps created by deep residual neural networks coupled with coevolutionary precision matrices. The core idea of the algorithm is to retrieve domain boundary locations by maximizing the number of intra-domain contacts, while minimizing the number of inter-domain contacts from the contact maps. FUpred was tested on a large-scale dataset consisting of 2549 proteins and generated correct single- and multi-domain classifications with a Matthew's correlation coefficient of 0.799, which was 19.1% (or 5.3%) higher than the best machine learning (or threading)-based method. For proteins with discontinuous domains, the domain boundary detection and normalized domain overlapping scores of FUpred were 0.788 and 0.521, respectively, which were 17.3% and 23.8% higher than the best control method. The results demonstrate a new avenue to accurately detect domain composition from sequence alone, especially for discontinuous, multi-domain proteins.
引用
收藏
页码:3749 / 3757
页数:9
相关论文
共 50 条
  • [1] Protein Contact Map Prediction Based On an Ensemble Learning Method
    Habibi, Narjes Khatoon
    Saraee, Mohammad Hossein
    2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 205 - 209
  • [2] Deep architectures for protein contact map prediction
    Di Lena, Pietro
    Nagata, Ken
    Baldi, Pierre
    BIOINFORMATICS, 2012, 28 (19) : 2449 - 2457
  • [3] Advances in Protein Contact Map Prediction Based on Machine Learning
    Xie, Jiang
    Ding, Wang
    Chen, Luonan
    Guo, Qiang
    Zhang, Wu
    MEDICINAL CHEMISTRY, 2015, 11 (03) : 265 - 270
  • [4] Deep-Learning-Based Approach for Prediction of Algal Blooms
    Zhang, Feng
    Wang, Yuanyuan
    Cao, Minjie
    Sun, Xiaoxiao
    Du, Zhenhong
    Liu, Renyi
    Ye, Xinyue
    SUSTAINABILITY, 2016, 8 (10)
  • [5] Deep-learning-based vehicle trajectory prediction: A review
    Yin, Chenhui
    Cecotti, Marco
    Auger, Daniel J.
    Fotouhi, Abbas
    Jiang, Haobin
    IET INTELLIGENT TRANSPORT SYSTEMS, 2025, 19 (01)
  • [6] Deep-Learning-based Cryptanalysis through Topic Modeling
    Kumar, Kishore
    Tanwar, Sarvesh
    Kumar, Shishir
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (01) : 12524 - 12529
  • [7] Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model
    Wang, Sheng
    Sun, Siqi
    Li, Zhen
    Zhang, Renyu
    Xu, Jinbo
    PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (01)
  • [8] Usefulness of Heat Map Explanations for Deep-Learning-Based Electrocardiogram Analysis
    Storas, Andrea M.
    Andersen, Ole Emil
    Lockhart, Sam
    Thielemann, Roman
    Gnesin, Filip
    Thambawita, Vajira
    Hicks, Steven A.
    Kanters, Jorgen K.
    Struemke, Inga
    Halvorsen, Pal
    Riegler, Michael A.
    DIAGNOSTICS, 2023, 13 (14)
  • [9] A deep-learning model for the prediction of protein domains
    Sato, Renta
    Ekimoto, Toru
    Yoshidome, Takashi
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 142A - 142A
  • [10] A Study of Deep-Learning-based Prediction Methods for Lossless Coding
    Schiopu, Ionut
    Huang, Hongyue
    Munteanu, Adrian
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 521 - 525