Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning

被引:88
|
作者
Ma, Jianzhu [1 ]
Wang, Sheng [1 ]
Wang, Zhiyong [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL 60637 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
INVERSE COVARIANCE ESTIMATION; MUTUAL INFORMATION; GRAPHICAL MODELS; SEQUENCE; FOLD;
D O I
10.1093/bioinformatics/btv472
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein contact prediction is important for protein structure and functional study. Both evolutionary coupling (EC) analysis and supervised machine learning methods have been developed, making use of different information sources. However, contact prediction is still challenging especially for proteins without a large number of sequence homologs. Results: This article presents a group graphical lasso (GGL) method for contact prediction that integrates joint multi-family EC analysis and supervised learning to improve accuracy on proteins without many sequence homologs. Different from existing single-family EC analysis that uses residue coevolution information in only the target protein family, our joint EC analysis uses residue coevolution in both the target family and its related families, which may have divergent sequences but similar folds. To implement this, we model a set of related protein families using Gaussian graphical models and then coestimate their parameters by maximum-likelihood, subject to the constraint that these parameters shall be similar to some degree. Our GGL method can also integrate supervised learning methods to further improve accuracy. Experiments show that our method outperforms existing methods on proteins without thousands of sequence homologs, and that our method performs better on both conserved and family-specific contacts.
引用
收藏
页码:3506 / 3513
页数:8
相关论文
共 50 条
  • [1] Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning
    Ma, Jianzhu
    Wang, Sheng
    Wang, Zhiyong
    Xu, Jinbo
    [J]. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY (RECOMB 2015), 2015, 9029 : 218 - 221
  • [2] DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning
    Eickholt, Jesse
    Deng, Xin
    Cheng, Jianlin
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [3] DoBo: Protein domain boundary prediction by integrating evolutionary signals and machine learning
    Jesse Eickholt
    Xin Deng
    Jianlin Cheng
    [J]. BMC Bioinformatics, 12
  • [4] An Evolutionary Approach for Protein Contact Map Prediction
    Marquez Chamorro, Alfonso E.
    Divina, Federico
    Aguilar-Ruiz, Jesus S.
    Asencio Cortes, Gualberto
    [J]. EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, 2011, 6623 : 101 - 110
  • [5] Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning
    Adhikari, Badri
    Hou, Jie
    Cheng, Jianlin
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 84 - 96
  • [6] Combining Physicochemical and Evolutionary Information for Protein Contact Prediction
    Schneider, Michael
    Brock, Oliver
    [J]. PLOS ONE, 2014, 9 (10):
  • [7] Integrating Clustering and Supervised Learning for Categorical Data Analysis
    Maulik, Ujjwal
    Bandyopadhyay, Sanghamitra
    Saha, Indrajit
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2010, 40 (04): : 664 - 675
  • [8] Supervised Machine Learning for document analysis and prediction
    Ghany, Kareem Kamal A.
    Ayeldeen, Heba
    [J]. PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [9] Evolutionary Protein Contact Maps Prediction Based on Amino Acid Properties
    Marquez Chamorro, Alfonso E.
    Divina, Federico
    Aguilar-Ruiz, Jesus S.
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 303 - 310
  • [10] Integrating Action-aware Features for Saliency Prediction via Weakly Supervised Learning
    Feng, Jiaqi
    Li, Shuai
    Sui, Yunfeng
    Meng, Lingtong
    Zhu, Ce
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 974 - 979