Sequence alignment using machine learning for accurate template-based protein structure prediction

被引:15
|
作者
Makigaki, Shuichiro [1 ]
Ishida, Takashi [1 ]
机构
[1] Tokyo Inst Technol, Sch Comp, Dept Comp Sci, Meguro Ku, Tokyo 1528550, Japan
关键词
DATABASE;
D O I
10.1093/bioinformatics/btz483
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Template-based modeling, the process of predicting the tertiary structure of a protein by using homologous protein structures, is useful if good templates can be found. Although modern homology detection methods can find remote homologs with high sensitivity, the accuracy of template-based models generated from homology-detection-based alignments is often lower than that from ideal alignments. Results In this study, we propose a new method that generates pairwise sequence alignments for more accurate template-based modeling. The proposed method trains a machine learning model using the structural alignment of known homologs. It is difficult to directly predict sequence alignments using machine learning. Thus, when calculating sequence alignments, instead of a fixed substitution matrix, this method dynamically predicts a substitution score from the trained model. We evaluate our method by carefully splitting the training and test datasets and comparing the predicted structure's accuracy with that of state-of-the-art methods. Our method generates more accurate tertiary structure models than those produced from alignments obtained by other methods. Availability and implementation https://github.com/shuichiro-makigaki/exmachina. Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:104 / 111
页数:8
相关论文
共 50 条
  • [31] Template-based structure modeling of protein-protein interactions
    Szilagyi, Andras
    Zhang, Yang
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2014, 24 : 10 - 23
  • [32] (PS)2-v2: template-based protein structure prediction server
    Chen, Chih-Chieh
    Hwang, Jenn-Kang
    Yang, Jinn-Moon
    [J]. BMC BIOINFORMATICS, 2009, 10 : 366
  • [33] Empirical limits for template-based protein structure prediction: the CASP5 example
    Contreras-Moreira, B
    Ezkurdia, I
    Tress, ML
    Valencia, A
    [J]. FEBS LETTERS, 2005, 579 (05): : 1203 - 1207
  • [34] (PS)2-v2: template-based protein structure prediction server
    Chih-Chieh Chen
    Jenn-Kang Hwang
    Jinn-Moon Yang
    [J]. BMC Bioinformatics, 10
  • [35] Prediction and Modeling of Protein-Protein Interactions Using "Spotted" Peptides with a Template-Based Approach
    Gasbarri, Chiara
    Rosignoli, Serena
    Janson, Giacomo
    Boi, Dalila
    Paiardini, Alessandro
    [J]. BIOMOLECULES, 2022, 12 (02)
  • [36] Machine Learning for Sequence and Structure-Based Protein-Ligand Interaction Prediction
    Zhang, Yunjiang
    Li, Shuyuan
    Meng, Kong
    Sun, Shaorui
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (05) : 1456 - 1472
  • [37] Novel use of a genetic algorithm for protein structure prediction: Searching template and sequence alignment space
    Contreras-Moreira, B
    Fitzjohn, PW
    Offman, M
    Smith, GR
    Bates, PA
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 : 424 - 429
  • [38] Protein structure prediction using a combination of sequence-based alignment, constrained energy minimization, and structural alignment
    Standley, DM
    Eyrich, VA
    An, YL
    Pincus, DL
    Gunn, JR
    Friesner, RA
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, : 133 - 139
  • [39] Template-based software for accurate MEMS characterization
    Novak, E
    Krell, MB
    Browne, T
    [J]. RELIABILITY, TESTING, AND CHARACTERIZATION OF MEMS/MOEMS II, 2003, 4980 : 75 - 80
  • [40] DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches
    Liu, Rong
    Hu, Jianjun
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2013, 81 (11) : 1885 - 1899