Sequence alignment using machine learning for accurate template-based protein structure prediction

被引:15
|
作者
Makigaki, Shuichiro [1 ]
Ishida, Takashi [1 ]
机构
[1] Tokyo Inst Technol, Sch Comp, Dept Comp Sci, Meguro Ku, Tokyo 1528550, Japan
关键词
DATABASE;
D O I
10.1093/bioinformatics/btz483
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Template-based modeling, the process of predicting the tertiary structure of a protein by using homologous protein structures, is useful if good templates can be found. Although modern homology detection methods can find remote homologs with high sensitivity, the accuracy of template-based models generated from homology-detection-based alignments is often lower than that from ideal alignments. Results In this study, we propose a new method that generates pairwise sequence alignments for more accurate template-based modeling. The proposed method trains a machine learning model using the structural alignment of known homologs. It is difficult to directly predict sequence alignments using machine learning. Thus, when calculating sequence alignments, instead of a fixed substitution matrix, this method dynamically predicts a substitution score from the trained model. We evaluate our method by carefully splitting the training and test datasets and comparing the predicted structure's accuracy with that of state-of-the-art methods. Our method generates more accurate tertiary structure models than those produced from alignments obtained by other methods. Availability and implementation https://github.com/shuichiro-makigaki/exmachina. Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:104 / 111
页数:8
相关论文
共 50 条
  • [1] Sequence Alignment Using Machine Learning for Accurate Template-based Protein Structure Prediction
    Makigaki, Shuichiro
    Ishida, Takashi
    [J]. BIO-PROTOCOL, 2020, 10 (09):
  • [2] Improvement of template-based protein structure prediction by using chimera alignment
    Makigaki, Shuichiro
    Ishida, Takashi
    [J]. PROCEEDINGS OF 2018 8TH INTERNATIONAL CONFERENCE ON BIOSCIENCE, BIOCHEMISTRY AND BIOINFORMATICS (ICBBB 2018), 2018, : 32 - 37
  • [3] Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
    Shang, Lei
    Gardner, David P.
    Xu, Weijia
    Cannone, Jamie J.
    Miranker, Daniel P.
    Ozer, Stuart
    Gutell, Robin R.
    [J]. BMC SYSTEMS BIOLOGY, 2013, 7
  • [4] Template-based prediction of protein structure with deep learning
    Zhang, Haicang
    Shen, Yufeng
    [J]. BMC GENOMICS, 2020, 21 (Suppl 11)
  • [5] Template-based prediction of protein structure with deep learning
    Haicang Zhang
    Yufeng Shen
    [J]. BMC Genomics, 21
  • [6] Deep template-based protein structure prediction
    Wu, Fandi
    Xu, Jinbo
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (05)
  • [7] Accurate prediction of Snare Protein Sequence using Machine Learning
    Talpur, Dani Bux
    Shaikh, Salahuddin
    Khowaja, Ashfaque
    Adnan, Saifullah
    Ghulam, Ali
    [J]. BIOSCIENCE RESEARCH, 2022, 19 (03): : 1414 - 1422
  • [8] Effect of using suboptimal alignments in template-based protein structure prediction
    Chen, Hao
    Kihara, Daisuke
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (01) : 315 - 334
  • [9] Comparison of Full and Interface Structure Alignment in Template-Based Protein Docking
    Chakravarty, Devlina
    [J]. BIOPHYSICAL JOURNAL, 2018, 114 (03) : 576A - 576A
  • [10] Template-based prediction of protein function
    Petrey, Donald
    Chen, T. Scott
    Deng, Lei
    Garzon, Jose Ignacio
    Hwang, Howook
    Lasso, Gorka
    Lee, Hunjoong
    Silkov, Antonina
    Honig, Barry
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2015, 32 : 33 - 38