A general-purpose protein design framework based on mining sequence-structure relationships in known protein structures

被引:60
|
作者
Zhou, Jianfu [1 ]
Panaitiu, Alexandra E. [1 ]
Grigoryan, Gevorg [1 ,2 ]
机构
[1] Dartmouth Coll, Dept Comp Sci, Hanover, NH 03755 USA
[2] Dartmouth Coll, Dept Biol Sci, Hanover, NH 03755 USA
关键词
protein design; data-driven protein design; structure-based analysis; protein structure; structure search; EFFECTIVE ENERGY FUNCTION; DE-NOVO DESIGN; COMPUTATIONAL DESIGN; ALGORITHM; SEARCH; PREDICTION; INTERFACE; FRAGMENTS; SOFTWARE; REDESIGN;
D O I
10.1073/pnas.1908723117
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework one based on identifying and applying patterns of sequence-structure compatibility found in known proteins, rather than approximating them from models of interatomic interactions. We carry out extensive computational analyses and an experimental validation for our method. Our results strongly argue that the Protein Data Bank is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. Because our method is likely to have orthogonal strengths relative to existing techniques, it could represent an important step toward removing remaining barriers to robust CPD.
引用
收藏
页码:1059 / 1068
页数:10
相关论文
共 34 条
  • [11] Framework design of a general-purpose power market simulator based on multi-agent technology
    Liu, HJ
    Yuan, B
    Dai, HW
    Lin, JK
    2001 POWER ENGINEERING SOCIETY SUMMER MEETING, VOLS 1-3, CONFERENCE PROCEEDINGS, 2001, : 1478 - 1482
  • [12] Combinatorial and High-throughput Approaches to Evaluate Sequence-Structure Relationships in the Four Helix Bundle Protein Rop
    Sen, Shiladitya
    Magliery, Thomas
    PROTEIN SCIENCE, 2012, 21 : 144 - 144
  • [13] Alignment of multiple protein structures based on sequence and structure features
    Madhusudhan, M. S.
    Webb, Benjamin M.
    Marti-Renom, Marc A.
    Eswar, Narayanan
    Sali, Andrej
    PROTEIN ENGINEERING DESIGN & SELECTION, 2009, 22 (09): : 569 - 574
  • [14] Relationships between protein sequence and structure patterns based on residue contacts
    Selbig, J
    Argos, P
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1998, 31 (02): : 172 - 185
  • [15] Protein sequence design based on the topology of the native state structure
    Jha, Anupam Nath
    Ananthasuresh, G. K.
    Vishveshwara, Saraswathi
    JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (01) : 81 - 90
  • [16] PREDICTION OF PROTEIN-STRUCTURE BY EVALUATION OF SEQUENCE-STRUCTURE FITNESS - ALIGNING SEQUENCES TO CONTACT PROFILES DERIVED FROM 3-DIMENSIONAL STRUCTURES
    OUZOUNIS, C
    SANDER, C
    SCHARF, M
    SCHNEIDER, R
    JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (03) : 805 - 825
  • [17] Recurring sequence-structure motifs in (βα)8-barrel proteins and experimental optimization of a chimeric protein designed based on such motifs
    Wang, Jichao
    Zhang, Tongchuan
    Liu, Ruicun
    Song, Meilin
    Wang, Juncheng
    Hong, Jiong
    Chen, Quan
    Liu, Haiyan
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2017, 1865 (02): : 165 - 175
  • [18] MASTERS: A General Sequence-based MultiAgent System for Protein TERtiary Structure Prediction
    Lipinski-Paes, Thiago
    de Souza, Osmar Norberto
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2014, 306 : 45 - 59
  • [19] Design of sweet protein based sweeteners: Hints from structure-function relationships
    Rega, Michele Fortunato
    Di Monaco, Rossella
    Leone, Serena
    Donnarumma, Federica
    Spadaccini, Roberta
    Cavella, Silvana
    Picone, Delia
    FOOD CHEMISTRY, 2015, 173 : 1179 - 1186
  • [20] Mining super-secondary structure motifs from 3D protein structures: A sequence order independent approach
    Aung, Zeyar
    Li, Jinyan
    GENOME INFORMATICS 2007, VOL 19, 2007, 19 : 15 - +