Efficient algorithms for protein sequence design and the analysis of certain evolutionary fitness landscapes

被引:5
|
作者
Kleinberg, JM [1 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
inverse protein folding; protein sequence design; network flow algorithms; combinatorial optimization; evolutionary fitness landscapes;
D O I
10.1089/106652799318346
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein sequence design is a natural inverse problem to protein structure prediction: given a target structure in three dimensions, we wish to design an amino acid sequence that is likely fold to it. A model of Sun, Brem, Chan, and Dill casts this problem as an optimization on a space of sequences of hydrophobic (H) and polar (P) monomers; the goal is to find a sequence that achieves a dense hydrophobic core with few solvent-exposed hydrophobic residues. Sun et nl. developed a heuristic method to search the space of sequences, without a guarantee of optimality or near-optimality; Hart subsequently raised the computational tractability of constructing an optimal sequence in this model as an open question. Here we resolve this question by providing an efficient algorithm to construct optimal sequences; our algorithm has a polynomial running time, and performs very efficiently in practice. We illustrate the implementation of our method on structures drawn from the Protein Data Bank. We also consider extensions of the model to larger amino acid alphabets, as a way to overcome the limitations of the binary H/P alphabet. We show that for a natural class of arbitrarily large alphabets, it remains possible to design optimal sequences efficiently. Finally, we analyze some of the consequences of this sequence design model for the study of evolutionary fitness landscapes. A given target structure may have many sequences that are optimal in the model of Sun et al.; following a notion raised by the work of J. Maynard Smith, we can ask whether these optimal sequences are "connected" by successive point mutations. We provide a polynomial-time algorithm to decide this connectedness property, relative to a given target structure. We develop the algorithm by first solving an analogous problem expressed in terms of submodular functions, a fundamental object of study in combinatorial optimization.
引用
收藏
页码:387 / 404
页数:18
相关论文
共 50 条
  • [21] Design and analysis of migration in parallel evolutionary algorithms
    Jörg Lässig
    Dirk Sudholt
    [J]. Soft Computing, 2013, 17 : 1121 - 1144
  • [22] Design and analysis of migration in parallel evolutionary algorithms
    Laessig, Joerg
    Sudholt, Dirk
    [J]. SOFT COMPUTING, 2013, 17 (07) : 1121 - 1144
  • [23] Evolutionary landscapes derived from amino acid sequence space serve as predictive models of functional phenotypes in protein design
    Morcos, Faruck
    [J]. BIOPHYSICAL JOURNAL, 2022, 121 (03) : 46 - 46
  • [24] Analysis of Evolutionary Algorithms on Fitness Function With Time-Linkage Property
    Zheng, Weijie
    Chen, Huanhuan
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2021, 25 (04) : 696 - 709
  • [25] Design of energy efficient RAL system using evolutionary algorithms
    Nilakantan, Mukund J.
    Ponnambalam, S. G.
    Jawahar, N.
    [J]. ENGINEERING COMPUTATIONS, 2016, 33 (02) : 580 - 602
  • [26] Potts Hamiltonian models of protein co-variation, free energy landscapes, and evolutionary fitness
    Levy, Ronald M.
    Haldane, Allan
    Flynn, William F.
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 43 : 55 - 62
  • [27] Analysis of Multiobjective Evolutionary Algorithms on Fitness Function With Time-Linkage Property
    Yang, Tianyi
    Zhou, Yuren
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) : 837 - 843
  • [28] Efficient design of hybrid renewable energy systems using evolutionary algorithms
    Bernal-Agustin, Jose L.
    Dufo-Lopez, Rodolfo
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2009, 50 (03) : 479 - 489
  • [29] Efficient Forest Data Structure for Evolutionary Algorithms Applied to Network Design
    Delbem, Alexandre C. B.
    de Lima, Telma W.
    Telles, Guilherme P.
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2012, 16 (06) : 829 - 846
  • [30] Bridging the physical scales in evolutionary biology: from protein sequence space to fitness of organisms and populations
    Bershtein, Shimon
    Serohijos, Adrian W. R.
    Shakhnovich, Eugene I.
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 42 : 31 - 40