Inverse statistical physics of protein sequences: a key issues review

Cited by: 130
Authors
Cocco, Simona [1, 2]
Feinauer, Christoph [3]
Figliuzzi, Matteo [3]
Monasson, Remi [2, 4]
Weigt, Martin [3]
Affiliations
[1] Sorbonne Univ UPMC, Ecole Normale Super, Lab Phys Stat, UMR 8549, CNRS, Paris, France
[2] Sorbonne Univ UPMC, PSL Res, Paris, France
[3] Sorbonne Univ, UPMC, Inst Biol Paris Seine, CNRS, Lab Biol Computat & Quantitat, UMR 7238, Paris, France
[4] Sorbonne Univ UPMC, Lab Phys Theor, Ecole Normale Super, UMR 8549, CNRS, Paris, France
Keywords
inverse problems; inverse Ising/Potts problem; statistical inference; protein sequence analysis; coevolution; protein structure prediction; protein-protein interaction; DIRECT-COUPLING ANALYSIS; COEVOLUTIONARY INFORMATION; STRUCTURE PREDICTION; RESIDUE COEVOLUTION; MOLECULAR-DYNAMICS; CONTACT PREDICTION; STRUCTURAL BASIS; HIV EVOLUTION; CO-VARIATION; FAMILIES;
DOI
10.1088/1361-6633/aa9965
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
In the course of evolution, proteins undergo important changes in their amino acid sequences, while their three-dimensional folded structure and their biological function remain remarkably conserved. Thanks to modern sequencing techniques, sequence data accumulate at an unprecedented pace. This provides large sets of so-called homologous, i.e. evolutionarily related, protein sequences, to which methods of inverse statistical physics can be applied. Using sequence data as the basis for the inference of Boltzmann distributions from samples of microscopic configurations or observables, it is possible to extract information about evolutionary constraints and thus protein function and structure. Here we give an overview of some biologically important questions and of how statistical-mechanics-inspired modeling approaches can help to answer them. Finally, we discuss some open questions, which we expect to be addressed over the coming years.
Pages: 17