GOSSIP: a method for fast and accurate global alignment of protein structures

被引:3
|
作者
Kifer, I. [1 ]
Nussinov, R. [2 ]
Wolfson, H. J. [1 ]
机构
[1] Tel Aviv Univ, Raymond & Beverly Sackler Fac Exact Sci, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sackler Fac Med, Sackler Inst Mol Med, Dept Human Mol Genet & Biochem, IL-69978 Tel Aviv, Israel
基金
以色列科学基金会;
关键词
SEQUENCE; CLASSIFICATION; DATABASE; SIMILARITY; SEARCH; CATH;
D O I
10.1093/bioinformatics/btr044
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The database of known protein structures (PDB) is increasing rapidly. This results in a growing need for methods that can cope with the vast amount of structural data. To analyze the accumulating data, it is important to have a fast tool for identifying similar structures and clustering them by structural resemblance. Several excellent tools have been developed for the comparison of protein structures. These usually address the task of local structure alignment, an important yet computationally intensive problem due to its complexity. It is difficult to use such tools for comparing a large number of structures to each other at a reasonable time. Results: Here we present GOSSIP, a novel method for a global all-against-all alignment of any set of protein structures. The method detects similarities between structures down to a certain cutoff (a parameter of the program), hence allowing it to detect similar structures at a much higher speed than local structure alignment methods. GOSSIP compares many structures in times which are several orders of magnitude faster than well-known available structure alignment servers, and it is also faster than a database scanning method. We evaluate GOSSIP both on a dataset of short structural fragments and on two large sequence-diverse structural benchmarks. Our conclusions are that for a threshold of 0.6 and above, the speed of GOSSIP is obtained with no compromise of the accuracy of the alignments or of the number of detected global similarities.
引用
收藏
页码:925 / 932
页数:8
相关论文
共 50 条
  • [31] FTIP - An Accurate and Efficient Method for Global Protein Surface Comparison
    Zhang, Yuan
    Sui, Xin
    Stagg, Scott M.
    Zhang, Jinfeng
    [J]. BIOPHYSICAL JOURNAL, 2020, 118 (03) : 501A - 501A
  • [32] Fast alignment and comparison of RNA structures
    Wiegels, Tim
    Bienert, Stefan
    Torda, Andrew E.
    [J]. BIOINFORMATICS, 2013, 29 (05) : 588 - 596
  • [33] A Fast Detecting Method for Clone Functions Using Global Alignment of Token Sequences
    Lee, Da-Young
    Ko, Uram
    Aitkazin, Ibrahim
    Park, SangUn
    Tak, Hae-Sung
    Cho, Hwan-Gue
    [J]. ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 17 - 22
  • [34] Kalign – an accurate and fast multiple sequence alignment algorithm
    Timo Lassmann
    Erik LL Sonnhammer
    [J]. BMC Bioinformatics, 6
  • [35] The GEM mapper: fast, accurate and versatile alignment by filtration
    Marco-Sola, Santiago
    Sammeth, Michael
    Guigo, Roderic
    Ribeca, Paolo
    [J]. NATURE METHODS, 2012, 9 (12) : 1185 - U76
  • [36] Fast and accurate global motion compensation
    Deniz, O.
    Bueno, G.
    Bermejo, E.
    Sukthankar, R.
    [J]. PATTERN RECOGNITION, 2011, 44 (12) : 2887 - 2901
  • [37] Kalign - an accurate and fast multiple sequence alignment algorithm
    Lassmann, T
    Sonnhammer, ELL
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [38] Fast and accurate reference-free alignment of subtomograms
    Chen, Yuxiang
    Pfeffer, Stefan
    Hrabe, Thomas
    Schuller, Jan Michael
    Foerster, Friedrich
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2013, 182 (03) : 235 - 245
  • [39] The GEM mapper: fast, accurate and versatile alignment by filtration
    Santiago Marco-Sola
    Michael Sammeth
    Roderic Guigó
    Paolo Ribeca
    [J]. Nature Methods, 2012, 9 : 1185 - 1188
  • [40] An Accurate and Fast Method to Inspect Rail Wear Based on Revised Global Registration
    Yang, Yue
    Liu, Long
    Yi, Bing
    Chen, Feng
    [J]. IEEE ACCESS, 2018, 6 : 57267 - 57278