GOSSIP: a method for fast and accurate global alignment of protein structures

被引:3
|
作者
Kifer, I. [1 ]
Nussinov, R. [2 ]
Wolfson, H. J. [1 ]
机构
[1] Tel Aviv Univ, Raymond & Beverly Sackler Fac Exact Sci, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sackler Fac Med, Sackler Inst Mol Med, Dept Human Mol Genet & Biochem, IL-69978 Tel Aviv, Israel
基金
以色列科学基金会;
关键词
SEQUENCE; CLASSIFICATION; DATABASE; SIMILARITY; SEARCH; CATH;
D O I
10.1093/bioinformatics/btr044
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The database of known protein structures (PDB) is increasing rapidly. This results in a growing need for methods that can cope with the vast amount of structural data. To analyze the accumulating data, it is important to have a fast tool for identifying similar structures and clustering them by structural resemblance. Several excellent tools have been developed for the comparison of protein structures. These usually address the task of local structure alignment, an important yet computationally intensive problem due to its complexity. It is difficult to use such tools for comparing a large number of structures to each other at a reasonable time. Results: Here we present GOSSIP, a novel method for a global all-against-all alignment of any set of protein structures. The method detects similarities between structures down to a certain cutoff (a parameter of the program), hence allowing it to detect similar structures at a much higher speed than local structure alignment methods. GOSSIP compares many structures in times which are several orders of magnitude faster than well-known available structure alignment servers, and it is also faster than a database scanning method. We evaluate GOSSIP both on a dataset of short structural fragments and on two large sequence-diverse structural benchmarks. Our conclusions are that for a threshold of 0.6 and above, the speed of GOSSIP is obtained with no compromise of the accuracy of the alignments or of the number of detected global similarities.
引用
收藏
页码:925 / 932
页数:8
相关论文
共 50 条
  • [1] Fast and accurate alignment of multiple protein networks
    Kalaev, Maxim
    Bafna, Vineet
    Sharan, Roded
    [J]. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2008, 4955 : 246 - +
  • [2] Fast and Accurate Alignment of Multiple Protein Networks
    Kalaev, Maxim
    Bafna, Vineet
    Sharan, Roded
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2009, 16 (08) : 989 - 999
  • [3] HubAlign: an accurate and efficient method for global alignment of protein-protein interaction networks
    Hashemifar, Somaye
    Xu, Jinbo
    [J]. BIOINFORMATICS, 2014, 30 (17) : I438 - I444
  • [4] The use of a conformational alphabet for fast alignment of protein structures
    Zheng, Wei-Mou
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, 2008, 4983 : 331 - 342
  • [5] A fast approach to global alignment of protein-protein interaction networks
    Kollias G.
    Sathe M.
    Mohammadi S.
    Grama A.
    [J]. BMC Research Notes, 6 (1)
  • [6] FAMSA: Fast and accurate multiple sequence alignment of huge protein families
    Deorowicz, Sebastian
    Debudaj-Grabysz, Agnieszka
    Gudys, Adam
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [7] Fast protein fold recognition and accurate sequence-structure alignment
    Zimmer, R
    Thiele, R
    [J]. BIOINFORMATICS, 1997, 1278 : 137 - 146
  • [8] FAMSA: Fast and accurate multiple sequence alignment of huge protein families
    Sebastian Deorowicz
    Agnieszka Debudaj-Grabysz
    Adam Gudyś
    [J]. Scientific Reports, 6
  • [9] A fast, easy, accurate method for protein quantitation
    Beaudet, Matthew P.
    Ahnert, Nancy
    Dallwig, Jason A.
    Goodman, Terrie
    Thomas, Jerald A.
    [J]. FASEB JOURNAL, 2007, 21 (06): : A1006 - A1007
  • [10] A method for simultaneous alignment of multiple protein structures
    Shatsky, M
    Nussinov, R
    Wolfson, HJ
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (01) : 143 - 156