Towards index-based similarity search for protein structure databases

被引:12
|
作者
Çamoglu, O [1 ]
Kahveci, T [1 ]
Singh, AK [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
protein structures; feature vectors; indexing; dataset join;
D O I
10.1109/CSB.2003.1227314
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose two methods for finding similarities in protein structure databases. Our techniques extract feature vectors on triplets of SSEs (Secondary Structure Elements) of proteins. These feature vectors are then indexed using a multidimensional index structure. Our first technique cone siders the problem of finding proteins similar to a given query protein in a protein dataset. This technique quickly finds promising proteins using the index structure. These proteins are then aligned to the query protein using a popular pairwise alignment tool such as VAST We also develop a novel statistical model to estimate the goodness of a match using the SSEs. Our second technique considers the problem of joining two protein datasets to find an all-to-all similarity. Experimental results show that our techniques improve the pruning time of VAST 3 to 3.5 times while keeping the sensitivity similar.
引用
收藏
页码:148 / 158
页数:11
相关论文
共 50 条
  • [41] Fast similarity search in string databases
    Sheu, S
    Chang, A
    Huang, W
    19TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS: AINA 2005, 2005, : 617 - 622
  • [42] Similarity Search in Fuzzy Object Databases
    Uskat, Diana
    Emrich, Tobias
    Zuefle, Andreas
    Schmid, Klaus Arthur
    Bernecker, Thomas
    Renz, Matthias
    PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [43] Similarity Search in Animal Sound Databases
    Bardeli, Rolf
    IEEE TRANSACTIONS ON MULTIMEDIA, 2009, 11 (01) : 68 - 76
  • [44] Multiresolution similarity search in image databases
    Martin Heczko
    Alexander Hinneburg
    Daniel Keim
    Markus Wawryniuk
    Multimedia Systems, 2004, 10 : 28 - 40
  • [45] Index-based hyperlinks
    Hartman, JH
    Proebsting, TA
    Sundaram, R
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1997, 29 (8-13): : 1129 - 1135
  • [46] Index-Based Network Aligner of Protein-Protein Interaction Networks
    Elmsallati, Ahed
    Msalati, Abdulghani
    Kalita, Jugal
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) : 330 - 336
  • [47] An inverted index-based buffered search algorithm for mobile navigation services
    Kwon, Dongseop
    Choi, Wonik
    Lee, Sangjun
    PROCEEDINGS OF FUTURE GENERATION COMMUNICATION AND NETWORKING, MAIN CONFERENCE PAPERS, VOL 1, 2007, : 487 - +
  • [48] An access structure for similarity-based fuzzy databases
    Yazici, A
    Cibiceli, D
    INFORMATION SCIENCES, 1999, 115 (1-4) : 137 - 163
  • [49] An alphabetic code based atomic level molecular similarity search in databases
    Saranya, Nallusamy
    Selvaraj, Samuel
    BIOINFORMATION, 2012, 8 (11) : 498 - 503
  • [50] Prediction of Customers' Needs: An Approach Based on Similarity Search in Transactions Databases
    Hanyf, Youssef
    Silkan, Hassan
    2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,