Measuring the Similarity of Protein Structures Using Image Compression Algorithms

被引:0
|
作者
Hayashida, Morihiro [1 ]
Akutsu, Tatsuya [1 ]
机构
[1] Kyoto Univ, Bioinformat Ctr, Inst Chem Res, Uji, Kyoto 6110011, Japan
来源
关键词
image compression; universal similarity metric; protein structure comparison; STRUCTURE ALIGNMENT; CONTACT MAPS; SEQUENCE; CLASSIFICATION; DISTANCE; REPRESENTATION; METHODOLOGY; INFORMATION; MATRICES; DATABASE;
D O I
10.1587/transinf.E94.D.2468
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For measuring the similarity of biological sequences and structures such as DNA sequences, protein sequences, and tertiary structures, several compression-based methods have been developed. However, they are based on compression algorithms only for sequential data. For instance, protein structures can be represented by two-dimensional distance matrices. Therefore, it is expected that image compression is useful for measuring the similarity of protein structures because image compression algorithms compress data horizontally and vertically. This paper proposes series of methods for measuring the similarity of protein structures. In the methods, an original protein structure is transformed into a distance matrix, which is regarded as a two-dimensional image. Then, the similarity of two protein structures is measured by a kind of compression ratio of the concatenated image. We employed several image compression algorithms, PEG, GIF, PNG, IFS, and SPC. Since SPC often gave better results among the other image compression methods, and it is simple and easy to be modified, we modified SPC and obtained MSPC. We applied the proposed methods to clustering of protein structures, and performed Receiver Operating Characteristic (ROC) analysis. The results of computational experiments suggest that MSPC has the best performance among existing compression-based methods. We also present some theoretical results on the time complexity and Kolmogorov complexity of image compression-based protein structure comparison.
引用
收藏
页码:2468 / 2478
页数:11
相关论文
共 50 条
  • [1] Measuring the similarity of protein structures by means of the universal similarity metric
    Krasnogor, N
    Pelta, DA
    BIOINFORMATICS, 2004, 20 (07) : 1015 - 1021
  • [2] Measuring image similarity using the geometrical distribution of image contents
    Guo, F
    Jin, JS
    Feng, DG
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 1108 - 1112
  • [3] Image Similarity Using Sparse Representation and Compression Distance
    Guha, Tanaya
    Ward, Rabab K.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (04) : 980 - 987
  • [4] Image segmentation algorithms based on information compression and graph structures
    Department of Reliability-based Information Systems Engineering, Kagawa University, Takamatsu City, Kagawa 761-0396, Japan
    不详
    IEEE Int. Conf. Mechatronics Autom., ICMA, 2009, (95-100):
  • [5] Image Segmentation Algorithms Based on Information Compression and Graph Structures
    Vachkov, Gancho
    Ishihara, Hidenori
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 95 - +
  • [6] Polynomial algorithms for protein similarity search for restricted mRNA structures
    Gurski, Frank
    INFORMATION PROCESSING LETTERS, 2008, 105 (05) : 170 - 176
  • [7] Intelligent image correlation using genetic algorithms for measuring surface deformations in the autonomous inspection of structures
    Mahajan, A
    Pilch, A
    Chu, TC
    PROCEEDINGS OF THE 2000 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2000, : 460 - 461
  • [8] Measuring the Similarity of Files by Data Compression
    Scholnast, Hubert
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 360 - 360
  • [9] Image compression and encryption using tree structures
    Li, XB
    Knipe, J
    Cheng, H
    PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1253 - 1259
  • [10] Digital image compression by using intelligence swarm algorithms
    Ramo, Ramadan Mahmood
    Abd Dawwod, Suhair
    INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (02): : 785 - 794