Measuring Similarity among Protein Sequences Using a New Descriptor

被引:7
|
作者
Abo-Elkhier, Mervat M. [1 ]
Abd Elwahaab, Marwa A. [1 ]
Abo El Maaty, Moheb I. [1 ]
机构
[1] Mansoura Univ, Dept Engn Math & Phys, Fac Engn, Mansoura 35516, Egypt
关键词
2-D GRAPHICAL REPRESENTATION; PHYSICOCHEMICAL PROPERTIES; DNA-SEQUENCES; ALIGNMENT; SEARCH; 2D;
D O I
10.1155/2019/2796971
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The comparison of protein sequences according to similarity is a fundamental aspect of today's biomedical research. With the developments of sequencing technologies, a large number of protein sequences increase exponentially in the public databases. Famous sequences' comparison methods are alignment based. They generally give excellent results when the sequences under study are closely related and they are time consuming. Herein, a new alignment-free method is introduced. Our technique depends on a new graphical representation and descriptor. The graphical representation of protein sequence is a simple way to visualize protein sequences. The descriptor compresses the primary sequence into a single vector composed of only two values. Our approach gives good results with both short and long sequences within a little computation time. It is applied on nine beta globin, nine ND5 (NADH dehydrogenase subunit 5), and 24 spike protein sequences. Correlation and significance analyses are also introduced to compare our similarity/dissimilarity results with others' approaches, results, and sequence homology.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] CLUSS: Clustering of protein sequences based on a new similarity measure
    Kelil, Abdellali
    Wang, Shengrui
    Brzezinski, Ryszard
    Fleury, Alain
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [12] CLUSS: Clustering of protein sequences based on a new similarity measure
    Abdellali Kelil
    Shengrui Wang
    Ryszard Brzezinski
    Alain Fleury
    BMC Bioinformatics, 8
  • [13] A novel descriptor of protein sequences and its application
    Yao, Yuhua
    Yan, Shoujiang
    Han, Jianning
    Dai, Qi
    He, Ping-an
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 347 : 109 - 117
  • [14] Measuring the Similarity of Protein Structures Using Image Compression Algorithms
    Hayashida, Morihiro
    Akutsu, Tatsuya
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (12): : 2468 - 2478
  • [15] Similarity analysis of protein sequences based on a new graphical representation method
    Zhang, Yuyan
    Wen, Jia
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2018, 18 (03) : 193 - 208
  • [16] ADACT: a tool for analysing (dis)similarity among nucleotide and protein sequences using minimal and relative absent words
    Akon, Mujtahid
    Akon, Muntashir
    Kabir, Mohimenul
    Rahman, M. Saifur
    Rahman, M. Sohel
    BIOINFORMATICS, 2021, 37 (10) : 1468 - 1470
  • [17] Analysis of similarity/dissimilarity of protein sequences
    Yao, Yu-Hua
    Dai, Qi
    Li, Chun
    He, Ping-An
    Nan, Xu-Ying
    Zhang, Yao-Zhou
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 73 (04) : 864 - 871
  • [18] Measuring correlations in protein sequences
    Weiss, O
    Herzel, H
    ZEITSCHRIFT FUR PHYSIKALISCHE CHEMIE-INTERNATIONAL JOURNAL OF RESEARCH IN PHYSICAL CHEMISTRY & CHEMICAL PHYSICS, 1998, 204 : 183 - 197
  • [19] A New Local Self-similarity Descriptor Based on Structural Similarity Index
    Yang Hongbo
    Hou Xia
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 615 - +
  • [20] Measuring the similarity of protein structures by means of the universal similarity metric
    Krasnogor, N
    Pelta, DA
    BIOINFORMATICS, 2004, 20 (07) : 1015 - 1021