Measuring Similarity among Protein Sequences Using a New Descriptor

Cited by: 7
Authors
Abo-Elkhier, Mervat M. [1]
Abd Elwahaab, Marwa A. [1]
Abo El Maaty, Moheb I. [1]
Affiliations
[1] Mansoura Univ, Dept Engn Math & Phys, Fac Engn, Mansoura 35516, Egypt
Keywords
2-D GRAPHICAL REPRESENTATION; PHYSICOCHEMICAL PROPERTIES; DNA-SEQUENCES; ALIGNMENT; SEARCH; 2D;
DOI
10.1155/2019/2796971
Chinese Library Classification (CLC) codes
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Comparing protein sequences by similarity is a fundamental task in today's biomedical research. With advances in sequencing technologies, the number of protein sequences in public databases is growing exponentially. The most widely used sequence comparison methods are alignment-based. They generally give excellent results when the sequences under study are closely related, but they are time-consuming. Herein, a new alignment-free method is introduced. Our technique depends on a new graphical representation and a new descriptor. The graphical representation provides a simple way to visualize protein sequences. The descriptor compresses the primary sequence into a single vector composed of only two values. Our approach gives good results with both short and long sequences in little computation time. It is applied to nine beta globin, nine ND5 (NADH dehydrogenase subunit 5), and 24 spike protein sequences. Correlation and significance analyses are also introduced to compare our similarity/dissimilarity results with the results of other approaches and with sequence homology.
Pages: 10