Measuring Similarity among Protein Sequences Using a New Descriptor

被引:7
|
作者
Abo-Elkhier, Mervat M. [1 ]
Abd Elwahaab, Marwa A. [1 ]
Abo El Maaty, Moheb I. [1 ]
机构
[1] Mansoura Univ, Dept Engn Math & Phys, Fac Engn, Mansoura 35516, Egypt
关键词
2-D GRAPHICAL REPRESENTATION; PHYSICOCHEMICAL PROPERTIES; DNA-SEQUENCES; ALIGNMENT; SEARCH; 2D;
D O I
10.1155/2019/2796971
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The comparison of protein sequences according to similarity is a fundamental aspect of today's biomedical research. With the developments of sequencing technologies, a large number of protein sequences increase exponentially in the public databases. Famous sequences' comparison methods are alignment based. They generally give excellent results when the sequences under study are closely related and they are time consuming. Herein, a new alignment-free method is introduced. Our technique depends on a new graphical representation and descriptor. The graphical representation of protein sequence is a simple way to visualize protein sequences. The descriptor compresses the primary sequence into a single vector composed of only two values. Our approach gives good results with both short and long sequences within a little computation time. It is applied on nine beta globin, nine ND5 (NADH dehydrogenase subunit 5), and 24 spike protein sequences. Correlation and significance analyses are also introduced to compare our similarity/dissimilarity results with others' approaches, results, and sequence homology.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Measuring the structural similarity among XML documents and DTDs
    Elisa Bertino
    Giovanna Guerrini
    Marco Mesiti
    Journal of Intelligent Information Systems, 2008, 30 : 55 - 92
  • [42] Measuring Similarity among Legal Court Case Documents
    Mandal, Arpan
    Chaki, Raktim
    Saha, Sarbajit
    Ghosh, Kripabandhu
    Pal, Arindam
    Ghosh, Saptarshi
    COMPUTE'17: PROCEEDINGS OF THE 10TH ANNUAL ACM INDIA COMPUTE CONFERENCE, 2017, : 1 - 9
  • [43] Measuring the structural similarity among XML documents and DTDs
    Bertino, Elisa
    Guerrini, Giovanna
    Mesiti, Marco
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2008, 30 (01) : 55 - 92
  • [44] Similarity of Football Players Using Passing Sequences
    Barbosa, Alberto
    Ribeiro, Pedro
    Dutra, Ines
    MACHINE LEARNING AND DATA MINING FOR SPORTS ANALYTICS, MLSA 2021, 2022, 1571 : 51 - 61
  • [45] New Model of Semantic Similarity Measuring in WordNet
    Zhou, Zili
    Wang, Yanna
    Gu, Junzhong
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 256 - +
  • [46] Measuring Similarity in Ontologies: A New Family of Measures
    Alsubait, Tahani
    Parsia, Bijan
    Sattler, Uli
    KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2014, 2014, 8876 : 13 - 25
  • [47] Clothing Similarity Estimation Using Dominant Color Descriptor and SSIM Index
    Czapiewski, Piotr
    Forczmanski, Pawel
    Okarma, Krzysztof
    Frejlichowski, Dariusz
    Hofman, Radoslaw
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 491 - 500
  • [48] Similarity analysis of protein sequences using a reduced k-mer amino acid model
    Wen, Jia
    Zhang, Yuyan
    Wang, Huanxu
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2020, 20 (01) : 45 - 60
  • [49] Keypoint Description Using Statistical Descriptor with Similarity-Invariant Regions
    El Rube, Ibrahim
    Alsharif, Sameer
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 42 (01): : 407 - 421
  • [50] DISASTER DAMAGE ASSESSMENT FOR BUILDINGS USING SELF-SIMILARITY DESCRIPTOR
    Kahraman, Fatih
    Imamoglu, Mumin
    Ates, Hasan F.
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 2711 - 2714