Use of 2D FFT and DTW in Protein Sequence Comparison

被引:1
|
作者
Pal, Jayanta [1 ,2 ]
Ghosh, Soumen [1 ]
Maji, Bansibadan [1 ]
Bhattacharya, Dilip Kumar [3 ]
机构
[1] Natl Inst Technol, Dept ECE, Durgapur, India
[2] Narula Inst Technol, Dept CSE, Kolkata, India
[3] Calcutta Univ, Dept Pure Math, Kolkata, India
来源
PROTEIN JOURNAL | 2024年 / 43卷 / 01期
关键词
Graphical representation; Two-quadrant Cartesian system; 2 dimensional fast Fourier transform (2D FFT); Dynamic time warping (DTW); Symmetric distance (SD); Phylogenetic tree; CHAOS GAME REPRESENTATION; GRAPHICAL REPRESENTATION; ANTICANCER PEPTIDES; SIMILARITY ANALYSIS; DNA-SEQUENCES; SIMILARITY/DISSIMILARITY; ALGORITHM; MODEL; CLASSIFICATION; ALIGNMENT;
D O I
10.1007/s10930-023-10160-2
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein sequence comparison remains a challenging work for the researchers owing to the computational complexity due to the presence of 20 amino acids compared with only four nucleotides in Genome sequences. Further, protein sequences of different species are of different lengths; it throws additional changes to the researchers to develop methods, specially alignment-free methods, to compare protein sequences. In this work, an efficient technique to compare protein sequences is developed by a graphical representation. First, the classified grouping of 20 amino acids with a cardinality of 4 based on polar class is considered to narrow down the representational range from 20 to 4. Then a unit vector technique based on a two-quadrant Cartesian system is proposed to provide a new two-dimensional graphical representation of the protein sequence. Now, two approaches are proposed to cope with the varying lengths of protein sequences from various species: one uses Dynamic Time Warping (DTW), while the other one uses a two-dimensional Fast Fourier Transform (2D FFT). Next, the effectiveness of these two techniques is analyzed using two evaluation criteria-quantitative measures based on symmetric distance (SD) and computational speed. An analysis is performed on five data sets of 9 ND4, 9 ND5, 9 ND6, 12 Baculovirus, and 24 TF proteins under the two methods. It is found that the FFT-based method produces the same results as DTW but in less computational time. It is found that the result of the proposed method agrees with the known biological reference. Further, the present method produces better clustering than the existing ones.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [1] Use of 2D FFT and DTW in Protein Sequence Comparison
    Jayanta Pal
    Soumen Ghosh
    Bansibadan Maji
    Dilip Kumar Bhattacharya
    The Protein Journal, 2024, 43 : 1 - 11
  • [2] Efficient 2D FFT implementation on mediaprocessors
    Mermer, C
    Kim, D
    Kim, Y
    PARALLEL COMPUTING, 2003, 29 (06) : 691 - 709
  • [3] Methods for Evaluating Pictures and Extracting Music by 2D DFA and 2D FFT
    Kawakatsu, Hidefumi
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015, 2015, 60 : 834 - 840
  • [4] A 2D Graphical Representation of Protein Sequence and Its Application
    Zou, Sai
    Peng, Hui
    Wang, Lei
    Wang, Junfeng
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) : 2369 - 2374
  • [5] A 2D graphical representation of protein sequence and its numerical characterization
    Wen, Jia
    Zhang, Yuyan
    CHEMICAL PHYSICS LETTERS, 2009, 476 (4-6) : 281 - 286
  • [6] A new 2D graphical representation of protein sequence and its application
    Wang, Lei
    Peng, Hui
    Zheng, Jinhua
    Qiu, Yanzi
    INTERNATIONAL JOURNAL OF BIOMATHEMATICS, 2015, 8 (05)
  • [7] Novel 2D graphic representation of protein sequence and its application
    College of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang 050043, China
    J. Fiber Bioeng. Informatics, 1 (23-33):
  • [8] 2D FFT OF ACOUSTIC SPECTRUM OF A FLUID IN A SPHERICAL CAVITY
    ANTHONY, LM
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1994, 207 : 274 - PHYS
  • [9] 分离矢量基2D FFT新算法
    茅一民
    通信学报, 1991, (05) : 70 - 75
  • [10] 2D test sequence generators
    Mrugalski, G
    Tyszer, J
    Rajski, J
    IEEE DESIGN & TEST OF COMPUTERS, 2003, 20 (01): : 51 - 59