Use of 2D FFT and DTW in Protein Sequence Comparison

Cited by: 1
Authors
Pal, Jayanta [1, 2]
Ghosh, Soumen [1]
Maji, Bansibadan [1]
Bhattacharya, Dilip Kumar [3]
Affiliations
[1] Natl Inst Technol, Dept ECE, Durgapur, India
[2] Narula Inst Technol, Dept CSE, Kolkata, India
[3] Calcutta Univ, Dept Pure Math, Kolkata, India
Source
PROTEIN JOURNAL | 2024 / Vol. 43 / Issue 01
Keywords
Graphical representation; Two-quadrant Cartesian system; 2 dimensional fast Fourier transform (2D FFT); Dynamic time warping (DTW); Symmetric distance (SD); Phylogenetic tree; CHAOS GAME REPRESENTATION; GRAPHICAL REPRESENTATION; ANTICANCER PEPTIDES; SIMILARITY ANALYSIS; DNA-SEQUENCES; SIMILARITY/DISSIMILARITY; ALGORITHM; MODEL; CLASSIFICATION; ALIGNMENT;
DOI
10.1007/s10930-023-10160-2
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Protein sequence comparison remains a challenging task because of its computational complexity: protein sequences are built from 20 amino acids, compared with only four nucleotides in genome sequences. Further, protein sequences from different species differ in length, which poses an additional challenge for researchers developing methods, especially alignment-free methods, to compare them. In this work, an efficient technique for comparing protein sequences is developed using a graphical representation. First, the 20 amino acids are classified into 4 groups based on polar class, narrowing the representational range from 20 to 4. Then a unit vector technique based on a two-quadrant Cartesian system is proposed to provide a new two-dimensional graphical representation of the protein sequence. Two approaches are then proposed to cope with the varying lengths of protein sequences from various species: one uses Dynamic Time Warping (DTW), and the other uses a two-dimensional Fast Fourier Transform (2D FFT). Next, the effectiveness of these two techniques is analyzed using two evaluation criteria: quantitative measures based on symmetric distance (SD), and computational speed. The analysis is performed on five data sets (9 ND4, 9 ND5, 9 ND6, 12 Baculovirus, and 24 TF proteins) under both methods. The FFT-based method produces the same results as DTW in less computational time, and the results of the proposed method agree with the known biological reference. Further, the present method produces better clustering than existing ones.
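The pipeline outlined in the abstract (reduce to 4 classes, draw a 2D curve from unit vectors, then compare curves of unequal length with DTW or a 2D FFT) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the grouping, the vector directions, or the FFT-based distance, so the four polarity groups (a common textbook classification), the angles in `CLASS_ANGLES`, the zero-padding step, and the function names `curve`, `dtw_distance`, and `fft_distance` are all assumptions.

```python
import numpy as np

# Assumed four-group polarity classification (a common textbook grouping;
# the paper's exact grouping is not stated in the abstract).
POLAR_CLASSES = {
    "nonpolar": "AVLIPFWM",
    "polar": "GSTCYNQ",
    "acidic": "DE",
    "basic": "KRH",
}
# Hypothetical unit-vector angles in degrees, confined to quadrants I and II.
CLASS_ANGLES = {"nonpolar": 30.0, "polar": 60.0, "acidic": 120.0, "basic": 150.0}

RESIDUE_ANGLE = {
    aa: CLASS_ANGLES[name]
    for name, members in POLAR_CLASSES.items()
    for aa in members
}


def curve(seq):
    """Map a protein sequence to a 2D curve by cumulatively adding unit vectors."""
    theta = np.deg2rad([RESIDUE_ANGLE[aa] for aa in seq])
    steps = np.column_stack([np.cos(theta), np.sin(theta)])
    return np.cumsum(steps, axis=0)


def dtw_distance(a, b):
    """Classic O(len(a) * len(b)) DTW with a Euclidean local cost."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])


def fft_distance(a, b):
    """Zero-pad both curves to a common length, then compare 2D FFT magnitudes.

    Zero-padding and the Euclidean norm of the magnitude difference are
    assumptions made for this sketch.
    """
    size = max(len(a), len(b))

    def pad(c):
        return np.vstack([c, np.zeros((size - len(c), 2))])

    fa = np.abs(np.fft.fft2(pad(a)))
    fb = np.abs(np.fft.fft2(pad(b)))
    return float(np.linalg.norm(fa - fb))
```

For two sequences `s1` and `s2` of different lengths, `dtw_distance(curve(s1), curve(s2))` and `fft_distance(curve(s1), curve(s2))` each return one dissimilarity value. The DTW loop is quadratic in sequence length, whereas the FFT route costs O(N log N) after padding, which is consistent with the speed difference the abstract reports.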
Pages: 1 - 11
Number of pages: 11