Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation

被引:24
|
作者
Hu, Hailong [1 ,2 ]
Li, Zhong [3 ]
Dong, Hongwei [4 ]
Zhou, Tianhe [3 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 311300, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Hangzhou 311300, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
[4] Jiangnan Univ, Dept Comp Sci, Wuxi 214122, Peoples R China
关键词
Protein sequence; graphic representation; fractal interpolation; principal component analysis; PHYSICOCHEMICAL PROPERTIES; DISTANCE; DIMENSION; ALIGNMENT; CURVE;
D O I
10.1109/TCBB.2015.2511731
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A new graphical representation of protein sequences is introduced in this paper. Nine main physicochemical properties of amino acids were used to obtain a 2D discrete point set for protein sequences by applying principal component analysis. The fractal method was then employed to interpolate discrete points in constructing a graphical representation of protein sequences. Fractal dimension of the protein curve was used to analyze the similarity of protein sequences by comparing the distance of vectors representing segments of protein sequences. The Jeffrey's and Matusita distance was modified in the similarity comparison of protein sequences with different lengths. Nine different species from Nicotinamide adenine dinucleotide (NADH) dehydrogenase 5 (ND5) protein sequences were tested as an example to demonstrate our method. Finally, a linear correlation and significance analysis was used to compare our results with other graphical representations referring to the ClustalW result. To confirm the validity of our method, eight species in NADH dehydrogenase 6 (ND6) protein families and twenty-seven species in beta-globin protein families were also analyzed. Experimental results show that the proposed method is effective for the similarity analysis of proteins.
引用
收藏
页码:182 / 192
页数:11
相关论文
共 50 条
  • [1] FRACTAL GRAPHICAL REPRESENTATION AND ANALYSIS OF DNA AND PROTEIN SEQUENCES
    SOLOVYEV, VV
    BIOSYSTEMS, 1993, 30 (1-3) : 137 - 160
  • [2] Similarity analysis of protein sequences based on a new graphical representation method
    Zhang, Yuyan
    Wen, Jia
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2018, 18 (03) : 193 - 208
  • [3] A novel graphical representation and similarity analysis of protein sequences based on physicochemical properties
    Mahmoodi-Reihani, Mehri
    Abbasitabar, Fatemeh
    Zare-Shahabadi, Vahid
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 510 : 477 - 485
  • [4] Similarity/Dissimilarity Analysis of Protein Sequences by a New Graphical Representation
    Huang, Guohua
    Hu, Jerry
    CURRENT BIOINFORMATICS, 2013, 8 (05) : 539 - 544
  • [5] Similarity/Dissimilarity Analysis of Protein Sequences Based on a New Spectrum-Like Graphical Representation
    Yao, Yuhua
    Yan, Shoujiang
    Xu, Huimin
    Han, Jianning
    Nan, Xuying
    He, Ping-an
    Dai, Qi
    EVOLUTIONARY BIOINFORMATICS, 2014, 10 : 87 - 96
  • [6] Graphical Representation and Similarity Analysis of DNA Sequences Based on Trigonometric Functions
    Guo-Sen Xie
    Xiao-Bo Jin
    Chunlei Yang
    Jiexin Pu
    Zhongxi Mo
    Acta Biotheoretica, 2018, 66 : 113 - 133
  • [7] Graphical Representation and Similarity Analysis of DNA Sequences Based on Trigonometric Functions
    Xie, Guo-Sen
    Jin, Xiao-Bo
    Yang, Chunlei
    Pu, Jiexin
    Mo, Zhongxi
    ACTA BIOTHEORETICA, 2018, 66 (02) : 113 - 133
  • [8] A new graphical representation of similarity/dissimilarity studies of protein sequences
    He, P.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2010, 21 (5-6) : 571 - 580
  • [9] Analysis of similarity/dis similarity of DNA sequences based on 3-D graphical representation
    Liao, B
    Wang, TM
    CHEMICAL PHYSICS LETTERS, 2004, 388 (1-3) : 195 - 200
  • [10] Correction to: Graphical Representation and Similarity Analysis of DNA Sequences Based on Trigonometric Functions
    Guo-Sen Xie
    Xiao-Bo Jin
    Chunlei Yang
    Jiexin Pu
    Zhongxi Mo
    Acta Biotheoretica, 2018, 66 : 251 - 253