On using physico-chemical properties of amino acids in string kernels for protein classification via support vector machines

被引:0
|
作者
Li Limin [1 ]
Aoki-Kinoshita, Kiyoko F. [2 ]
Ching Wai-Ki [3 ]
Jiang Hao [4 ]
机构
[1] Xi An Jiao Tong Univ, Inst Informat & Syst Sci, Xian 710049, Peoples R China
[2] Soka Univ, Dept Bioinformat, Fac Engn, Tokyo, Japan
[3] Univ Hong Kong, Dept Math, Adv Modeling & Appl Comp Lab, Hong Kong, Hong Kong, Peoples R China
[4] Renmin Univ China, Sch Informat, Dept Math, Beijing 100872, Peoples R China
基金
中国国家自然科学基金;
关键词
AAindex; AA spectrum kernel; correlation spectrum kernel; physico-chemical properties; string kernel; weighted spectrum kernel; LECTIN; PREDICTION; GLYCOMICS;
D O I
10.1007/s11424-015-2156-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
String kernels are popular tools for analyzing protein sequence data and they have been successfully applied to many computational biology problems. The traditional string kernels assume that different substrings are independent. However, substrings can be highly correlated due to their substructure relationship or common physico-chemical properties. This paper proposes two kinds of weighted spectrum kernels: The correlation spectrum kernel and the AA spectrum kernel. We evaluate their performances by predicting glycan-binding proteins of 12 glycans. The results show that the correlation spectrum kernel and the AA spectrum kernel perform significantly better than the spectrum kernel for nearly all the 12 glycans. By comparing the predictive power of AA spectrum kernels constructed by different physico-chemical properties, the authors can also identify the physicochemical properties which contributes the most to the glycan-protein binding. The results indicate that physico-chemical properties of amino acids in proteins play an important role in the mechanism of glycan-protein binding.
引用
收藏
页码:504 / 516
页数:13
相关论文
共 50 条
  • [21] Conotoxin protein classification using free scores of words and support vector machines
    Zaki, Nazar
    Wolfsheimer, Stefan
    Nuel, Gregory
    Khuri, Sawsan
    BMC BIOINFORMATICS, 2011, 12
  • [22] ISOLATION, AMINO ACID COMPOSITION AND SOME PHYSICO-CHEMICAL PROPERTIES OF PROTEIN DEUTERIO-PHYCOCYANIN
    BERNS, DS
    CRESPI, HL
    KATZ, JJ
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1963, 85 (01) : 8 - &
  • [23] Effects of the physico-chemical properties of amino acids and chemically functionalized surfaces on DIOS-MS analysis
    Lavigne, Antonin
    Gehin, Thomas
    Gilquin, Benoit
    Xerri, Laetitia-Eiko
    Veillerot, Marc
    Jousseaume, Vincent
    Chevolot, Yann
    Phaner-Goutorbe, Magali
    Yeromonahos, Christelle
    ANALYTICAL BIOCHEMISTRY, 2025, 700
  • [24] An Overview on Physico-Chemical Properties of Amino Acids upon Interactions with Solution Components: A Volumetric and Viscometric Approach
    Malabika Rupesh Kumar Pradhan
    Sulochana Talukdar
    Russian Journal of Physical Chemistry A, 2023, 97 : 2631 - 2649
  • [25] An Overview on Physico-Chemical Properties of Amino Acids upon Interactions with Solution Components: A Volumetric and Viscometric Approach
    Pradhana, Rupesh Kumar
    Talukdar, Malabika
    Singh, Sulochana
    RUSSIAN JOURNAL OF PHYSICAL CHEMISTRY A, 2023, 97 (12) : 2631 - 2649
  • [26] Identification of Relevant Physico Chemical Properties of Amino Acids with Respect to Protein Glycosylation Prediction
    Banerjee, Sagnik
    Mitra, Basudeb
    Chatterjee, Avimita
    Santra, Arnab
    Chatterjee, Baisakhi
    2015 INTERNATIONAL CONFERENCE AND WORKSHOP ON COMPUTING AND COMMUNICATION (IEMCON), 2015,
  • [27] Soil type classification and estimation of soil properties using support vector machines
    Kovacevic, Milos
    Bajat, Branislav
    Gajic, Bosko
    GEODERMA, 2010, 154 (3-4) : 340 - 347
  • [28] Comparison of Amino Acids Physico-Chemical Properties and Usage of Late Embryogenesis Abundant Proteins, Hydrophilins and WHy Domain
    Jaspard, Emmanuel
    Hunault, Gilles
    PLOS ONE, 2014, 9 (10):
  • [29] Comparative analysis of physico-chemical properties and amino acids profile of three tropical maize hybrid cultivars in Nigeria
    Oladeji, Babatunde Stephen
    Irinkoyenikan, Oluwatoyin Ajoke
    Gbadamosi, Olasunkanmi Saka
    Ibironke, Samson Ishola
    Akanbi, Charles Taiwo
    Taiwo, Kehinde Adekunbi
    NUTRITION & FOOD SCIENCE, 2016, 46 (05): : 695 - 705
  • [30] Using recurrence quantification analysis Descriptors for protein sequence classification with support vector machines
    Mitra, Joydeep
    Mundra, Piyushkumar
    Kulkarni, B. D.
    Jayaraman, Valadi K.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2007, 25 (03): : 289 - 297