Protein Sequence Classification Using Natural Vector and Convex Hull Method

被引:5
|
作者
Wang, Yi [1 ]
Tian, Kun [1 ]
Yau, Stephen S. -T. [1 ]
机构
[1] Tsinghua Univ, Dept Math Sci, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
phylogenetic analysis; protein classification; protein kinase C; sequence comparison; BRYOSTATIN-1;
D O I
10.1089/cmb.2018.0216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein kinase C (PKC) is a superfamily of enzymes, which regulate numerous cellular responses. The specific function of PKC protein family is mainly governed by its individual protein domains. However, existing protein sequence classification methods based on sequence alignment and sequence analysis models focused little on the domain analysis. In this study, we introduce a novel protein kinase classification method that considers both domain sequence similarity and whole sequence similarity to quantify the evolutionary distance from a specific protein to a protein family. Using the natural vector method, we establish a 60-dimensional space, where each protein is uniquely represented by a vector. We also define a convex hull, consisting of the natural vectors corresponding to all members of a protein family. The sequence similarity between a protein and a protein family, therefore, can be quantified as the distance between the protein vector and the protein family convex hull. We have applied this method in a PKC sample library and the results showed a higher accuracy of classification compared with other alignment-free methods.
引用
收藏
页码:315 / 321
页数:7
相关论文
共 50 条
  • [31] Fast convex-hull vector machine for training on large-scale ncRNA data classification tasks
    Gu, Xiaoqing
    Chung, Fu-lai
    Wang, Shitong
    KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 149 - 164
  • [32] A Convex Hull Query Processing Method in MANETs
    Komai, Yuka
    Hara, Takahiro
    Nishio, Shojiro
    2014 IEEE 33RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2014, : 331 - 332
  • [33] Convex Hull Aided Registration Method (CHARM)
    Fan, Jingfan
    Yang, Jian
    Zhao, Yitian
    Ai, Danni
    Liu, Yonghuai
    Wang, Ge
    Wang, Yongtian
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (09) : 2042 - 2055
  • [34] Kernelized Convex Hull based Collaborative Representation for Tumor Classification
    Chen, Xia
    Chen, Haowen
    Cao, Dan
    Li, Bo
    CURRENT PROTEOMICS, 2018, 15 (05) : 384 - 393
  • [35] A Fast Algorithm of Convex Hull Vertices Selection for Online Classification
    Ding, Shuguang
    Nie, Xiangli
    Qiao, Hong
    Zhang, Bo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 792 - 806
  • [36] Protein sequence classification using feature hashing
    Caragea, Cornelia
    Silvescu, Adrian
    Mitra, Prasenjit
    PROTEOME SCIENCE, 2012, 10
  • [37] Protein Sequence Classification Using Feature Hashing
    Caragea, Cornelia
    Silvescu, Adrian
    Mitra, Prasenjit
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 538 - 543
  • [38] Protein sequence classification using feature hashing
    Cornelia Caragea
    Adrian Silvescu
    Prasenjit Mitra
    Proteome Science, 10
  • [39] Online Support Vector Machine Based on Convex Hull Vertices Selection
    Wang, Di
    Qiao, Hong
    Zhang, Bo
    Wang, Min
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (04) : 593 - 609
  • [40] CONVEX HULL OF RANGE OF VECTOR VALUED HOLOMORPHIC MAPPINGS - PRELIMINARY REPORT
    HALL, RL
    PATIL, DJ
    NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1971, 18 (01): : 182 - &