Classifying DNA repair genes by kernel-based support vector machines

Authors
Jiang, Hao [1]
Ching, Wai-Ki [1]
Affiliations
[1] Univ Hong Kong, Dept Math, Adv Modeling & Appl Comp Lab, Pokfulam Rd, Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Human longevity is a complex phenotype with a significant genetic predisposition. Like other biological processes, the ageing process is governed through the regulation of signaling pathways and transcription factors. The DNA damage theory of ageing suggests that ageing is a consequence of the accumulation of unrepaired DNA damage. Intensive research has been carried out to elucidate the role of DNA repair systems in the ageing process. Decision Trees and the Naive Bayesian algorithm are two data-mining-based classification methods that have been used to systematically analyze data about human DNA repair genes. In this paper we develop a linearly combined kernel for the Support Vector Machine (SVM) to analyze the ageing-related data. This popular supervised learning algorithm enables better discrimination between ageing-related and non-ageing-related DNA repair genes. A linear combination of a linear kernel and a polynomial kernel of degree 3, used in conjunction with SVM, yields better classification accuracy on the DNA repair gene data set. SVM with the proposed kernel achieves an AUC (area under the ROC curve) of 65%, compared with 51.1% for Decision Trees and 52.1% for the Naive Bayesian algorithm. More importantly, training on the whole data set selects 5 significant ageing-related genes: PCNA, PARP, APEX1, MLH1 and XRCC6. Unlike those two methods, our approach also identifies PCNA, an important gene in the pathways they targeted but one they failed to detect. Two novel genes, PARP and MLH1, are selected as well, and they might provide potential insights for biologists in ageing research. SVM is a powerful and robust classification algorithm that can yield higher predictive accuracy. The choice of a proper kernel plays an important role in the classification task. The important genes identified not only target critical pathways related to ageing but also include genes that may point to possible ageing-related biomarkers.
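The combined kernel described in the abstract can be illustrated with a minimal sketch. The abstract does not give the mixing weight, the polynomial offset, or the feature encoding, so `weight`, `coef0`, and all function names below are hypothetical choices for illustration, not the authors' settings. The `auc` helper computes the area under the ROC curve as a pairwise ranking statistic, which is one standard way to obtain the metric reported above.

```python
from typing import List, Sequence


def linear_kernel(x: Sequence[float], y: Sequence[float]) -> float:
    """Plain dot product <x, y>."""
    return sum(a * b for a, b in zip(x, y))


def poly3_kernel(x: Sequence[float], y: Sequence[float], coef0: float = 1.0) -> float:
    """Polynomial kernel of degree 3: (<x, y> + coef0)^3. coef0 is an assumption."""
    return (linear_kernel(x, y) + coef0) ** 3


def combined_kernel(x: Sequence[float], y: Sequence[float],
                    weight: float = 0.5, coef0: float = 1.0) -> float:
    """Convex combination of the linear and degree-3 polynomial kernels.

    weight is an assumed mixing coefficient in [0, 1]; the abstract does not state it.
    """
    return weight * linear_kernel(x, y) + (1.0 - weight) * poly3_kernel(x, y, coef0)


def gram_matrix(samples: Sequence[Sequence[float]],
                weight: float = 0.5, coef0: float = 1.0) -> List[List[float]]:
    """Pairwise kernel values, in the form a precomputed-kernel SVM solver expects."""
    return [[combined_kernel(a, b, weight, coef0) for b in samples] for a in samples]


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    toy = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # made-up feature vectors
    for row in gram_matrix(toy):
        print(row)
    print(auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0]))
```

Because a non-negative weighted sum of valid kernels is itself a valid kernel, the resulting Gram matrix can be handed to any SVM solver that accepts precomputed kernels.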
Pages: 257-263
Page count: 7