Splice site prediction using support vector machines with a Bayes kernel

被引:39
|
作者
Zhang, Y
Chu, CH
Chen, YX
Zha, HY
Ji, X
机构
[1] Penn State Univ, Sch Informat Sci & Technol, University Pk, PA 16802 USA
[2] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
[3] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[4] NEC Labs Amer, Cupertino, CA 95014 USA
关键词
splice site prediction; SVM; Support Vector Machines; Bayes classifier; machine learning; splice site prediction using support vector machines with Bayes kernel;
D O I
10.1016/j.eswa.2005.09.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most important tasks in correctly annotating genes in higher organisms is to accurately locate the DNA splice sites. Although relatively high accuracy has been achieved by existing methods, most of these prediction methods are computationally extensive. Due to the enormous amount of DNA sequences to be processed, the computational speed is an important issue to consider. In this paper, we present a new machine learning method for predicting DNA splice sites, which first applies a Bayes feature mapping (kernel) to project the data into a new feature space and then uses a linear Support Vector Machine (SVM) as a classifier to recognize the true splice sites. The computation time is linear to the number of sequences tested, while the performance is notably improved compared with the Naive Bayes classifier in terms of classification accuracy, precision, and recall. Our classification results are also comparable to the solution quality obtained by the SVMs with polynomial kernels, while the speed of our proposed method is significantly faster. This is a notable improvement in computational modeling considering the huge amount of DNA sequences to be processed. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:73 / 81
页数:9
相关论文
共 50 条
  • [1] Splice Site Prediction using Support Vector Machines with Context-Sensitive Kernel Functions
    Chen, Yifei
    Liu, Feng
    Vanschoenwinkel, Bram
    Manderick, Bernard
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2009, 15 (13) : 2528 - 2546
  • [2] Accurate splice site prediction using support vector machines
    Sören Sonnenburg
    Gabriele Schweikert
    Petra Philips
    Jonas Behr
    Gunnar Rätsch
    [J]. BMC Bioinformatics, 8
  • [3] Accurate splice site prediction using support vector machines
    Sonnenburg, Soeren
    Schweikert, Gabriele
    Philips, Petra
    Behr, Jonas
    Raetsch, Gunnar
    [J]. BMC BIOINFORMATICS, 2007, 8 (Suppl 10)
  • [4] An improved method for splice site prediction in DNA sequences using support vector machines
    Goel, Neelam
    Singh, Shailendra
    Aseri, Trilok Chand
    [J]. 3RD INTERNATIONAL CONFERENCE ON RECENT TRENDS IN COMPUTING 2015 (ICRTC-2015), 2015, 57 : 358 - 367
  • [5] The Effect of Kernel Functions on Cryptocurrency Prediction Using Support Vector Machines
    Hitam, Nor Azizah
    Ismail, Amelia Ritahani
    Samsudin, Ruhaidah
    Alkhammash, Eman H.
    [J]. ADVANCES ON INTELLIGENT INFORMATICS AND COMPUTING: HEALTH INFORMATICS, INTELLIGENT SYSTEMS, DATA SCIENCE AND SMART COMPUTING, 2022, 127 : 319 - 332
  • [6] Churn prediction in telecommunication industry using kernel Support Vector Machines
    Nhu, Nguyen Y.
    Tran Van Lyid
    Dao Vu Truong Son
    [J]. PLOS ONE, 2022, 17 (05):
  • [7] A novel splice site prediction method using support vector machine
    [J]. Wei, Y. (yj.wei@siat.ac.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [8] Prediction of Active Site Cleft Using Support Vector Machines
    Sonavane, Shrihari
    Chakrabarti, Pinak
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (12) : 2266 - 2273
  • [9] Traffic Accident Prediction Model Using Support Vector Machines with Gaussian Kernel
    Sharma, Bharti
    Katiyar, Vinod Kumar
    Kumar, Kranti
    [J]. PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 1 - 10
  • [10] Human splice site identification with multiclass support vector machines and bagging
    Lorena, AC
    de Carvalho, ACPLF
    [J]. ARTIFICIAL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 234 - 241