Woven String Kernels for DNA Sequence Classification

Cited by: 0
Authors
McEachern, Andrew [1]
Ashlock, Daniel [1]
Affiliation
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Woven string kernels (WSKs) are a form of evolvable directed acyclic graph specialized to perform DNA classification. They are introduced in this study and tested on simple and complex synthetic data as well as biological data. WSKs perform only marginally on the simplest synthetic data, which is based on GC content and for which they are not entirely appropriate. They exhibit perfect classification on the more complex synthetic data and on the biological data. Woven string kernels have a number of parameters, including their height, the number of initial strings from which they are built, and the amount of "weaving" used to generate the final structure. A parameter study shows that these parameters must be set based on the type of data under analysis. The paper concludes with comments on possible improvements to the woven string kernel technique.
Pages: 1578-1585
Page count: 8