Woven String Kernels for DNA Sequence Classification

被引:0
|
作者
McEachern, Andrew [1 ]
Ashlock, Daniel [1 ]
机构
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Woven string kernels are a form of evolvable directed acyclic graph specialized to perform DNA classification. They are introduced in this study and tested on simple and complex synthetic data as well as biological data. The WSKs perform marginally on the simplest synthetic data - based on GC content - for which they are not entirely appropriate. They exhibit perfect classification on the more complex synthetic data and on the biological data. Woven string kernels have a number of parameters including their height, the number of initial strings from which they are built, and the amount of "weaving" used to generate the final structure. A parameter study shows that these parameters must be set based on the type of data under analysis. The paper concludes with comments on possible improvements of the woven string kernel technique.
引用
收藏
页码:1578 / 1585
页数:8
相关论文
共 50 条
  • [41] Languages as hyperplanes: grammatical inference with string kernels
    Alexander Clark
    Christophe Costa Florêncio
    Chris Watkins
    Machine Learning, 2011, 82 : 351 - 373
  • [42] A Framework for Space-Efficient String Kernels
    Djamal Belazzougui
    Fabio Cunial
    Algorithmica, 2017, 79 : 857 - 883
  • [43] A Framework for Space-Efficient String Kernels
    Belazzougui, Djamal
    Cunial, Fabio
    ALGORITHMICA, 2017, 79 (03) : 857 - 883
  • [44] Learning actions using robust string kernels
    Yang, Changjiang
    Guo, Yanlin
    Sawhney, Harpreet
    Kumar, Rakesh
    HUMAN MOTION - UNDERSTANDING, MODELING, CAPTURE AND ANIMATION, PROCEEDINGS, 2007, 4814 : 313 - +
  • [45] Languages as hyperplanes:: Grammatical inference with string kernels
    Clark, Alexander
    Florencio, Christophe Costa
    Watkins, Chris
    MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 90 - 101
  • [46] Languages as hyperplanes: grammatical inference with string kernels
    Clark, Alexander
    Florencio, Christophe Costa
    Watkins, Chris
    MACHINE LEARNING, 2011, 82 (03) : 351 - 373
  • [47] Evolved Features for DNA Sequence Classification and Their Fitness Landscapes
    Ashlock, Wendy
    Datta, Suprakash
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2013, 17 (02) : 185 - 197
  • [48] DNA sequence classification based on MLP with PILAE algorithm
    Mahmoud, Mohammed A. B.
    Guo, Ping
    SOFT COMPUTING, 2021, 25 (05) : 4003 - 4014
  • [49] DNA sequence classification based on MLP with PILAE algorithm
    Mohammed A. B. Mahmoud
    Ping Guo
    Soft Computing, 2021, 25 : 4003 - 4014
  • [50] Word-sequence kernels
    Cancedda, N
    Gaussier, E
    Goutte, C
    Renders, JM
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) : 1059 - 1082