Woven String Kernels for DNA Sequence Classification

Authors
McEachern, Andrew [1]
Ashlock, Daniel [1]
Affiliations
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Woven string kernels (WSKs) are a form of evolvable directed acyclic graph specialized to perform DNA classification. They are introduced in this study and tested on simple and complex synthetic data as well as on biological data. The WSKs perform only marginally on the simplest synthetic data, which is based on GC content and for which they are not entirely appropriate. They achieve perfect classification on the more complex synthetic data and on the biological data. Woven string kernels have a number of parameters, including their height, the number of initial strings from which they are built, and the amount of "weaving" used to generate the final structure. A parameter study shows that these parameters must be set according to the type of data under analysis. The paper concludes with comments on possible improvements to the woven string kernel technique.
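For readers unfamiliar with the term, GC content is the fraction of bases in a DNA sequence that are G or C. The sketch below shows how it can be computed, along with a simple threshold rule of the kind that could separate sequences in a GC-based synthetic data set. It is not the paper's method and does not implement a woven string kernel; the function names and the 0.5 threshold are assumptions chosen for illustration.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        return 0.0  # convention assumed here for the empty sequence
    return sum(base in "GC" for base in seq) / len(seq)


def classify_by_gc(seq: str, threshold: float = 0.5) -> int:
    """Hypothetical baseline: label 1 if the GC fraction exceeds threshold, else 0."""
    return 1 if gc_content(seq) > threshold else 0
```

A rule like this depends only on base composition, not on the order of the bases, which may be one way to read the abstract's remark that an order-sensitive string structure is not entirely appropriate for such data.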
Pages: 1578 - 1585
Number of pages: 8