Classification of Nucleotide Sequences Using Support Vector Machines

被引:19
|
作者
Seo, Tae-Kun [1 ]
机构
[1] Univ Tokyo, Grad Sch Agr & Life Sci, Agr Bioinformat Res Unit, Bunkyo Ku, Tokyo 1138657, Japan
关键词
DNA barcoding; Support vector machine; Pattern recognition; Bootstrap; Permutation test; EVOLUTIONARY TREE TOPOLOGIES; DNA BARCODES; SPECIES TREES; MITOCHONDRIAL-DNA; IDENTIFICATION; TAXONOMY; STEP;
D O I
10.1007/s00239-010-9380-9
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Species identification is one of the most important issues in biological studies. Due to recent increases in the amount of genomic information available and the development of DNA sequencing technologies, the applicability of using DNA sequences to identify species (commonly referred to as "DNA barcoding") is being tested in many areas. Several methods have been suggested to identify species using DNA sequences, including similarity scores, analysis of phylogenetic and population genetic information, and detection of species-specific sequence patterns. Although these methods have demonstrated good performance under a range of circumstances, they also have limitations, as they are subject to loss of information, require intensive computation and are sensitive to model mis-specification, and can be difficult to evaluate in terms of the significance of identification. Here, we suggest a new DNA barcoding method in which support vector machine (SVM) procedures are adopted. Our new method is nonparametric and thus is expected to be robust for a wide range of evolutionary scenarios as well as multi locus analyses. Furthermore, we describe bootstrap procedures that can be used to test the significances of species identifications. We implemented a novel conversion technique for transforming sequence data to real-valued vectors, and therefore, bootstrap procedures can be easily combined with our SVM approach. In this study, we present the results of simulation studies and empirical data analyses to demonstrate the performance of our method and discuss its properties.
引用
收藏
页码:250 / 267
页数:18
相关论文
共 50 条
  • [1] Classification of Nucleotide Sequences Using Support Vector Machines
    Tae-Kun Seo
    [J]. Journal of Molecular Evolution, 2010, 71 : 250 - 267
  • [2] Wafer Classification Using Support Vector Machines
    Baly, Ramy
    Hajj, Hazem
    [J]. IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2012, 25 (03) : 373 - 383
  • [3] Bag classification using support vector machines
    Kartoun, Uri
    Stern, Helman
    Edan, Yael
    [J]. APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 665 - 674
  • [4] A CBIR CLASSIFICATION USING SUPPORT VECTOR MACHINES
    Sugamya, Katta
    Pabboju, Suresh
    Babu, A. Vinaya
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN HUMAN MACHINE INTERACTION (HMI), 2016, : 135 - +
  • [5] Classification of Torreya Using Support Vector Machines
    Wang, Xiaodong
    Chang, Jianli
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND INFORMATION (TEIN 2012), 2012, : 212 - 216
  • [6] Gender classification using support vector machines
    Yang, MH
    Moghaddam, B
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 471 - 474
  • [7] Cloud classification using support vector machines
    Azimi-Sadjadi, MR
    Zekavat, SA
    [J]. IGARSS 2000: IEEE 2000 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOL I - VI, PROCEEDINGS, 2000, : 669 - 671
  • [8] Scene Classification Using Support Vector Machines
    Mandhala, Venkata Naresh
    Sujatha, V.
    Devi, B. Renuka
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 1807 - 1810
  • [9] Classification of Performers using Support Vector Machines
    Reljin, Natasa
    Pokrajac, Dragoljub
    [J]. NEUREL 2008: NINTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2008, : 156 - +
  • [10] Pose classification using support vector machines
    Ardizzone, E
    Chella, A
    Pirrone, R
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL VI, 2000, : 317 - 322