Efficient Decision Tree Based Data Selection and Support Vector Machine Classification

被引:13
|
作者
Arumugam, P. [1 ]
Jose, P. [1 ]
机构
[1] Manonmaniam Sundaranar Univ, Dept Stat, Tirunelveli 628012, Tamil Nadu, India
关键词
Microarray; large datasets; Classification; Decision Tree; SVM;
D O I
10.1016/j.matpr.2017.11.263
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Today' real world data bases witnessed significant increase in the amount of data in digital format, due to the widespread use of datasets and storage system. There is a need to developing fast and highly accurate algorithms to automatically classify large data. It becomes a vital part of the machine learning and knowledge discovery. The main intention of this paper is however data sizes increases, our proposed method make faster computation and scalable machine learning algorithm is used to learn faster from the labelled training data. Due to its strong mathematical background and theoretical foundation and good generalization performance, Support Vector Machine (SVM) Classification becomes more feasible options for large datasets. A major research goal of SVM is to improve the speed in training and testing phase. In this paper We introduce a proposed algorithm to speed up the training time of SVM is presented. It is highly accurate classification method. However SVM classifiers suffer from slow processing, when training with a large set of data tuples. Our novel approach selects a small representative amount of data from large datasets to enhance training time of SVM. This method uses an induction tree to reduce the training dataset for SVM classification, it generate faster results with improving accuracy rates than the current SVM implementations. (c) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1679 / 1685
页数:7
相关论文
共 50 条
  • [1] Feature selection and classification using support vector machine and decision tree
    Durgalakshmi, B.
    Vijayakumar, V.
    [J]. COMPUTATIONAL INTELLIGENCE, 2020, 36 (04) : 1480 - 1492
  • [2] Sequence Mining-Based Support Vector Machine with Decision Tree Approach for Efficient Time Series Data Classification
    Senthil, D.
    Suseendran, G.
    [J]. DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 2, 2020, 1016 : 3 - 17
  • [3] Robust variable selection based on bagging classification tree for support vector machine in metabonomic data analysis
    Chen, Shu-Fang
    Gu, Hui
    Tu, Meng-Ying
    Zhou, Yan-Ping
    Cui, Yan-Fang
    [J]. JOURNAL OF CHEMOMETRICS, 2018, 32 (11)
  • [4] Automated parameter selection for support vector machine decision tree
    Choi, Gyunghyun
    Bae, Suk Joo
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 746 - 753
  • [5] Automatic classification using decision tree and support vector machine
    Han, Y
    Lee, C
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2005, 3682 : 1325 - 1330
  • [6] Support vector machine tree based on feature selection
    Xu, Qinzhen
    Pei, Wenjiang
    Yang, Luxi
    He, Zhenya
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2006, 4232 : 856 - 863
  • [7] Decision tree support vector machine
    Zhang, Li
    Zhou, Wei-Da
    Su, Tian-Tian
    Jiao, Li-Cheng
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (01) : 1 - 15
  • [8] Multiclass cancer classification by using fuzzy support vector machine and binary decision tree with gene selection
    Mao, Y
    Zhou, XB
    Pi, DY
    Sun, YX
    Wong, STC
    [J]. JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2005, (02): : 160 - 171
  • [10] Decision tree support vector machine based on genetic algorithm for multi-class classification
    Chen, Huanhuan
    Wang, Qiang
    Shen, Yi
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2011, 22 (02) : 322 - 326