A map reduce based support vector machine for big data classification

被引:0
|
作者
Priyadarshini, Anushree [1 ]
Agarwal, Sonali [1 ]
机构
[1] Department of Information Technology, Indian Institute of Information Technology, Allahabad, India
关键词
Support vector machines - Distributed computer systems - Digital storage;
D O I
10.14257/ijdta.2015.8.5.07
中图分类号
学科分类号
摘要
Support Vector Machine (SVM) is extremely powerful and widely accepted classifier in the field of machine learning due to its better generalization capability. However, SVM is not suiTable for large scale dataset due to its high computational complexity. The computation and storage requirement increases tremendously for large dataset. In this paper, we have proposed a MapReduce based SVM for large scale data. MapReduce is a distributed programming model which works on large scale dataset by dividing the huge datasets in smaller chunks. MapReduce distribution model works on several frame works like Hadoop Twister and so on. In this paper, we have analyzed the impact of penalty and kernel parameters on the performance of parallel SVM. The experimental result shows that the number of support vectors and predictive accuracy of SVM is affected by the choice of these parameters. From experimental results, it is also analyzed that the computation time taken by the SVM with multi-node cluster is less as compared to the single node cluster for large dataset.
引用
收藏
页码:77 / 98
相关论文
共 50 条
  • [1] Quantum Support Vector Machine for Big Data Classification
    Rebentrost, Patrick
    Mohseni, Masoud
    Lloyd, Seth
    [J]. PHYSICAL REVIEW LETTERS, 2014, 113 (13)
  • [2] Support Vector Machine Based Automatic Classification Method for IoT Big Data Features
    Xu, Yong-Hua
    [J]. Journal of Computers (Taiwan), 2023, 34 (05) : 15 - 27
  • [3] AN OUTLIER MAP FOR SUPPORT VECTOR MACHINE CLASSIFICATION
    Debruyne, Michel
    [J]. ANNALS OF APPLIED STATISTICS, 2009, 3 (04): : 1566 - 1580
  • [4] Prediction of Data Classification Based on Support Vector Machine
    Wu, Xinghui
    Zhou, Yuping
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2016), 2016, 50 : 694 - 699
  • [5] Data Classification with Support Vector Machine and Generalized Support Vector Machine
    Qi, Xiaomin
    Silvestrov, Sergei
    Nazir, Talat
    [J]. ICNPAA 2016 WORLD CONGRESS: 11TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES, 2017, 1798
  • [6] Support vector machine parallelized remote sensing image classification algorithm based on big data
    Liao, Li
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06) : 62005
  • [7] Sentiment Analysis Based on Support Vector Machine and Big Data
    Povoda, Lukas
    Burget, Radim
    Dutta, Malay Kishore
    [J]. 2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 543 - 545
  • [8] Map-Reduce based Parallel Support Vector Machine For Risk analysis
    Tripathy, Pujasuman
    Rautaray, Siddharth Swarup
    Pandey, Manjusha
    [J]. 2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 300 - 303
  • [9] A Map Reduce solution for associative classification of big data
    Bechini, Alessio
    Marcelloni, Francesco
    Segatori, Armando
    [J]. INFORMATION SCIENCES, 2016, 332 : 33 - 55
  • [10] Support vector machine for classification based on fuzzy training data
    Ji, Ai-bing
    Pang, Jia-hong
    Qiu, Hong-jie
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (04) : 3495 - 3498