Big Data Analytics using Multi-Classifier Approach with RHadoop

Cited by: 0
Authors
Hiranandani, Priyanka [1]
Pilli, Emmanuel S. [2]
Chand, Nanak [3]
Ramakrishna, C. [3]
Gupta, Madhuri [2]
Affiliations
[1] Bharat Petr Corp Ltd, ERPCC IIS, Mumbai, Maharashtra, India
[2] Malaviya Natl Inst Technol, Dept Comp Sci & Engn, Jaipur, Rajasthan, India
[3] Natl Inst Tech Teachers Training & Res, Dept Comp Sci & Engn, Chandigarh, India
Keywords
BigData Analytics; Multi-Classifier; Naive Bayes; K-Nearest Neighbor; Decision Tree;
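DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract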
Big Data refers to massive amounts of data generated at such high speed that they are very difficult to analyze with traditional tools. Hadoop provides distributed storage and processing to extract useful information from such huge data. R, on the other hand, is an open-source data analysis and programming language that facilitates statistical analysis and data visualization. But R is not scalable: processing big data with R is difficult because of its memory limitations. To apply the data visualization and data transformation capabilities of R to Big Data, in this paper we integrate R with Hadoop using the RHadoop package and implement MapReduce forms of the K-Nearest Neighbor, Naive Bayes and Decision Tree classifiers in R. We also implement a Multi-Classifier to improve classification accuracy. The Multi-Classifier combines the power of the individual classifiers to increase the efficiency and accuracy of classification. We use a Bayesian combinatorial function and majority voting to combine the above-mentioned classifiers. We find that the Multi-Classifier approach improves parameters such as precision, recall and accuracy.
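The majority-voting step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' RHadoop implementation: it is a plain Python illustration, and the function names `majority_vote` and `combine`, the tie-breaking rule, and the toy labels are all assumptions made for the example.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by the most classifiers for one sample.

    Assumption: ties are broken in favour of the label that appears
    first in `predictions` (the abstract does not specify a rule).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def combine(per_classifier_predictions):
    """Combine predictions from several classifiers, sample by sample.

    `per_classifier_predictions` holds one list of labels per classifier,
    all of the same length.
    """
    return [majority_vote(list(votes)) for votes in zip(*per_classifier_predictions)]


# Toy predictions standing in for the three classifiers named in the abstract.
knn_pred = ["yes", "no", "no"]
naive_bayes_pred = ["yes", "yes", "no"]
decision_tree_pred = ["no", "yes", "no"]

combined = combine([knn_pred, naive_bayes_pred, decision_tree_pred])
print(combined)
```

With three classifiers and two classes, a strict majority always exists, so the tie-breaking rule only matters when there are more classes or an even number of classifiers.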
Pages: 478-484
Number of pages: 7