MapReduce Implementation of a Multinomial and Mixed Naive Bayes Classifier

Cited by: 6
Authors
Bagui, Sikha [1]
Devulapalli, Keerthi [2]
John, Sharon [2]
Affiliations
[1] Univ West Florida, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Univ West Florida, Pensacola, FL USA
Keywords
Big Data; Discrete Naive Bayes Model; Hadoop; MapReduce Environment; Mixed Naive Bayes Model; Naive Bayes Classification; Parallel Naive Bayes Implementation; Probability-Based Classification
DOI
10.4018/IJIIT.2020040101
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This study presents an efficient way to deal with discrete as well as continuous values in Big Data in a parallel Naive Bayes implementation on Hadoop's MapReduce environment. Two approaches were taken: (i) discretizing continuous values using a binning method; and (ii) using a multinomial distribution for probability estimation of discrete values and a Gaussian distribution for probability estimation of continuous values. The models were analyzed and compared for performance with respect to run time and classification accuracy for varying data sizes, data block sizes, and map memory sizes.
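The two approaches named in the abstract can be illustrated with a minimal, single-process Python sketch. It is not the authors' Hadoop implementation: the function names (`equal_width_bin`, `map_record`, `reduce_pairs`, `predict`), the equal-width binning rule, the Laplace smoothing constant `alpha`, and the variance floor are all assumptions made for illustration. The map step emits per-class counts for discrete attributes and (count, sum, sum of squares) for continuous attributes; the reduce step adds them up, which is enough to estimate smoothed frequencies and Gaussian parameters.

```python
import math
from collections import defaultdict


def equal_width_bin(value, lo, hi, n_bins):
    """Approach (i), assumed equal-width: map a continuous value to a bin index."""
    if hi <= lo:
        return 0
    idx = int((value - lo) / (hi - lo) * n_bins)
    return min(max(idx, 0), n_bins - 1)  # clamp out-of-range values


def map_record(label, discrete, continuous):
    """Map phase: emit (key, (count, sum, sum_of_squares)) for one record."""
    yield ("class", label), (1, 0.0, 0.0)
    for i, v in enumerate(discrete):
        yield ("disc", label, i, v), (1, 0.0, 0.0)
    for j, x in enumerate(continuous):
        yield ("cont", label, j), (1, x, x * x)


def reduce_pairs(pairs):
    """Reduce phase: sum the statistics that share a key."""
    totals = defaultdict(lambda: (0, 0.0, 0.0))
    for key, (n, s, ss) in pairs:
        n0, s0, ss0 = totals[key]
        totals[key] = (n0 + n, s0 + s, ss0 + ss)
    return dict(totals)


def predict(totals, discrete, continuous, n_values, alpha=1.0, var_floor=1e-9):
    """Approach (ii): smoothed frequencies for discrete attributes,
    Gaussian log-densities for continuous ones; returns the best class."""
    labels = [k[1] for k in totals if k[0] == "class"]
    n_total = sum(totals[("class", c)][0] for c in labels)
    best_label, best_score = None, -math.inf
    for c in labels:
        n_c = totals[("class", c)][0]
        score = math.log(n_c / n_total)  # log prior
        for i, v in enumerate(discrete):
            count = totals.get(("disc", c, i, v), (0, 0.0, 0.0))[0]
            score += math.log((count + alpha) / (n_c + alpha * n_values[i]))
        for j, x in enumerate(continuous):
            n, s, ss = totals[("cont", c, j)]
            mean = s / n
            var = max(ss / n - mean * mean, var_floor)
            score += -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = c, score
    return best_label
```

To use it, run `map_record` over every training record, pass the combined pairs to `reduce_pairs`, and call `predict` with `n_values` set to the number of distinct values of each discrete attribute. For approach (i), each continuous value would first be passed through `equal_width_bin` and then treated as a discrete attribute.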
Pages: 1-23
Page count: 23