MapReduce based for speech classification

被引:0
|
作者
Quang Trung Nguyen [1 ]
The Duy Bui [1 ]
机构
[1] Univ Engn & Technol, VNU Hanoi, Human Machine Interact Lab, Hanoi, Vietnam
关键词
LNBNN; MapReduce; Speech Classification; Big Data Speech Classification;
D O I
10.1145/3011077.3011090
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech classification is one of the most vital problems in speech processing as well as spoken word recognition. Although, there have been many studies on the classification of speech signals, the results are still limited on both accuracy and the size of the vocabulary. When classifying a huge volumes vocabulary, the speech classification becomes more and more difficult. Today, there are some frameworks that allow working with big data. One of these is a data mining utility. It can perform supervised classification procedures on very large amounts of data, usually named as big data, on a distributed infrastructure by using the MapReduce framework of Hadoop clusters. This tool has four classification approaches implemented. These are Random Forest, Naive Bayes, Decision Trees and Support Vector Machines (SVM). All these approaches require input data having the same size, so the input data must be quantized before using. This leads to decrease the accuracy in the classification stage. In this paper, we propose an implementation of Local Naive Bayes Nearest Neighbor based on Hadoop framework, which allows input data with different sizes and works well with huge training data.
引用
收藏
页码:87 / 91
页数:5
相关论文
共 50 条
  • [21] Large-Scale Graph Classification Based on Evolutionary Computation with MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Wang, Guoren
    Cheng, Yurong
    [J]. WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 227 - 243
  • [22] MapReduce Approach to Collective Classification for Networks
    Indyk, Wojciech
    Kajdanowicz, Tomasz
    Kazienko, Przemyslaw
    Plamowski, Slawomir
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2012, 7267 : 656 - 663
  • [23] Classification Framework of MapReduce Scheduling Algorithms
    Tiwari, Nidhi
    Sarkar, Santonu
    Bellur, Umesh
    Indrawan, Maria
    [J]. ACM COMPUTING SURVEYS, 2015, 47 (03)
  • [24] RuleMR: Classification Rule Discovery with MapReduce
    Kolias, Vasilis
    Kolias, Constantinos
    Anagnostopoulos, Ioannis
    Kayafas, Eleftherios
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [25] The classification of imbalanced large data sets based on MapReduce and ensemble of ELM classifiers
    Junhai Zhai
    Sufang Zhang
    Chenxi Wang
    [J]. International Journal of Machine Learning and Cybernetics, 2017, 8 : 1009 - 1017
  • [26] A MapReduce-based distributed SVM ensemble for scalable image classification and annotation
    Alham, Nasullah Khalid
    Li, Maozhen
    Liu, Yang
    Qi, Man
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2013, 66 (10) : 1920 - 1934
  • [27] The classification of imbalanced large data sets based on MapReduce and ensemble of ELM classifiers
    Zhai, Junhai
    Zhang, Sufang
    Wang, Chenxi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (03) : 1009 - 1017
  • [28] Fuzzy rule based classification systems for big data with MapReduce: granularity analysis
    Alberto Fernández
    Sara del Río
    Abdullah Bawakid
    Francisco Herrera
    [J]. Advances in Data Analysis and Classification, 2017, 11 : 711 - 730
  • [29] Extreme Learning Machine for large-scale graph classification based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    [J]. NEUROCOMPUTING, 2017, 261 : 106 - 114
  • [30] Extreme Learning Machine for Large-Scale Graph Classification Based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Wang, Guoren
    [J]. PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 93 - 105