MapReduce based for speech classification

被引:0
|
作者
Quang Trung Nguyen [1 ]
The Duy Bui [1 ]
机构
[1] Univ Engn & Technol, VNU Hanoi, Human Machine Interact Lab, Hanoi, Vietnam
关键词
LNBNN; MapReduce; Speech Classification; Big Data Speech Classification;
D O I
10.1145/3011077.3011090
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speech classification is one of the most vital problems in speech processing as well as spoken word recognition. Although, there have been many studies on the classification of speech signals, the results are still limited on both accuracy and the size of the vocabulary. When classifying a huge volumes vocabulary, the speech classification becomes more and more difficult. Today, there are some frameworks that allow working with big data. One of these is a data mining utility. It can perform supervised classification procedures on very large amounts of data, usually named as big data, on a distributed infrastructure by using the MapReduce framework of Hadoop clusters. This tool has four classification approaches implemented. These are Random Forest, Naive Bayes, Decision Trees and Support Vector Machines (SVM). All these approaches require input data having the same size, so the input data must be quantized before using. This leads to decrease the accuracy in the classification stage. In this paper, we propose an implementation of Local Naive Bayes Nearest Neighbor based on Hadoop framework, which allows input data with different sizes and works well with huge training data.
引用
收藏
页码:87 / 91
页数:5
相关论文
共 50 条
  • [31] Classification of microarray using MapReduce based proximal support vector machine classifier
    Kumar, Mukesh
    Rath, Santanu Kumar
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 89 : 584 - 602
  • [32] Extreme Learning Machine for Large-Scale Graph Classification Based on MapReduce
    Wang, Zhanghui
    Zhao, Yuhai
    Wang, Guoren
    [J]. PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 93 - 105
  • [33] A MapReduce-based distributed SVM ensemble for scalable image classification and annotation
    Alham, Nasullah Khalid
    Li, Maozhen
    Liu, Yang
    Qi, Man
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2013, 66 (10) : 1920 - 1934
  • [35] The classification of imbalanced large data sets based on MapReduce and ensemble of ELM classifiers
    Zhai, Junhai
    Zhang, Sufang
    Wang, Chenxi
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (03) : 1009 - 1017
  • [36] Towards Generalizing Classification Based Speech Separation
    Han, Kun
    Wang, DeLiang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 166 - 175
  • [37] A speech enhancement approach based on noise classification
    Yuan, Wenhao
    Xia, Bin
    [J]. APPLIED ACOUSTICS, 2015, 96 : 11 - 19
  • [38] AN SVM BASED CLASSIFICATION APPROACH TO SPEECH SEPARATION
    Han, Kun
    Wang, DeLiang
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4632 - 4635
  • [39] SPEECH SOUND CLASSIFICATION BASED ON SIGNAL STATISTICS
    BAKIS, R
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (06): : 852 - &
  • [40] Stress Classification in Speech based on Stress Levels
    Yao, Xiao
    Liu, Xiaofeng
    Jiang, Aiming
    Xu, Ning
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2015, : 146 - 147