Spoken Word Recognition Using MFCC and Learning Vector Quantization

被引:0
|
作者
Djamal, Esmeralda C. [1 ]
Nurhamidah, Neneng [1 ]
Ilyas, Ridwan [1 ]
机构
[1] Univ Jenderal Achmad Yani, Jurusan Informat, Jl Terusan Jenderal Sudirman, Cimahi, Indonesia
关键词
Spoken word Recognition; MFCC; LVQ; Histogram Equalization; voice command;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identification of spoken word(s) can be used to control external device. This research was result word identification in speech using Mel-Frequency Cepstrum Coefficients (MFCC) and Learning Vector Quantization (LVQ). The output of system operated the computer in certain genre song appropriate with the identified word. Identification was divided into three classes contain words such as "Klasik", "Dangdut" and "Pop", which are used to playing three types of accordingly songs. The voice signal is extracted by using MFCC and then identified using LVQ. The training and test set were obtained from six subjects and 10 times trial of the words "Klasik", "Dangdut" and "Pop" separately. Then the recorded sound signal is pre-processed using Histogram Equalization, DC Removal and Pre-emphasize to reduce noise from the sound signal, and then extracted using MFCC. The frequency spectrum generated from MFCC was identified using LVQ after passing through the training process first. Accuracy of the testing results is 92% for identification of training sets while testing new data recorded using different SNR obtained an accuracy of 46%. However, the test results of new data recorded using the same SNR with training data has an accuracy of 75.5%.
引用
收藏
页码:246 / 251
页数:6
相关论文
共 50 条
  • [1] Speaker Recognition using MFCC, shifted MFCC with Vector Quantization and Fuzzy
    Bansal, Priyanka
    Imam, Syed Akhtar
    Bharti, Roma
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFT COMPUTING TECHNIQUES AND IMPLEMENTATIONS (ICSCTI), 2015,
  • [2] Speaker recognition system using MFCC features and vector quantization
    Wang, Wei
    Deng, Huiwen
    [J]. Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2006, 27 (SUPPL.): : 2253 - 2255
  • [3] APPLICATIONS OF MFCC AND VECTOR QUANTIZATION IN SPEAKER RECOGNITION
    Gupta, Arnav
    Gupta, Harshit
    [J]. 2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING (ISSP), 2013, : 170 - 173
  • [4] Voice Identification Using MFCC and Vector Quantization
    Alkhatib, Bassel
    Eddin, Mohammad Madian Waleed Kamal
    [J]. BAGHDAD SCIENCE JOURNAL, 2020, 17 (03) : 1019 - 1028
  • [5] Amazigh Spoken Digit Recognition using a Deep Learning Approach based on MFCC
    Boulal, Hossam
    Hamidi, Mohamed
    Abarkan, Mustapha
    Barkani, Jamal
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 791 - 798
  • [6] MFCC and vector quantization for Arabic fricatives Speech/Speaker recognition
    Chelali, Fatma Zohra
    Djeradi, Amar
    [J]. 2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 284 - 289
  • [7] CONNECTED SPOKEN WORD RECOGNITION USING THE MARKOV MODEL FOR THE FEATURE VECTOR
    TAKARA, T
    YAKABU, T
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1788 - 1796
  • [8] Image recognition by using generalized learning vector quantization
    [J]. Sato, Atsushi, 1600, Japan Society for Precision Engineering (83):
  • [9] Facial Expression Recognition Using Learning Vector Quantization
    de Vries, Gert-Jan
    Pauws, Steffen
    Biehl, Michael
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 760 - 771
  • [10] Connected spoken word recognition using multistate Markov model for the feature vector
    Takara, T
    Higa, K
    Matayoshi, N
    [J]. SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (07) : 51 - 60