Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method

被引:27
|
作者
Lokesh, S. [1 ]
Devi, M. Ramya [2 ]
机构
[1] Hindusthan Inst Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] Hindusthan Coll Engn & Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
关键词
Speech recognition; Feature extraction; Cepstral coefficient; Windowing; Framing; MFCC; BIG DATA-SECURITY; CLIMATE-CHANGE;
D O I
10.1007/s10586-017-1447-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, speech recognition systems are used in various environments, namely, healthcare, robotics, vehicle control and unmanned aerial vehicle system. In recent years, many speech recognition systems have been developed to solve various issues in real world applications. We have proposed a novel speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method. Windowing and framing method is used to remove the Gaussian white noise present in the input speech signal. The de-noising block effectively uses the nonnegative matrix factorization algorithm for factorizing the Mel-magnitude spectra of noisy input audio signal. Moreover, the mel-frequency cepstral coefficients (MFCC) is used for finding the more important features exist in the speech signal. Finally, Laplace smoothing technique is used as the language model for recognizing the audio signals. MATLAB software is used for demonstrating the proposed Mel frequency cepstral coefficient with Windowing and Framing based speech recognition system. We have compared the proposed speech recognition system with wavelet based feature extraction and artificial neural network based feature extraction methods for speech recognition. The experimental results proved the good performance of the proposed Mel frequency cepstral coefficient with windowing and framing based speech recognition system.
引用
收藏
页码:11669 / 11679
页数:11
相关论文
共 50 条
  • [31] Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech
    Raikar, Aditya
    Gandhi, Ami
    Patil, Hemant A.
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 405 - 413
  • [32] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
    Sheu, Jia-Shing
    Chen, Ching-Wen
    SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
  • [33] Voice pattern recognition using Mel-Frequency Cepstral Coefficient and Hidden Markov Model for bahasa Madura
    Ubaidi, U.
    Dewi, N. P.
    ANNUAL CONFERENCE OF SCIENCE AND TECHNOLOGY, 2019, 1375
  • [34] Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN
    Kumaran, U.
    Radha Rammohan, S.
    Nagarajan, Senthil Murugan
    Prathik, A.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 303 - 314
  • [35] Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN
    Kumaran, U.
    Radha Rammohan, S.
    Nagarajan, Senthil Murugan
    Prathik, A.
    International Journal of Speech Technology, 2021, 24 (02): : 303 - 314
  • [36] Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN
    U. Kumaran
    S. Radha Rammohan
    Senthil Murugan Nagarajan
    A. Prathik
    International Journal of Speech Technology, 2021, 24 : 303 - 314
  • [37] An Optimized Scheme of Mel Frequency Cepstral Coefficient for Multi-sensor Sign Language Recognition
    Wang, Nana
    Ma, Zhiyuan
    Tang, Yichen
    Liu, Yi
    Li, Ying
    Niu, Jianwei
    SMART COMPUTING AND COMMUNICATION, SMARTCOM 2016, 2017, 10135 : 224 - 235
  • [38] Recognize basic emotional statesin speech by machine learning techniques using mel-frequency cepstral coefficient features
    Yang, Ningning
    Dey, Nilanjan
    Sherratt, R. Simon
    Shi, Fuqian
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 1925 - 1936
  • [39] Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
    Chazan, D
    Hoory, R
    Cohen, G
    Zibulski, M
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1299 - 1302
  • [40] A comparative between Mel Frequency Cepstral Coefficients (MFCC) and Inverse Mel Frequency Cepstral Coefficients (IMFCC) features for an Automatic Bird Species Recognition System
    Pedroza Ramirez, Angel David
    de la Rosa Vargas, Jose Ismael
    Rosas Valdez, Rogelio
    Becerra, Aldonso
    2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,