Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method

Cited by: 27
Authors
Lokesh, S. [1]
Devi, M. Ramya [2]
Affiliations
[1] Hindusthan Inst Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] Hindusthan Coll Engn & Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Keywords
Speech recognition; Feature extraction; Cepstral coefficient; Windowing; Framing; MFCC; BIG DATA-SECURITY; CLIMATE-CHANGE
DOI
10.1007/s10586-017-1447-6
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Speech recognition systems are now used in many environments, including healthcare, robotics, vehicle control, and unmanned aerial vehicle systems. In recent years, many speech recognition systems have been developed to address problems in real-world applications. We propose a novel speech recognition system that uses enhanced mel-frequency cepstral coefficients (MFCC) with a windowing and framing method. The windowing and framing method is used to remove the Gaussian white noise present in the input speech signal. The de-noising block uses the nonnegative matrix factorization algorithm to factorize the mel-magnitude spectra of the noisy input audio signal. MFCCs are then used to find the more important features present in the speech signal. Finally, the Laplace smoothing technique is used as the language model for recognizing the audio signals. The proposed MFCC with windowing and framing based speech recognition system is demonstrated in MATLAB. We compare the proposed system with wavelet-based and artificial neural network-based feature extraction methods for speech recognition. The experimental results show that the proposed MFCC with windowing and framing based speech recognition system performs well.
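The abstract names framing, windowing, the mel scale, and Laplace smoothing without giving parameters. The following is a minimal Python sketch of those four building blocks, not the authors' MATLAB implementation. The 25 ms frame length, 10 ms hop, Hamming window, the common mel formula 2595 * log10(1 + f / 700), and all function names are assumptions chosen for illustration.

```python
import math


def frame_signal(samples, frame_len, hop_len):
    """Split samples into overlapping frames; a trailing partial frame is dropped."""
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop_len):
        frames.append(samples[start:start + frame_len])
    return frames


def hamming_window(n):
    """Return an n-point symmetric Hamming window (assumed window type)."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def apply_window(frame, window):
    """Multiply a frame by a window, sample by sample."""
    return [s * w for s, w in zip(frame, window)]


def hz_to_mel(f_hz):
    """Convert frequency in Hz to mels using one widely used formula (assumed)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def laplace_prob(count, total, vocab_size, alpha=1.0):
    """Additive (Laplace) smoothed probability estimate for one event."""
    return (count + alpha) / (total + alpha * vocab_size)


# Illustrative usage on a synthetic 440 Hz tone, one second at 16 kHz.
sample_rate = 16000
frame_len = sample_rate * 25 // 1000   # 25 ms frames (assumed)
hop_len = sample_rate * 10 // 1000     # 10 ms hop (assumed)
signal = [math.sin(2.0 * math.pi * 440.0 * t / sample_rate) for t in range(sample_rate)]

window = hamming_window(frame_len)
windowed = [apply_window(f, window) for f in frame_signal(signal, frame_len, hop_len)]
print(len(windowed), "frames of", frame_len, "samples")
```

In a full MFCC pipeline, each windowed frame would next pass through a Fourier transform, a mel filterbank, a logarithm, and a discrete cosine transform; those steps are omitted here to keep the sketch short.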
Pages: 11669-11679
Page count: 11
Related papers
50 in total
  • [21] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
  • [22] Combining Mel Frequency Cepstral Coefficients and Fractal Dimensions for Automatic Speech Recognition
    Ezeiza, Aitzol
    Lopez de Ipina, Karmele
    Hernandez, Carmen
    Barroso, Nora
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2011, 7015 : 183 - +
  • [23] Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech
    Nour-Eldin, Amr H.
    Kabal, Peter
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 53 - 56
  • [24] Improved speech emotion recognition with Mel frequency magnitude coefficient
    Ancilin, J.
    Milton, A.
    APPLIED ACOUSTICS, 2021, 179
  • [25] UNDERSTANDING SARCASM IN SPEECH USING MEL-FREQUENCY CEPSTRAL COEFFICENT
    Mathur, Abhinav
    Saxena, Vikas
    Singh, Sandeep K.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 728 - 732
  • [26] Mel Frequency Cepstral Coefficient and its Applications: A Review
    Abdul, Zrar Kh.
    Al-Talabani, Abdulbasit K. K.
    IEEE ACCESS, 2022, 10 : 122136 - 122158
  • [27] The application of fractional Mel cepstral coefficient in deceptive speech detection
    Pan, Xinyu
    Zhao, Heming
    Zhou, Yan
    PEERJ, 2015, 3
  • [28] A New Approach for Toe Recognition Using Mel Frequency Cepstral Coefficients
    Nisar, Shibli
    Ashraf, Muhammad Wasim
    2016 13TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2016, : 291 - 294
  • [29] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Zhiyou Yang
    Ying Huang
    Evolutionary Intelligence, 2022, 15 : 2485 - 2494
  • [30] Algorithm for speech emotion recognition classification based on Mel-frequency Cepstral coefficients and broad learning system
    Yang, Zhiyou
    Huang, Ying
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2485 - 2494