Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method

被引:27
|
作者
Lokesh, S. [1 ]
Devi, M. Ramya [2 ]
机构
[1] Hindusthan Inst Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] Hindusthan Coll Engn & Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
关键词
Speech recognition; Feature extraction; Cepstral coefficient; Windowing; Framing; MFCC; BIG DATA-SECURITY; CLIMATE-CHANGE;
D O I
10.1007/s10586-017-1447-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, speech recognition systems are used in various environments, namely, healthcare, robotics, vehicle control and unmanned aerial vehicle system. In recent years, many speech recognition systems have been developed to solve various issues in real world applications. We have proposed a novel speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method. Windowing and framing method is used to remove the Gaussian white noise present in the input speech signal. The de-noising block effectively uses the nonnegative matrix factorization algorithm for factorizing the Mel-magnitude spectra of noisy input audio signal. Moreover, the mel-frequency cepstral coefficients (MFCC) is used for finding the more important features exist in the speech signal. Finally, Laplace smoothing technique is used as the language model for recognizing the audio signals. MATLAB software is used for demonstrating the proposed Mel frequency cepstral coefficient with Windowing and Framing based speech recognition system. We have compared the proposed speech recognition system with wavelet based feature extraction and artificial neural network based feature extraction methods for speech recognition. The experimental results proved the good performance of the proposed Mel frequency cepstral coefficient with windowing and framing based speech recognition system.
引用
收藏
页码:11669 / 11679
页数:11
相关论文
共 50 条
  • [1] Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method
    S. Lokesh
    M. Ramya Devi
    Cluster Computing, 2019, 22 : 11669 - 11679
  • [2] Mel-Frequency Cepstral Coefficient Analysis in Speech Recognition
    On, Chin Kim
    Pandiyan, Paulraj M.
    Yaacob, Sazali
    Saudi, Azali
    2006 INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS (ICOCI 2006), 2006, : 291 - +
  • [3] Speech Emotion Recognition using Mel Frequency Cepstral Coefficient and SVM Classifier
    Fernandes, V.
    Mascarehnas, L.
    Mendonca, C.
    Johnson, A.
    Mishra, R.
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART), 2018, : 200 - 204
  • [4] Using Mel Frequency Cepstral Coefficient Method for Online Arabic Characters Handwriting Recognition
    Bougamouza, Fateh
    Hazmoune, Samira
    Benmohammed, Mohammed
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2016, : 87 - 92
  • [5] Speaker Recognition Using Mel Frequency Cepstral Coefficient and Locality Sensitive Hashing
    Awais, Ahmed
    Kun, She
    Yu, Yue
    Hayat, Shaukat
    Ahmed, Aftab
    Tu, Tianyi
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 271 - 276
  • [6] EXTRACTION OF SPEECH SIGNAL BASED ON POWER NORMALIZED CEPSTRAL COEFFICIENT AND MEL FREQUENCY CEPSTRAL COEFFICIENT: A COMPARISON
    Bharathi
    Ponraj, Narain
    Mercy, Merlin
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016, : 1843 - 1846
  • [7] Chip design of mel frequency cepstral coefficients for speech recognition
    Wang, JC
    Wang, JF
    Weng, YS
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3658 - 3661
  • [8] Analysis of Asthma By Using Mel Frequency Cepstral Coefficient
    Dighore, V. D.
    Thool, V. R.
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 976 - 980
  • [9] Speech to Text for Indonesian Homophone Phrase with Mel Frequency Cepstral Coefficient
    Bustamin, Anugrayani
    Indrabayu
    Areni, Intan Sari
    Mokobombang, Novy Nra
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS, 2016, : 29 - +
  • [10] Robust Speech Recognition Using Pereptual Wavelet Denoising and Mel-frequency Product Spectrum Cepstral Coefficient Features
    Korba, Mohamed Cherif Amara
    Messadeg, Djemil
    Djemili, Rafik
    Bourouba, Hocine
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2008, 32 (03): : 283 - 288