Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method

Cited by: 27
Authors
Lokesh, S. [1]
Devi, M. Ramya [2]
Affiliations
[1] Hindusthan Inst Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] Hindusthan Coll Engn & Technol, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Keywords
Speech recognition; Feature extraction; Cepstral coefficient; Windowing; Framing; MFCC; BIG DATA-SECURITY; CLIMATE-CHANGE;
DOI
10.1007/s10586-017-1447-6
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Speech recognition systems are now used in many environments, including healthcare, robotics, vehicle control, and unmanned aerial vehicle systems. In recent years, many speech recognition systems have been developed to address issues in real-world applications. We propose a novel speech recognition system that uses an enhanced mel-frequency cepstral coefficient (MFCC) method with windowing and framing. The windowing and framing method is used to remove the Gaussian white noise present in the input speech signal. The de-noising block uses the nonnegative matrix factorization algorithm to factorize the mel-magnitude spectra of the noisy input audio signal. MFCCs are then used to extract the more important features present in the speech signal. Finally, the Laplace smoothing technique is used as the language model for recognizing the audio signals. The proposed MFCC-based system with windowing and framing is demonstrated in MATLAB. We compare it with wavelet-based and artificial-neural-network-based feature extraction methods for speech recognition. The experimental results show that the proposed MFCC system with windowing and framing performs well.
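The framing, windowing, and MFCC steps named in the abstract can be illustrated with a minimal sketch. This is a textbook MFCC pipeline in Python with NumPy, not the authors' enhanced method or their MATLAB code, and it omits the NMF de-noising block and the language model. The function names and every parameter value (25 ms frames, 10 ms hop, Hamming window, 512-point FFT, 26 mel filters, 13 coefficients) are assumptions chosen for illustration.

```python
import numpy as np

def frame_signal(signal, frame_len, hop_len):
    """Split a 1-D signal into overlapping frames, zero-padding the tail."""
    n_frames = 1 + max(0, int(np.ceil((len(signal) - frame_len) / hop_len)))
    padded_len = (n_frames - 1) * hop_len + frame_len
    padded = np.pad(signal, (0, padded_len - len(signal)))
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return padded[idx]

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):       # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):      # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def dct_ii(x, n_out):
    """Unnormalized DCT-II along the last axis, keeping the first n_out terms."""
    n = x.shape[-1]
    k = np.arange(n_out)[:, None]
    basis = np.cos(np.pi * k * (2 * np.arange(n)[None, :] + 1) / (2 * n))
    return x @ basis.T

def mfcc(signal, sample_rate=16000, frame_ms=25, hop_ms=10,
         n_fft=512, n_filters=26, n_ceps=13):
    """Assumed parameter values; returns an array of shape (n_frames, n_ceps)."""
    frame_len = int(sample_rate * frame_ms / 1000)
    hop_len = int(sample_rate * hop_ms / 1000)
    # Framing followed by a Hamming window on each frame
    frames = frame_signal(signal, frame_len, hop_len) * np.hamming(frame_len)
    # Power spectrum of each windowed frame
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # Mel filterbank energies, log compression, then DCT to get cepstra
    mel_energy = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_mel = np.log(mel_energy + 1e-10)
    return dct_ii(log_mel, n_ceps)
```

For example, calling `mfcc` on one second of audio sampled at 16 kHz yields one 13-dimensional feature vector per 10 ms hop. A de-noising stage such as the one described in the abstract would operate on the mel-magnitude spectra before the log and DCT steps.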
Pages: 11669-11679
Page count: 11