Noise Robust Speech Recognition System using Mel Cepstral and Genetic Algorithm

Cited by: 0
Authors
Mamta, Garg [1]
Shatru, Arora Ajat [2]
Savita, Gupta [3]
Affiliations
[1] St Longowal Inst Engn & Technol, Dept Comp Sci & Engn, Longowal, India
[2] St Longowal Inst Engn & Technol, Dept Elect & Instrumentat Engn, Longowal, India
[3] Panjab Univ, Univ Inst Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
Keywords
MFCC; Speech Recognition; Genetic Algorithm; FAR; FRR; Accuracy
DOI
Not available
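Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809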
Abstract
This paper proposes a technique based on MFCC analysis of audio signals, applied to speech classification. Feature extraction uses multi-resolution (wavelet) analysis together with spectral analysis. The approach combines a number of features, including Mel Frequency Cepstral Coefficients (MFCCs) and FFT coefficients, with wavelet-based features. The authors claim an accuracy of 99% with the proposed method.
Pages: 3151-3155
Page count: 5