Mel-wiener filter for Mel-LPC based speech recognition

被引:1
|
作者
Islam, Md. Babul [1 ]
Yamamoto, Kazumasa
Matsumoto, Hiroshi
机构
[1] Shinshu Univ, Grad Sch Sci & Technol, Nagano 3800921, Japan
[2] Shinshu Univ, Fac Engn, Nagano 3800921, Japan
[3] Islam Univ, Dept Comp Sci & Engn, Kushtia, Bangladesh
[4] Shinshu Univ, Dept Elect & Elect Engn, Nagano, Japan
[5] Tohoku Univ, Dept Elect Engn, Sendai, Miyagi 980, Japan
来源
关键词
noisy speech recognition; Mel-Wiener filter; Mel-LPC analysis; bilinear transformation; Aurora; 2; database;
D O I
10.1093/ietisy/e90-d.6.935
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a Mel-Wiener filter to enhance Mel-LPC spectra in the presence of additive noise. The transfer function of the proposed filter is defined by using a first-order all-pass filter instead of unit delay. The filter coefficients are estimated based on minimization of the sum of the square error on the linear frequency scale without applying the bilinear transformation and efficiently implemented in the autocorrelation domain. The proposed filter does not require any time-frequency conversion, which saves a large amount of computational load. The performance of the proposed system is comparable to that of ETSI AFE. The optimum filter order is found to be 3, and thus filtering is computationally inexpensive. The computational cost of the proposed system except VAD is 53% of ETSI AFE.
引用
收藏
页码:935 / 942
页数:8
相关论文
共 50 条
  • [1] An Improved Mel-Wiener Filter for Mel-LPC based Speech Recognition
    Islam, Md. Babul
    Matsumoto, Hiroshi
    Yamamoto, Kazumasa
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 45 - 48
  • [2] Evaluation of MEL-LPC cepstrum in a large vocabulary continuous speech recognition
    Matsumoto, H
    Moroto, M
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 117 - 120
  • [3] GMM-Based two-stage mel-warped Wiener filter for robust speech recognition
    Lei, Jianjun
    Guo, Jun
    Liu, Gang
    Wang, Jian
    Shen, Halfeng
    Nie, Xiangfei
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 827 - 830
  • [4] Recognition of Subsampled Speech Using a Modified Mel Filter Bank
    Bhuvanagiri, Kiran Kumar
    Kopparapu, Sunil Kumar
    [J]. ADVANCES IN COMPUTING AND COMMUNICATIONS, PT 4, 2011, 193 : 293 - 299
  • [5] Recognition of subsampled speech using a modified Mel filter bank
    Kopparapu, Sunil Kumar
    Bhuvanagiri, Kiran Kumar
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (02) : 655 - 662
  • [6] Robust speech recognition by selecting mel-filter banks
    Wu, Yun-Peng
    Mao, Jia-Min
    Li, Wei-Feng
    [J]. PROCEEDINGS OF THE 2ND ANNUAL INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND INFORMATION SCIENCE (EEEIS 2016), 2016, 117 : 407 - 416
  • [7] The Improvement and Implementation of Speech Enhancement Based on Mel frequency Wiener Filtering
    Fan Binwen
    Wang Yongjun
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1814 - 1818
  • [8] Speech Recognition-Based Automated Visual Acuity Testing with Adaptive Mel Filter Bank
    Nisar, Shibli
    Khan, Muhammad Asghar
    Algarni, Fahad
    Wakeel, Abdul
    Uddin, M. Irfan
    Ullah, Insaf
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 2991 - 3004
  • [9] Mel scaled M-band wavelet filter bank for speech recognition
    Upadhyaya P.
    Farooq O.
    Abidi M.R.
    [J]. International Journal of Speech Technology, 2018, 21 (4) : 797 - 807
  • [10] Mel filter-like admissible wavelet packet structure for speech recognition
    Farooq, O
    Datta, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) : 196 - 198