Mel-wiener filter for Mel-LPC based speech recognition

被引:1
|
作者
Islam, Md. Babul [1 ]
Yamamoto, Kazumasa
Matsumoto, Hiroshi
机构
[1] Shinshu Univ, Grad Sch Sci & Technol, Nagano 3800921, Japan
[2] Shinshu Univ, Fac Engn, Nagano 3800921, Japan
[3] Islam Univ, Dept Comp Sci & Engn, Kushtia, Bangladesh
[4] Shinshu Univ, Dept Elect & Elect Engn, Nagano, Japan
[5] Tohoku Univ, Dept Elect Engn, Sendai, Miyagi 980, Japan
来源
关键词
noisy speech recognition; Mel-Wiener filter; Mel-LPC analysis; bilinear transformation; Aurora; 2; database;
D O I
10.1093/ietisy/e90-d.6.935
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a Mel-Wiener filter to enhance Mel-LPC spectra in the presence of additive noise. The transfer function of the proposed filter is defined by using a first-order all-pass filter instead of unit delay. The filter coefficients are estimated based on minimization of the sum of the square error on the linear frequency scale without applying the bilinear transformation and efficiently implemented in the autocorrelation domain. The proposed filter does not require any time-frequency conversion, which saves a large amount of computational load. The performance of the proposed system is comparable to that of ETSI AFE. The optimum filter order is found to be 3, and thus filtering is computationally inexpensive. The computational cost of the proposed system except VAD is 53% of ETSI AFE.
引用
收藏
页码:935 / 942
页数:8
相关论文
共 50 条
  • [21] Mel-scaled discrete wavelet coefficients for speech recognition
    Gowdy, JN
    Tufekci, Z
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1351 - 1354
  • [22] Mel-Frequency Cepstral Coefficient Analysis in Speech Recognition
    On, Chin Kim
    Pandiyan, Paulraj M.
    Yaacob, Sazali
    Saudi, Azali
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS (ICOCI 2006), 2006, : 291 - +
  • [23] Amazigh CNN speech recognition system based on Mel spectrogram feature extraction method
    Boulal H.
    Hamidi M.
    Abarkan M.
    Barkani J.
    [J]. International Journal of Speech Technology, 2024, 27 (01) : 287 - 296
  • [24] Mel Frequency Cepstral Coefficients (MFCC) Based Speaker Identification in Noisy Environment Using Wiener Filter
    Chauhan, Paresh M.
    Desai, Nikita P.
    [J]. 2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014,
  • [25] Investigation into a Mel subspace based front-end processing for robust speech recognition
    Selouani, SA
    O'Shaughnessy, D
    [J]. Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 187 - 190
  • [26] Automatic speech recognition based on cepstral coefficients and a Mel-based discrete energy operator
    Tolba, H
    O'Shaughnessy, D
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 973 - 976
  • [27] Role of Linear, Mel and Inverse-Mel Filterbanks in Automatic Recognition of Speech from High-Pitched Speakers
    Kathania, Hemant Kumar
    Shahnawazuddin, S.
    Ahmad, Waquar
    Adiga, Nagaraj
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (10) : 4667 - 4682
  • [28] Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition
    He, XD
    Zhao, YX
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 337 - 340
  • [29] Role of Linear, Mel and Inverse-Mel Filterbanks in Automatic Recognition of Speech from High-Pitched Speakers
    Hemant Kumar Kathania
    S. Shahnawazuddin
    Waquar Ahmad
    Nagaraj Adiga
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 4667 - 4682
  • [30] Mel Sub-Band Filtering and Compression for Robust Speech Recognition
    Nasersharif, Babak
    Akbari, Ahmad
    Homayounpour, Mohammad Mehdi
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 105 - +