Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

Cited by: 0
Authors
Gomez, Randy [1 ]
Kawahara, Tatsuya [1 ]
Affiliations
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
Keywords
Speech recognition; Robustness; Dereverberation; Wavelet Packets
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a multiple-resolution signal analysis that suppresses the late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution for discriminating the late reflection subspace from the speech subspace. By selecting appropriate wavelet bases in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of the acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.
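The filtering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Haar wavelet, a fixed-depth full packet tree, one gain per subband, and a late-reflection power per subband that is simply passed in (the paper instead estimates the gain from the observed data through likelihood-guided basis selection, which is not reproduced here). All function names, the `floor` parameter, and the `late_power_per_band` input are hypothetical.

```python
import math

def haar_split(x):
    """One orthonormal Haar analysis step: (approximation, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail

def haar_merge(approx, detail):
    """Inverse of haar_split."""
    s = math.sqrt(2.0)
    x = []
    for a, d in zip(approx, detail):
        x.append((a + d) / s)
        x.append((a - d) / s)
    return x

def wpt_decompose(x, depth):
    """Full wavelet packet tree: split every node at every level.
    Returns the 2**depth leaf subbands in natural (not frequency) order.
    len(x) must be divisible by 2**depth."""
    nodes = [list(x)]
    for _ in range(depth):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_split(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return nodes

def wpt_reconstruct(leaves):
    """Merge sibling leaves pairwise until one signal remains."""
    nodes = leaves
    while len(nodes) > 1:
        nodes = [haar_merge(nodes[i], nodes[i + 1])
                 for i in range(0, len(nodes), 2)]
    return nodes[0]

def wiener_gain(observed_power, late_power, floor=0.05):
    """Wiener-style gain: (observed - late) / observed, with a lower floor.
    The floor value is an assumption made for this sketch."""
    if observed_power <= 0.0:
        return floor
    return max(floor, (observed_power - late_power) / observed_power)

def dereverberate(x, late_power_per_band, depth=3):
    """Scale each packet subband by its gain, then reconstruct.
    late_power_per_band is a hypothetical, externally supplied estimate."""
    leaves = wpt_decompose(x, depth)
    filtered = []
    for band, late_power in zip(leaves, late_power_per_band):
        observed_power = sum(c * c for c in band) / len(band)
        gain = wiener_gain(observed_power, late_power)
        filtered.append([gain * c for c in band])
    return wpt_reconstruct(filtered)
```

With a late-reflection power of zero in every subband, each gain is 1 and the signal is reconstructed unchanged; larger late-reflection estimates attenuate the corresponding subbands down to the floor.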
Pages: 1242-1245
Page count: 4
Related papers
50 in total
  • [41] Morphological filtering of spectrograms for automatic speech recognition
    Liu, WM
    Bastante, VJR
    Rodriguez, FR
    Evans, NWD
    Mason, JSD
    Proceedings of the Fourth IASTED International Conference on Visualization, Imaging, and Image Processing, 2004, : 546 - 549
  • [42] Spectrum filtering with FRM for robust speech recognition
    Hayasaka, Noboru
    Miyanaga, Yoshikazu
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 3285 - +
  • [43] Matched filtering approach to robust speech recognition
    Avadhanulu, J.V.
    Sreenivas, T.V.
    Journal of the Indian Institute of Science, 79 (03): : 185 - 196
  • [44] Binaural speech enhancement system combining dereverberation and spatial masking-based noise removal for robust speech recognition
    Tien Dung Tran
    Dang Khoa Nguyen
    Quoc Cuong Nguyen
    Huu Binh Nguyen
    2012 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2012, : 345 - 350
  • [45] Wavelet-based denoising for robust feature extraction for speech recognition
    Farooq, O
    Datta, S
    ELECTRONICS LETTERS, 2003, 39 (01) : 163 - 165
  • [46] Non-linear Dynamics Characterization from Wavelet Packet Transform for Automatic Recognition of Emotional Speech
    Vasquez-Correa, J. C.
    Orozco-Arroyave, J. R.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    Noth, Elmar
    RECENT ADVANCES IN NONLINEAR SPEECH PROCESSING, 2016, 48 : 199 - 207
  • [47] Wavelet based speech recognition
    Gamulkiewicz, B
    Weeks, M
    PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 678 - 681
  • [48] Feature-based Noise Robust Speech Recognition on an Indonesian Language Automatic Speech Recognition System
    Satriawan, Cil Hardianto
    Lestari, Dessi Puji
    2014 International Conference on Electrical Engineering and Computer Science (ICEECS), 2014, : 42 - 46
  • [49] A Comparision of Multiclass SVM and HMM Classifier for Wavelet Front End Robust Automatic Speech Recognition
    Rajeswari
    Prasad, N. N. S. S. R. K.
    Sathyanarayana, V
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [50] Speech recognition using a wavelet packet adaptive network based fuzzy inference system
    Avci, Engin
    Akpolat, Zuhtu Hakan
    EXPERT SYSTEMS WITH APPLICATIONS, 2006, 31 (03) : 495 - 503