Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

被引:0
|
作者
Gomez, Randy [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
关键词
Speech recognition; Robustness; Dereverberation; Wavelet Packets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a multiple-resolution signal analysis to suppress late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution to discriminate the late reflection subspace from the speech subspace. By selecting appropriate wavelet basis in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.
引用
收藏
页码:1242 / 1245
页数:4
相关论文
共 50 条
  • [1] An Improved Wavelet-based Dereverberation for Robust Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 578 - 581
  • [2] Perceptual wavelet filtering for robust speech recognition
    Van Pham, Tuan
    Stark, Michael
    Kubin, Gernot
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4385 - 4388
  • [3] Frequency and wavelet filtering for robust speech recognition
    Deviren, M
    Daoudi, K
    [J]. ARTIFICAIL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICAN/ICONIP 2003, 2003, 2714 : 452 - 460
  • [4] Noise suppression based on wavelet packet decomposition and quantile noise estimation for robust automatic speech recognition
    Rank, Erhard
    Van Pham, Tuan
    Kubin, Gernot
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 477 - 480
  • [5] Robust Automatic Speech Recognition Features using Complex Wavelet Packet Transform Coefficients
    Sen, Tjong Wan
    Trilaksono, Bambang Riyanto
    Arman, Arry Akhmad
    Mandala, Rila
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2009, 3 (02) : 123 - 134
  • [6] New wavelet packet model for automatic speech recognition system
    Karam, JR
    Phillips, WJ
    Robertson, W
    Artimy, MM
    [J]. CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING 2001, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 511 - 514
  • [7] Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility
    Kinoshita, K
    Nakatani, T
    Miyoshi, M
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1724 - 1731
  • [8] Denoising Using Optimized Wavelet Filtering for Automatic Speech Recognition
    Gomez, Randy
    Kawahara, Tatsuya
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1684 - 1687
  • [9] Adaptive Multichannel Dereverberation for Automatic Speech Recognition
    Caroselli, Joe
    Shafran, Izhak
    Narayanan, Arun
    Rose, Richard
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3877 - 3881
  • [10] Robust Automatic Speech Recognition Using Wavelet-Based Adaptive Wavelet Thresholding: A Review
    Shanthamallappa M.
    Puttegowda K.
    Hullahalli Nannappa N.K.
    Vasudeva Rao S.K.
    [J]. SN Computer Science, 5 (2)