Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

被引:0
|
作者
Gomez, Randy [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
关键词
Speech recognition; Robustness; Dereverberation; Wavelet Packets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a multiple-resolution signal analysis to suppress late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution to discriminate the late reflection subspace from the speech subspace. By selecting appropriate wavelet basis in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.
引用
收藏
页码:1242 / 1245
页数:4
相关论文
共 50 条
  • [21] Speech Emotion Recognition Based on Wavelet Packet Coefficient Model
    Wang, Kunxia
    An, Ning
    Li, Lian
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 478 - 482
  • [22] A Fast Convolutional Self-attention Based Speech Dereverberation Method for Robust Speech Recognition
    Li, Nan
    Ge, Meng
    Wang, Longbiao
    Dang, Jianwu
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 295 - 305
  • [23] Automatic Speech Recognition System Based on Wavelet Analysis
    Ziolko, Mariusz
    Galka, Jakub
    Ziolko, Bartosz
    Jadczyk, Tomasz
    Skurzok, Dawid
    Wicijowski, Jan
    2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 450 - 451
  • [24] A robust Iterative Inverse Filtering approach for Speech Dereverberation in presence of Disturbances
    Rotili, Rudy
    Cifani, Simone
    Principi, Emanuele
    Squartini, Stefano
    Piazza, Francesco
    2008 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2008), VOLS 1-4, 2008, : 434 - 437
  • [25] Two-Microphone Dereverberation for Automatic Speech Recognition of Polish
    Kundegorski, Mikolaj
    Jackson, Philip J. B.
    Ziolko, Bartosz
    ARCHIVES OF ACOUSTICS, 2014, 39 (03) : 411 - 420
  • [26] Robust Feature Extracting of Speech Signal Based on Wavelet Packet Transform
    Han Zhiyan
    Wang Jian
    Lun Shuxian
    Wang Xu
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2832 - 2837
  • [27] A means based on wiener filtering for dereverberation in speech communication
    Zhang, De-Hui
    Chen, Guang-Ye
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2009, 43 (06): : 949 - 952
  • [28] Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition
    Nehe, N. S.
    Holambe, R. S.
    IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2008, : 175 - 179
  • [29] Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
    Gomez, Randy
    Kawahara, Tatsuya
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1708 - 1716
  • [30] Speech Emotion Recognition Based on Coiflet Wavelet Packet Cepstral Coefficients
    Huang, Yongming
    Wu, Ao
    Zhang, Guobao
    Li, Yue
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 436 - 443