Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

被引：0

作者：

Gomez, Randy ^{[1
]}

Kawahara, Tatsuya ^{[1
]}

机构：

[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

Speech recognition; Robustness; Dereverberation; Wavelet Packets;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a multiple-resolution signal analysis to suppress late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution to discriminate the late reflection subspace from the speech subspace. By selecting appropriate wavelet basis in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.

引用

页码：1242 / 1245

页数：4

共 50 条

[31] Study on the dereverberation of speech based on temporal envelope filtering
Avendano, C
Hermansky, H
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 889 - 892
[32] Robust features for speech recognition based on admissible wavelet packets
Farooq, O
Datta, S
ELECTRONICS LETTERS, 2001, 37 (25) : 1554 - 1556
[33] Speech feature extraction based on wavelet modulation scale for robust speech recognition
Ma, Xin
Zhou, Weidong
Ju, Fang
Jiang, Qi
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 499 - 505
[34] Robust Speech Dereverberation Based on WPE and Deep Learning
Li, Hao
Zhang, Xueliang
Gao, Guanglai
2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 52 - 56
[35] A Robust Speech Endpoint Detection Algorithm Based on Wavelet Packet and Energy Entropy
Zhang, Ting
Huang, Hua
He, Ling
Lech, Margaret
2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1050 - 1054
[36] Guided spectrogram filtering for speech dereverberation
Zheng, Chengshi
Tan, Zheng-Hua
Peng, Renhua
Li, Xiaodong
APPLIED ACOUSTICS, 2018, 134 : 154 - 159
[37] Speech Recognition Based on Wavelet Packet Transform and K-L Expansion
Wang, Xu
Han, Zhiyan
Wang, Han
Ma, Yujuan
2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2490 - 2493
[38] MODEL-BASED DEREVERBERATION IN THE LOGMELSPEC DOMAIN FOR ROBUST DISTANT-TALKING SPEECH RECOGNITION
Sehr, Armin
Maas, Roland
Kellermann, Walter
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4298 - 4301
[39] Speech Emotion Recognition Based on LSTM and Mel Scale Wavelet Packet Decomposition
Feng, Tian
Yang, Shuying
2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
[40] ROBUST EXCITATION-BASED FEATURES FOR AUTOMATIC SPEECH RECOGNITION
Drugman, Thomas
Stylianou, Yannis
Chen, Langzhou
Chen, Xie
Gales, Mark J. F.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4664 - 4668

← 1 2 3 4 5 →