Dereverberation based on Wavelet Packet Filtering for Robust Automatic Speech Recognition

Cited by: 0

Authors:

Gomez, Randy [1]

Kawahara, Tatsuya [1]

Affiliations:

[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan

Source:

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012

Keywords:

Speech recognition; Robustness; Dereverberation; Wavelet Packets

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper describes a multiple-resolution signal analysis to suppress the late reflection of reverberation for robust automatic speech recognition (ASR). Wavelet packet tree (WPT) decomposition offers a finer resolution for discriminating the late reflection subspace from the speech subspace. By selecting appropriate wavelet bases in the WPT for speech and late reflection, we can effectively estimate the Wiener gain directly from the observed reverberant data. Moreover, the selection procedure is performed in accordance with the likelihood of the acoustic model used by the speech recognizer. Dereverberation is realized by filtering the wavelet packet coefficients with the Wiener gain to suppress the effects of the late reflection. Experimental evaluations with large vocabulary continuous speech recognition (LVCSR) in real reverberant conditions show that the proposed method outperforms conventional wavelet-based methods and other dereverberation techniques.
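
The filtering step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Haar wavelet, a full (unpruned) packet tree, one gain per packet node, and a late-reflection power estimate supplied by the caller. The abstract does not say how the bases are selected or how the late-reflection power is obtained, so all function and parameter names here (`dereverberate`, `late_power_per_node`, `floor`) are hypothetical.

```python
import math

_S = 1.0 / math.sqrt(2.0)  # Haar normalisation factor


def haar_split(x):
    """One Haar analysis step: return (approximation, detail) halves."""
    approx = [(x[i] + x[i + 1]) * _S for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) * _S for i in range(0, len(x), 2)]
    return approx, detail


def haar_merge(approx, detail):
    """Inverse of haar_split: interleave the reconstructed samples."""
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * _S)
        out.append((a - d) * _S)
    return out


def wpt_decompose(x, depth):
    """Full wavelet packet tree: every node is split at every level.

    Leaves are returned in natural (filter-path) order, which is not
    the same as frequency order.
    """
    nodes = [list(x)]
    for _ in range(depth):
        split = []
        for node in nodes:
            split.extend(haar_split(node))
        nodes = split
    return nodes


def wpt_reconstruct(nodes):
    """Merge sibling leaves pairwise until one signal remains."""
    while len(nodes) > 1:
        nodes = [haar_merge(nodes[i], nodes[i + 1])
                 for i in range(0, len(nodes), 2)]
    return nodes[0]


def wiener_gain(observed_power, late_power, floor=0.0):
    """Wiener-style gain S / (S + R), with S taken as max(observed - R, 0).

    Assumption: observed power is roughly speech power plus
    late-reflection power, so S + R is replaced by the observed power.
    """
    if observed_power <= 0.0:
        return floor
    speech_power = max(observed_power - late_power, 0.0)
    return max(speech_power / observed_power, floor)


def dereverberate(x, late_power_per_node, depth, floor=0.0):
    """Scale each packet node by its gain, then reconstruct.

    len(x) must be a multiple of 2**depth, and late_power_per_node
    (hypothetical input) must hold one value per leaf node.
    """
    nodes = wpt_decompose(x, depth)
    filtered = []
    for node, late_power in zip(nodes, late_power_per_node):
        observed_power = sum(c * c for c in node) / len(node)
        g = wiener_gain(observed_power, late_power, floor)
        filtered.append([g * c for c in node])
    return wpt_reconstruct(filtered)
```

With zero late-reflection power every gain is either 1 or applied to an all-zero node, so the signal is reconstructed unchanged; as the estimated late-reflection power grows, the corresponding packet coefficients are attenuated toward the floor.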

Pages: 1242-1245

Number of pages: 4
