Optimization of Dereverberation Parameters based on Likelihood of Speech Recognizer

被引：0

作者：

Gomez, Randy ^{[1
]}

Kawahara, Tatsuya ^{[1
]}

机构：

[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

Dereverberation; Robust ASR;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speech recognition under reverberant condition is a difficult task. Most dereverberation techniques used to address this problem enhance the reverberant waveform independent from that of the speech recognizer. In this paper, we improve the conventional Spectral Subtraction-based (SS) dereverberation technique. In our proposed approach, the dereverberation parameters are optimized to improve the likelihood of the acoustic model. The system is capable of adaptively fine-tuning these parameters jointly with acoustic model training. Additional optimization is also implemented during decoding of the test utterances. We have evaluated using real reverberant data and experimental results show that the proposed method significantly improves the recognition performance over the conventional approach.

引用

页码：1259 / 1262

页数：4

共 50 条

[31] Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility
Kinoshita, K
Nakatani, T
Miyoshi, M
[J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1724 - 1731
[32] A model distance maximizing framework for speech recognizer-based speech enhancement
BabaAli, Bagher
Sameti, Hossein
Falk, Tiago H.
[J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2011, 65 (02) : 99 - 106
[33] Constrained Multichannel Speech Dereverberation
Yu, Meng
Soong, Frank K.
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1936 - 1939
[34] Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition
Wang, Ke
Zhang, Junbo
Sun, Sining
Wang, Yujun
Xiang, Fei
Xie, Lei
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1581 - 1585
[35] Blind dereverberation of a speech signal
V. A. Zverev
[J]. Acoustical Physics, 2008, 54 : 261 - 268
[36] SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL
Liang, Dawen
Hoffman, Matthew D.
Mysore, Gautham J.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1871 - 1875
[37] Blind dereverberation of a speech signal
Zverev, V. A.
[J]. ACOUSTICAL PHYSICS, 2008, 54 (02) : 261 - 268
[38] ESTIMATING ROOM ACOUSTIC PARAMETERS FOR SPEECH RECOGNIZER ADAPTATION AND COMBINATION IN REVERBERANT ENVIRONMENTS
Xiong, Feifei
Goetze, Stefan
Meyer, Bernd T.
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[39] Speech dereverberation of single channel
Shen, Xi-Zhong
Meng, Guang
[J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2010, 44 (02): : 229 - 233
[40] Speaker Independent Quranic Recognizer Based on Maximum Likelihood Linear Regression
Mourtaga, Ehab
Sharieh, Ahmad
Abdallah, Mousa
[J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 20, 2007, 20 : 376 - +

← 1 2 3 4 5 →