Optimization of Dereverberation Parameters based on Likelihood of Speech Recognizer

Cited by: 0
Authors
Gomez, Randy [1]
Kawahara, Tatsuya [1]
Affiliation
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
Keywords
Dereverberation; Robust ASR
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Speech recognition under reverberant conditions is a difficult task. Most dereverberation techniques used to address this problem enhance the reverberant waveform independently of the speech recognizer. In this paper, we improve the conventional Spectral Subtraction-based (SS) dereverberation technique. In our proposed approach, the dereverberation parameters are optimized to improve the likelihood of the acoustic model. The system is capable of adaptively fine-tuning these parameters jointly with acoustic model training. Additional optimization is also performed during decoding of the test utterances. We evaluated the method on real reverberant data, and the experimental results show that it significantly improves recognition performance over the conventional approach.
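The idea of choosing a dereverberation parameter by acoustic-model likelihood can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the subtraction rule, the single diagonal Gaussian standing in for the acoustic model, the grid search, and all names (`spectral_subtraction`, `log_likelihood`, `optimize_alpha`, `alpha`, `delay`, `floor`) are assumptions made for illustration. Joint optimization with acoustic model training is not covered here.

```python
import math

def spectral_subtraction(power, alpha, delay=1, floor=0.05):
    """Subtract a scaled, delayed frame as a crude late-reverberation estimate.

    `alpha` is the hypothetical dereverberation parameter being tuned;
    `floor` keeps every output value strictly positive.
    """
    out = []
    for t, frame in enumerate(power):
        prev = power[t - delay] if t >= delay else [0.0] * len(frame)
        out.append([max(x - alpha * p, floor * x) for x, p in zip(frame, prev)])
    return out

def log_likelihood(power, mean, var):
    """Log-likelihood of log-power features under one diagonal Gaussian.

    The Gaussian is a toy stand-in for a real acoustic model.
    """
    total = 0.0
    for frame in power:
        for x, m, v in zip(frame, mean, var):
            z = math.log(x)
            total += -0.5 * (math.log(2 * math.pi * v) + (z - m) ** 2 / v)
    return total

def optimize_alpha(power, mean, var, grid):
    """Pick the grid value of alpha whose enhanced output is most likely."""
    return max(
        grid,
        key=lambda a: log_likelihood(spectral_subtraction(power, a), mean, var),
    )

# Toy data: "clean" power spectra (4 frames, 2 bins) and a reverberant
# version in which each frame leaks 60% of the previous clean frame.
clean = [[1.0, 2.0], [4.0, 1.0], [2.0, 3.0], [1.0, 1.0]]
reverberant = [
    [c + 0.6 * p for c, p in zip(frame, clean[t - 1] if t else [0.0] * len(frame))]
    for t, frame in enumerate(clean)
]

# Fit the toy Gaussian to the clean log-power features.
logs = [[math.log(x) for x in frame] for frame in clean]
n = len(logs)
mean = [sum(col) / n for col in zip(*logs)]
var = [sum((z - m) ** 2 for z in col) / n for col, m in zip(zip(*logs), mean)]

grid = [i / 10 for i in range(11)]  # candidate alpha values 0.0 .. 1.0
best_alpha = optimize_alpha(reverberant, mean, var, grid)
print("selected alpha:", best_alpha)
```

Because the grid includes `alpha = 0.0` (no subtraction), the selected value can never score a lower likelihood than leaving the signal unprocessed. A real system would replace the Gaussian with the recognizer's own acoustic model score.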
Pages: 1259-1262
Number of pages: 4
Related papers
50 in total
  • [31] Harmonicity based dereverberation for improving automatic speech recognition performance and speech intelligibility
    Kinoshita, K
    Nakatani, T
    Miyoshi, M
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1724 - 1731
  • [32] A model distance maximizing framework for speech recognizer-based speech enhancement
    BabaAli, Bagher
    Sameti, Hossein
    Falk, Tiago H.
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2011, 65 (02) : 99 - 106
  • [33] Constrained Multichannel Speech Dereverberation
    Yu, Meng
    Soong, Frank K.
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1936 - 1939
  • [34] Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition
    Wang, Ke
    Zhang, Junbo
    Sun, Sining
    Wang, Yujun
    Xiang, Fei
    Xie, Lei
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1581 - 1585
  • [35] Blind dereverberation of a speech signal
    V. A. Zverev
    [J]. Acoustical Physics, 2008, 54 : 261 - 268
  • [36] SPEECH DEREVERBERATION USING A LEARNED SPEECH MODEL
    Liang, Dawen
    Hoffman, Matthew D.
    Mysore, Gautham J.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1871 - 1875
  • [37] Blind dereverberation of a speech signal
    Zverev, V. A.
    [J]. ACOUSTICAL PHYSICS, 2008, 54 (02) : 261 - 268
  • [38] ESTIMATING ROOM ACOUSTIC PARAMETERS FOR SPEECH RECOGNIZER ADAPTATION AND COMBINATION IN REVERBERANT ENVIRONMENTS
    Xiong, Feifei
    Goetze, Stefan
    Meyer, Bernd T.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [39] Speech dereverberation of single channel
    Shen, Xi-Zhong
    Meng, Guang
    [J]. Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2010, 44 (02) : 229 - 233
  • [40] Speaker Independent Quranic Recognizer Based on Maximum Likelihood Linear Regression
    Mourtaga, Ehab
    Sharieh, Ahmad
    Abdallah, Mousa
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 20, 2007, 20 : 376 - +