Optimization of Dereverberation Parameters based on Likelihood of Speech Recognizer

Cited by: 0
Authors
Gomez, Randy [1]
Kawahara, Tatsuya [1]
Affiliations
[1] Kyoto Univ, ACCMS, Sakyo Ku, Kyoto 6068501, Japan
Keywords
Dereverberation; Robust ASR
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Speech recognition under reverberant conditions is a difficult task. Most dereverberation techniques used to address this problem enhance the reverberant waveform independently of the speech recognizer. In this paper, we improve the conventional spectral subtraction (SS)-based dereverberation technique. In our proposed approach, the dereverberation parameters are optimized to improve the likelihood of the acoustic model. The system can adaptively fine-tune these parameters jointly with acoustic model training. Additional optimization is performed during decoding of the test utterances. We evaluated the method on real reverberant data, and the experimental results show that it significantly improves recognition performance over the conventional approach.
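The idea of choosing a dereverberation parameter by recognizer likelihood can be illustrated with a small sketch. This is not the authors' implementation. It assumes a single subtraction weight `alpha`, a fixed frame `delay`, a spectral `floor`, and a single diagonal Gaussian standing in for the acoustic model. All function and parameter names are hypothetical, and the record gives no details of the actual parameterization.

```python
import numpy as np

def ss_dereverb(power, alpha, delay=3, floor=0.01):
    """Toy spectral subtraction on a power spectrogram (frames x bins).

    Assumption: late reverberation is modelled as a scaled copy of the
    observed power `delay` frames earlier; the result is floored so it
    stays positive.
    """
    late = np.zeros_like(power)
    late[delay:] = alpha * power[:-delay]
    return np.maximum(power - late, floor * power)

def gaussian_loglik(feats, mean, var):
    """Total log-likelihood of feature frames under one diagonal Gaussian
    (a stand-in for a real HMM-based acoustic model)."""
    diff = feats - mean
    return float(np.sum(-0.5 * (np.log(2.0 * np.pi * var) + diff ** 2 / var)))

def select_alpha(power, mean, var, grid):
    """Grid search: keep the alpha whose dereverberated log-spectrum
    scores the highest likelihood under the model."""
    scores = [
        gaussian_loglik(np.log(ss_dereverb(power, a)), mean, var)
        for a in grid
    ]
    return float(grid[int(np.argmax(scores))]), scores

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.gamma(2.0, 1.0, size=(200, 16)) + 1e-3   # synthetic "clean" power
    reverberant = clean.copy()
    reverberant[3:] += 0.5 * clean[:-3]                   # synthetic late energy
    mean = np.log(clean).mean(axis=0)                     # "acoustic model" statistics
    var = np.log(clean).var(axis=0) + 1e-6
    best, _ = select_alpha(reverberant, mean, var, np.linspace(0.0, 1.0, 21))
    print("selected alpha:", best)
```

A grid search is used here only for simplicity. The abstract says the parameters are tuned jointly with acoustic model training and again at decoding time, which this sketch does not attempt to reproduce.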
Pages: 1259-1262
Number of pages: 4